wind shell Nov 24, 2022, 1:40 PM

#

it did more loops

#

but it's faster

rose schooner Nov 24, 2022, 1:41 PM

#

wind shell it did more loops

what did?

wind shell Nov 24, 2022, 1:41 PM

#

the first did less loops

#

actually faster

#

so, the conversation was good but i've gotta go.

#

bye

rose schooner Nov 24, 2022, 1:45 PM

#

wind shell the first did less loops

in timeit, the fewer loops the command runs, the slower the code is

#

just so it won't take too long to time something
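
The loop-count behaviour described above can be reproduced from Python with `timeit.Timer.autorange()`, which keeps increasing the loop count until the total run takes at least 0.2 seconds. This is only a sketch: the two statements are made-up stand-ins, not the code that was being timed in the conversation.

```python
import timeit


def loops_chosen(stmt):
    """Return (loops, total_seconds) that timeit's autorange picks for stmt."""
    return timeit.Timer(stmt).autorange()


# Hypothetical statements: one cheap, one roughly 100,000x more work per call.
fast_loops, fast_total = loops_chosen("sum(range(10))")
slow_loops, slow_total = loops_chosen("sum(range(1_000_000))")

# The per-loop time is what matters; the loop count only says how many
# repetitions were needed to fill the minimum measurement window.
print(f"fast: {fast_loops} loops, {fast_total / fast_loops:.2e} s per loop")
print(f"slow: {slow_loops} loops, {slow_total / slow_loops:.2e} s per loop")
```

A smaller loop count therefore points at slower code, but the number to compare is the time per loop.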

dusk comet Nov 24, 2022, 1:47 PM

#

Try it again with bigger initial fast_thing:
fast_thing=(1,)*1000

radiant garden Nov 24, 2022, 2:48 PM

#

testing performance with your shell time is like measuring the temperature of tea by tossing the thing into lava and seeing how quickly it evaporates

#

but also starting the stopwatch the moment you toss the liquid

#

it tells you something, and it might even be related to what you want to know!

quick trellis Nov 24, 2022, 7:27 PM

#

attempting to redefine function with modified ast; should this be enough?
func.__code__ = compile(tree, '<preprocessed>', 'exec')

quick trellis Nov 24, 2022, 7:48 PM

#

wait no that's dumb

#

or not
not sure if im losing info like this or what

#

maybe i need a way to compile only the FunctionDef node

#

is that possible

quick trellis Nov 24, 2022, 9:01 PM

#

^ solution: compile(tree, '<preprocessed>', 'exec').co_consts[0]
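
A minimal sketch of the idea above, assuming the function's source is available as a string. Compiling in `'exec'` mode yields a module-level code object, and the function's own code object sits among its constants. The transformer and the `add` function are hypothetical, and the lookup goes by type and name instead of relying on index 0.

```python
import ast
import types

SOURCE = """
def add(a, b):
    return a + b
"""


class SwapAddForMult(ast.NodeTransformer):
    """Hypothetical preprocessing step: turn every `+` into `*`."""

    def visit_BinOp(self, node):
        self.generic_visit(node)
        if isinstance(node.op, ast.Add):
            node.op = ast.Mult()
        return node


def add(a, b):
    return a + b


tree = ast.fix_missing_locations(SwapAddForMult().visit(ast.parse(SOURCE)))
module_code = compile(tree, "<preprocessed>", "exec")

# The module code object only *defines* the function; the function body is a
# nested code object stored in the module's constants.
func_code = next(
    const
    for const in module_code.co_consts
    if isinstance(const, types.CodeType) and const.co_name == "add"
)

add.__code__ = func_code
print(add(3, 4))
```

Assigning the module code object itself to `__code__` would make calling the function re-run the `def` statement instead of the body, which is why the nested object is needed.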

gray galleon Nov 25, 2022, 3:09 AM

#

does python have a frozen dict type

feral cedar Nov 25, 2022, 3:10 AM

#

no

gray galleon Nov 25, 2022, 3:12 AM

#

so if i have to store dicts in sets i have to convert it into an immutable type like tuple or frozen dataclass?

raven ridge Nov 25, 2022, 3:14 AM

#

yes
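
One way to do that conversion, sketched under the assumption that every value in the dicts is itself hashable: freeze each dict into a `frozenset` of its items. The `records` data and the `freeze` helper are made up for illustration.

```python
def freeze(mapping):
    """Return a hashable, order-insensitive snapshot of a dict."""
    return frozenset(mapping.items())


records = [
    {"id": 1, "tag": "a"},
    {"tag": "a", "id": 1},  # same content, different insertion order
    {"id": 2, "tag": "b"},
]

unique = {freeze(record) for record in records}

# A frozenset of (key, value) pairs converts straight back into a dict.
thawed = [dict(frozen) for frozen in unique]
print(len(unique), thawed)
```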

gray galleon Nov 25, 2022, 3:15 AM

#

~~how about making a hashable dict subclass~~

raven ridge Nov 25, 2022, 3:16 AM

#

I'm not sure - that might work

feral cedar Nov 25, 2022, 3:16 AM

#

i think there's a frozendict that uses a HAMT on pypi

gray galleon Nov 25, 2022, 3:19 AM

#

also its weird to think that not all collections in python are recursive data structures
sets cannot contain sets

feral cedar Nov 25, 2022, 3:19 AM

#

feral cedar i think there's a `frozendict` that uses a HAMT on pypi

ah, it's immutables on pypi. there's also frozendict which i think is just an immutable hashmap

median palm Nov 25, 2022, 3:22 AM

#

gray galleon also its weird to think that not all collections in python are recursive data st...

well, sets can only contain immutable/hashable types. A set is neither of those
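
A short sketch of that restriction and the usual workaround with `frozenset`:

```python
# A plain set is unhashable, so it cannot be an element of another set.
try:
    nested = {{1, 2}}
except TypeError as exc:
    error_message = str(exc)

# frozenset is the hashable counterpart and nests without trouble.
nested = {frozenset({1, 2}), frozenset({2, 1})}

print(error_message)
print(len(nested))       # both frozensets are equal, so only one is kept
print({1, 2} in nested)  # membership tests accept a plain set as the probe
```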

gray galleon Nov 25, 2022, 3:24 AM

#

median palm well, sets can only contain immutable/hashable types. A set is neither of those

ik
still weird and unintuitive to think about

unkempt rock Nov 25, 2022, 3:25 AM

#

hello everyone
AS a full stack developer, I have 6 years exp in python
If you have any problem, just feel free to ask

gray galleon Nov 25, 2022, 3:25 AM

#

unkempt rock hello everyone AS a full stack developer, I have 6 years exp in python If you ha...

not here

#

#python-discussion

gray galleon Nov 25, 2022, 3:59 AM

#

is there a reason why reduce is tucked in functools while map and filter isn’t

feral island Nov 25, 2022, 4:00 AM

#

gray galleon is there a reason why reduce is tucked in functools while map and filter isn’t

it was a builtin in Python 2 but got demoted because it was rarely used

gray galleon Nov 25, 2022, 4:01 AM

#

isn’t map and filter just as rarely used as reduce since comprehensions exist

feral island Nov 25, 2022, 4:07 AM

#

gray galleon isn’t map and filter just as rarely used as reduce since comprehensions exist

not in my experience. I see map() and (less so) filter() used pretty regularly, reduce() very rarely

spark magnet Nov 25, 2022, 4:07 AM

#

gray galleon isn’t map and filter just as rarely used as reduce since comprehensions exist

i use map sometimes, filter and reduce never.

feral island Nov 25, 2022, 4:08 AM

#

e.g. the mypy source code has about 11 uses of map(), none of filter and reduce

gray galleon Nov 25, 2022, 4:09 AM

#

i guess comprehensions and genexprs become more unwieldy when you introduce multiple collections so there is still merit in using map?

feral island Nov 25, 2022, 4:10 AM

#

if you want to map/filter using a named function, it's shorter to use map/filter instead of a comprehension

#

e.g. this one in mypy str_ver = ".".join(map(str, python_version))

#

the alternative would be something like ".".join(str(part) for part in python_version)

#

which is longer and requires you to come up with a variable name

gray galleon Nov 25, 2022, 4:12 AM

#

str(_) for _ in python_version 😎

gray galleon Nov 25, 2022, 5:44 AM

#

does python use chaining or open addressing for hash collisions

raven ridge Nov 25, 2022, 5:58 AM

#

gray galleon does python use chaining or open addressing for hash collisions

Open addressing
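
To illustrate what open addressing means, here is a toy fixed-size table that resolves collisions with linear probing. It is a simplification for illustration only: CPython's dict and set use a different probe sequence, and this sketch supports neither resizing nor deletion.

```python
class ProbingTable:
    """Toy hash set: colliding keys go into the next free slot of one array."""

    _EMPTY = object()

    def __init__(self, capacity=8):
        self._slots = [self._EMPTY] * capacity

    def _probe(self, key):
        """Yield slot indices starting at hash(key) % capacity, wrapping around."""
        start = hash(key) % len(self._slots)
        for offset in range(len(self._slots)):
            yield (start + offset) % len(self._slots)

    def add(self, key):
        """Store key and return the slot index it ended up in."""
        for index in self._probe(key):
            slot = self._slots[index]
            if slot is self._EMPTY:
                self._slots[index] = key
                return index
            if slot == key:
                return index
        raise RuntimeError("table is full")

    def __contains__(self, key):
        for index in self._probe(key):
            slot = self._slots[index]
            if slot is self._EMPTY:
                return False  # an empty slot ends the probe sequence
            if slot == key:
                return True
        return False


table = ProbingTable(capacity=8)
first = table.add(1)   # hash(1) % 8 == 1
second = table.add(9)  # hash(9) % 8 == 1 as well, so it moves on to slot 2
print(first, second, 9 in table, 17 in table)
```

With chaining, the second key would instead be appended to a per-bucket list at slot 1.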

quick snow Nov 25, 2022, 6:14 AM

#

gray galleon does python have a frozen dict type

!pep 603

fallen slateBOT Nov 25, 2022, 6:14 AM

#

**PEP 603 - Adding a frozenmap type to collections**

Status: Draft
Created: 12-Sep-2019
Type: Standards Track

native flame Nov 25, 2022, 6:34 AM

#

that looks interesting

dusk comet Nov 25, 2022, 7:33 AM

#

feral cedar i think there's a `frozendict` that uses a HAMT on pypi

There is also builtin HAMT class, but you cant access it

dusk comet Nov 25, 2022, 7:34 AM

#

feral island it was a builtin in Python 2 but got demoted because it was rarely used

I think the most rare builtin is "ascii"

#

Also "oct" and "memoryview"

grave jolt Nov 25, 2022, 7:48 AM

#

also id

dusk comet Nov 25, 2022, 9:04 AM

#

id is useful for debugging

rose schooner Nov 25, 2022, 10:17 AM

#

why shouldn't set be hashable? it's the only built-in iterable that automatically removes duplicates and stores only hashable elements

#

i get that it's mutable but that doesn't give a good enough reason as to why it shouldn't be hashable

#

can we not do dynamic hashing like
```py
# python pseudocode; added .last_hashed_size field

def set_hash(self):
    if self.hash == -1 or len(self) != self.last_hashed_size:
        ...  # hash the set
        self.last_hashed_size = len(self)
    return self.hash
```

flat gazelle Nov 25, 2022, 10:22 AM

#

there is frozenset

#

Hashing set by contents doesn't make sense for the same reason hashing a list by contents doesn't make sense

rose schooner Nov 25, 2022, 10:27 AM

#

flat gazelle there is frozenset

yeah but i'll have to unfreeze and freeze it back again to put it in a set

elder blade Nov 25, 2022, 10:29 AM

#

That's a good thing, because now it is clear in the code that you'll need to reinsert it wherever it is stored

flat gazelle Nov 25, 2022, 10:31 AM

#

it is more or less impossible to write a hash-based data structure that can handle the hash of its contained elements changing

#

the hash being stable is fairly important since that's what decides where in the datastructure the thing goes

dusk comet Nov 25, 2022, 10:35 AM

#

rose schooner why shouldn't `set` be hashable? it's the only built-in iterable that automatica...

Hash of object must not change during lifetime of object.
Also equality must imply equality of hashes.

Also hashes should be good to get fewer hash collisions (so hash===42 is not a good idea)

Can you satisfy these requirements for set hashes?

#

If hash changes, you will get weird behaviour (segfault, memory leak, idk)

rose schooner Nov 25, 2022, 10:38 AM

#

dusk comet If hash changes, you will get weird behaviour (segfault, memory leak, idk)

assuming people don't go through the proper APIs, yes

#

after considering the implementation of set and dict i've concluded that the worst that can probably happen is duplication of set-type elements

flat gazelle Nov 25, 2022, 10:47 AM

#

I doubt you can segfault, but it will not actually work

quick snow Nov 25, 2022, 10:51 AM

#

!e syntactic sugar omitted for brevity

from functools import reduce
from operator import xor
class Set:
    def __init__(self, elements):
        self.elements = set(elements)
    def __hash__(self):
        return reduce(xor, self.elements)

d = {}
s = Set((2,3,4))
d[s] = 42
s.elements.add(1)
print(d[s])

fallen slateBOT Nov 25, 2022, 10:51 AM

#

@quick snow :x: Your 3.11 eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 13, in <module>
003 | KeyError: <__main__.Set object at 0x7f2ae91b05d0>

wind shell Nov 25, 2022, 11:04 AM

#

I don't really know why PEP-8 specifies that the space between function has to be 2 lines. I don't like it

flat gazelle Nov 25, 2022, 11:19 AM

#

!e
```py
from functools import reduce
from operator import xor
class Set:
    def __init__(self, elements):
        self.elements = set(elements)
    def __hash__(self):
        return reduce(xor, self.elements)

d = {}
s = Set((2,3,4))
d[s] = 42
s.elements.add(1)
s2 = Set((1,2,3,4))
print(d[s2])
```
this also won't work

fallen slateBOT Nov 25, 2022, 11:19 AM

#

@flat gazelle :x: Your 3.11 eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 14, in <module>
003 | KeyError: <__main__.Set object at 0x7f7cfdc80690>

dusk comet Nov 25, 2022, 11:52 AM

#

wind shell I don't really know why PEP-8 specifies that the space between function has to b...

IMO, it is good unless you have a lot of similar short methods

dusk comet Nov 25, 2022, 11:54 AM

#

rose schooner assuming people don't go through the proper APIs, yes

Is dict and set behaviour proper? What I said is written in the documentation

rose schooner Nov 25, 2022, 12:01 PM

#

dusk comet Is `dict` and `set` behaviour proper? What I said is written in the documentatio...

so far i don't see the possibility of crashes

halcyon trail Nov 25, 2022, 12:31 PM

#

rose schooner so far i don't see the possibility of crashes

Idk if crashing is possible but you can simply screw up the data structure

#

You won't be able to retrieve the object, which makes the data structure useless

rose schooner Nov 25, 2022, 12:32 PM

#

wdym by "screw up the data structure"

#

is that the issue i said earlier

#

after considering the implementation of set and dict i've concluded that the worst that can probably happen is duplication of set-type elements

halcyon trail Nov 25, 2022, 12:33 PM

#

If you mutate a key while it's in a dict

#

You can duplicate but obviously part of that is also that you won't ever find the original element

rose schooner Nov 25, 2022, 12:34 PM

#

halcyon trail You can duplicate but obviously part of that is also that you won't ever find th...

i forgot to mention that

halcyon trail Nov 25, 2022, 12:34 PM

#

Well that defeats the whole purpose of the data structure

flat gazelle Nov 25, 2022, 12:35 PM

#

the only correct way to do this operation is to remove the element and rehash and reinsert it.

halcyon trail Nov 25, 2022, 12:35 PM

#

Seems like a good enough reason to disallow mutable keys

flat gazelle Nov 25, 2022, 12:35 PM

#

which you can already do with frozenset

halcyon trail Nov 25, 2022, 12:35 PM

#

Right

#

In languages with mutation control like C++ and rust

#

Mutable types can be keys

#

They just prevent mutation while it's in the dict

#

Python doesn't have that option

rose schooner Nov 25, 2022, 12:36 PM

#

flat gazelle which you can already do with frozenset

that does require unfreezing and refreezing which is not optimal for large sets

flat gazelle Nov 25, 2022, 12:37 PM

#

quick snow !e syntactic sugar omitted for brevity ```py from functools import reduce from o...

you could do this and manually make sure you do the operations correctly

#

that is, remove, mutate, readd

#

if you can't afford the copies
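
The remove, mutate, re-add sequence could be wrapped in a helper along these lines. This is a sketch under assumptions: `HashableSet` and `mutate_key` are hypothetical names, and the class adds an `__eq__` and a starting value for `reduce` that the snippet from the conversation did not have.

```python
from functools import reduce
from operator import xor


class HashableSet:
    """Mutable set wrapper whose hash follows its current contents."""

    def __init__(self, elements):
        self.elements = set(elements)

    def __hash__(self):
        return reduce(xor, (hash(e) for e in self.elements), 0)

    def __eq__(self, other):
        return isinstance(other, HashableSet) and self.elements == other.elements


def mutate_key(mapping, key, mutate):
    """Change a key that lives in a dict without losing its entry."""
    value = mapping.pop(key)  # remove while the stored hash is still valid
    mutate(key)               # the key's hash may change here
    mapping[key] = value      # re-insert under the new hash


s = HashableSet((2, 3, 4))
d = {s: 42}
mutate_key(d, s, lambda key: key.elements.add(1))

print(d[s], d[HashableSet((1, 2, 3, 4))])
```

Nothing stops other code from mutating `s.elements` directly, and if `mutate` raises, the entry is gone, so the discipline still has to be enforced by hand.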

rose schooner Nov 25, 2022, 12:37 PM

#

yeah that's probably faster

flat gazelle Nov 25, 2022, 12:39 PM

#

gods that is a minefield upon thinking about it

#

good luck

halcyon trail Nov 25, 2022, 12:53 PM

#

I think Java and Kotlin don't try to enforce this the way python does

#

I'm not sure if it's that big a minefield

#

But it is definitely a minefield. Only the size is in question 🙂

flat gazelle Nov 25, 2022, 1:06 PM

#

quick snow !e syntactic sugar omitted for brevity ```py from functools import reduce from o...

yeah, java gives you the same behaviour as the above, just for standard collections. Different ways, the java option is a hair more powerful, but also more error prone.

feral island Nov 25, 2022, 2:42 PM

#

halcyon trail Idk if crashing is possible but you can simply screw up the data structure

If you can crash CPython this way that would be a bug. I believe the effect is simply that you can't retrieve the element by key and checks like `in` could be wrong

#

(because those are going to look in the wrong hash table bucket)

halcyon trail Nov 25, 2022, 2:54 PM

#

Right

halcyon trail Nov 25, 2022, 3:09 PM

#

I guess crashing cpython with python code is always a bug

#

Hard for me to get used to the managed language mentality 😛

feral island Nov 25, 2022, 3:18 PM

#

halcyon trail I guess crashing cpython with python code is always a bug

with some exceptions, e.g. ctypes and constructing code objects manually

#

and sys.setrecursionlimit(100000000)

halcyon trail Nov 25, 2022, 3:23 PM

#

I'm not familiar with code objects

#

For setrecursion limits, sure, same with forcing enormous allocations, manually sending in signals into the python interpreter

feral island Nov 25, 2022, 3:25 PM

#

enormous allocations should just fail with MemoryError

rose schooner Nov 25, 2022, 3:31 PM

#

halcyon trail For setrecursion limits, sure, same with forcing enormous allocations, manually ...

it's probably impossible to crash with enormous allocations due to bounds checks in every part of the interpreter that allocates memory

rose schooner Nov 25, 2022, 3:32 PM

#

halcyon trail I'm not familiar with code objects

!e
```py
exec((lambda:0).__code__.replace(co_code=b'\1\0'))
```

fallen slateBOT Nov 25, 2022, 3:32 PM

#

@rose schooner :warning: Your 3.11 eval job has completed with return code 139 (SIGSEGV).

[No output]

halcyon trail Nov 25, 2022, 4:22 PM

#

I don't see how you can avoid memory failures

#

Linux doesn't tell you that you allocate more memory than is available by default

#

Allocate a lot of memory and the OS will just eventually kill your program

#

Windows is different

radiant garden Nov 25, 2022, 4:32 PM

#

if creating a MemoryError fails with oom you're probably in trouble and are about to get sniped by the OS

#

But I'm not sure that counts as crashing as opposed to getting killed externally

flat gazelle Nov 25, 2022, 4:36 PM

#

The way the Linux kernel works by default is by letting you allocate more memory than is available, and if you use too much of it, you get killed. To my knowledge you can't actually stop that from happening as a process

rose schooner Nov 25, 2022, 4:49 PM

#

halcyon trail I don't see how you can avoid memory failures

check it

flat gazelle Nov 25, 2022, 4:50 PM

#

The kernel won't tell you that you are overallocating and will die soon.

#

At least afaik

quick snow Nov 25, 2022, 5:07 PM

#

You can definitely get MemoryErrors on Linux. malloc() can return NULL, and when the OOM killer gets you is a configuration thing IIRC

radiant garden Nov 25, 2022, 5:53 PM

#

You'll raise it if you try to allocate something silly

halcyon trail Nov 25, 2022, 6:07 PM

#

When does python raise MemoryError

#

Is it based on something other than malloc returning null

quick snow Nov 25, 2022, 6:51 PM

#

@rose schooner et al.: It just so happens dabeaz just posted on his mailing list about just this topic (hashability of mutable data structures): https://tinyletter.com/dabeaz/letters/dangerous-flexibility

TinyLetter

Dangerous Flexibility

In the recent book "Software Design for Flexibility ", the authors make reference to the fact that they're going to explore ...

halcyon trail Nov 25, 2022, 7:26 PM

#

I don't disagree it can be useful

#

But not nearly that often I think

grave jolt Nov 25, 2022, 8:27 PM

#

quick snow @rose schooner et al.: It just so happens dabeaz just posted on his maili...

I get a 406 Not Acceptable on the stylesheet 😦

raven ridge Nov 25, 2022, 8:57 PM

#

flat gazelle The kernel won't tell you that you are overallocating and will die soon.

You can configure the kernel to not overcommit memory, and I believe if you do then the OOM killer is never invoked because there's never a need to free memory owned by a userspace process

flat gazelle Nov 25, 2022, 9:03 PM

#

indeed

#

you can get a memory error on linux by default via something silly like [0] * 2 ** 40
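
A small sketch of catching that error. It swaps in `sys.maxsize` as the length instead of `2 ** 40`, an assumption made so the request is rejected by the interpreter's own size check and the outcome does not depend on the machine's memory or overcommit settings. `try_allocate` is a hypothetical helper.

```python
import sys


def try_allocate(length):
    """Return the list length on success, or None if Python raised MemoryError."""
    try:
        data = [0] * length
    except MemoryError:
        return None
    return len(data)


small = try_allocate(1_000)
huge = try_allocate(sys.maxsize)  # far beyond anything addressable

print(small, huge)
```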

brave ore Nov 26, 2022, 3:36 AM

#

halcyon trail Windows is different

Does Windows die along with the process? 🤔

full parcel Nov 26, 2022, 4:03 AM

#

Is there any function related programing in python please suggest me best online website

quick snow Nov 26, 2022, 5:49 AM

#

brave ore Does Windows die along with the process? 🤔

No, Windows just refuses to allocate memory for you if there isn't enough space. It doesn't kill the process either.

raven ridge Nov 26, 2022, 6:27 AM

#

that's opt-in behavior on Linux, too. https://www.kernel.org/doc/Documentation/vm/overcommit-accounting

#

it's usually disabled in favor of overcommitting, though, because fork makes it extremely easy for a child process to inherit many gigabytes of memory that it has no intention of actually using. In practice, overcommitting is usually the better call for a POSIX system. That rationale doesn't apply to Windows, though, because it has CreateProcess instead of fork+exec

gray galleon Nov 26, 2022, 9:24 AM

#

full parcel Is there any function related programing in python please suggest me best online...

functional programming is kinda discouraged in python

#

but its still an option

gray galleon Nov 26, 2022, 9:30 AM

#

full parcel Is there any function related programing in python please suggest me best online...

https://realpython.com/python-functional-programming/

Functional Programming in Python: When and How to Use It – Real Pyt...

In this tutorial, you'll learn about functional programming in Python. You'll see what functional programming is, how it's supported in Python, and how you can use it in your Python code.

gusty vessel Nov 26, 2022, 10:26 AM

#

fwewe

#

few

tawdry pond Nov 26, 2022, 3:08 PM

#

gray galleon https://realpython.com/python-functional-programming/

RealPython also gives you some free courses if you sign up and one of them is a functional programming course

amber nexus Nov 27, 2022, 12:07 AM

#

Speaking of realpython and internals, I'm not sure if this is the case in other countries but the CPython internals book is currently 10 dollars for a physical copy on Amazon Australia https://www.amazon.com.au/CPython-Internals-Guide-Python-Interpreter/dp/1775093344/ref=asc_df_1775093344/, very good deal if it's available to you as well

CPython Internals: Your Guide to the Python 3 Interpreter

rose schooner Nov 27, 2022, 12:08 AM

#

amber nexus Speaking of realpython and internals, I'm not sure if this is the case in other ...

who buys a book about something that can change from time to time

#

3.9 is like 2 years ago

amber nexus Nov 27, 2022, 12:09 AM

#

For 10 bucks I think that's a very solid deal regardless, yeah the interpreter changes but a lot of the fundamentals are remaining the same

rose schooner Nov 27, 2022, 12:13 AM

#

amber nexus For 10 bucks I think that's a very solid deal regardless, yeah the interpreter c...

10 bucks is like a month's worth of lunch money for me

amber nexus Nov 27, 2022, 12:13 AM

#

Different economies I suppose 👀

white wren Nov 27, 2022, 12:14 AM

#

Hey

paper echo Nov 27, 2022, 3:35 AM

#

rose schooner who buys a book about something that can change from time to time

this is always my hangup about programming books

#

i'm happy to spend $15 on an e-book or donate to someone making good blog content, but i don't really want to purchase a physical book that will go out of date soon

broken sluice Nov 27, 2022, 7:54 AM

#

Is there a forum here to discuss small change proposals to cpython internals, or even python's requirements as a language? I have a change I want to have an online discussion about, the scope is quite small though so posting a PEP would seem like too much red tape for what it does

#

are there core devs present on discord?

quick snow Nov 27, 2022, 8:09 AM

#

broken sluice Is there a forum here to discuss small change proposals to cpython internals, or...

This is the right place for discussing something like that. A non-zero amount of core devs at least occasionally look into this channel.

charred wagon Nov 27, 2022, 8:46 AM

#

broken sluice Is there a forum here to discuss small change proposals to cpython internals, or...

You can also make a post here https://discuss.python.org/ as something in between Discord and writing a PEP

broken sluice Nov 27, 2022, 8:56 AM

#

I'll make a publicly shared doc - where I could keep track of all the counterpoints made, and respond to them just once. Let's try that at least ... I will post the link to the doc when I'm done writing it

broken sluice Nov 27, 2022, 10:25 AM

#

https://docs.google.com/document/d/1et5x5HckTJhUQsz2lcC1avQrgDufXFnHMin7GlI5XPI/edit?usp=sharing
It is shared for viewing and commenting
I'm open for a discussion in here or comments on the doc... thanks!

Google Docs

Allowing or requiring None to hash to a persistent value

Preface / meta First I will make the case for the proposal in the title, out of first principles, then I’ll list in a separate section the counter-arguments I’ve seen so far, and my replies to them. Managing this as a doc I’ve implemented this change as a PR to CPython and that was promptly clo...

halcyon trail Nov 27, 2022, 1:40 PM

#

broken sluice https://docs.google.com/document/d/1et5x5HckTJhUQsz2lcC1avQrgDufXFnHMin7GlI5XPI/...

I mean it's very surprising that setting the python hash seed doesn't make the hash of None deterministic

#

I would just figure out if there's a reason for that

#

The whole purpose of that variable is to make hashing deterministic for testing or reproducibility purposes

#

No obvious reason to exclude None

flat gazelle Nov 27, 2022, 1:44 PM

#

for the same reason object() and function hashes are non-deterministic, it's just computed off of the memory address, rather than any deterministic value. Which I do agree is kind of dumb.

broken sluice Nov 27, 2022, 1:48 PM

#

I do mention the possibility of making the hash not constant, but rather a deterministic function of the hash secret (which makes it so if you specify PYTHONHASHSEED, it will be constant across your runs).
That's arguably a better choice from a practical POV

halcyon trail Nov 27, 2022, 1:52 PM

#

flat gazelle for the same reason `object()` and function hashes are non-deterministic, it's j...

Ah, interesting

#

It didn't even occur to me they would apply that to None

#

I'm not really a fan of allowing hash by address to start with
Most languages don't allow it

dusk comet Nov 27, 2022, 1:54 PM

#

im getting same results in every interpreter launch
im not using PYTHONHASHSEED
3.11

grave jolt Nov 27, 2022, 1:54 PM

#

that's because None happens to be on the same address every time

broken sluice Nov 27, 2022, 1:55 PM

#

Yes. the values are different every run only on systems that apply ASLR

halcyon trail Nov 27, 2022, 1:55 PM

#

It's not coincidence

grave jolt Nov 27, 2022, 1:55 PM

#

For me the values are different, I'm on Linux

halcyon trail Nov 27, 2022, 1:55 PM

#

Those sessions are open at the same time it looks like

#

The binary only gets loaded into memory once when you run an executable multiple times

broken sluice Nov 27, 2022, 1:56 PM

#

but ASLR is a pretty important infosec feature, so "disable it if you want your hashes to be stable" is a pretty bad argument

grave jolt Nov 27, 2022, 1:56 PM

#

.wiki ASLR

neon troutBOT Nov 27, 2022, 1:56 PM

#

Wikipedia Search Results

Address space layout randomization
Address space layout randomization (ASLR) is a computer security technique involved in preventing exploitation of memory corruption vulnerabilities. In

IOS
Darwin 21. iOS 16 is based on Darwin 22. In iOS 6 the kernel is subject to ASLR, similar to that of OS X Mountain Lion. This makes exploit possibilities

halcyon trail Nov 27, 2022, 1:56 PM

#

Yeah no argument

grave jolt Nov 27, 2022, 1:56 PM

#

ah icic

dusk comet Nov 27, 2022, 1:57 PM

#

>py -c "print(hash(None))"
-9223363242347854132

ok, now it is different

halcyon trail Nov 27, 2022, 1:57 PM

#

Did you close all the sessions and relaunch?

grave jolt Nov 27, 2022, 1:57 PM

#

flat gazelle for the same reason `object()` and function hashes are non-deterministic, it's j...

how would you do it for an arbitrary object() though?

#

store some random piece of data?

dusk comet Nov 27, 2022, 1:58 PM

#

grave jolt how would you do it for an arbitrary `object()` though?

__hash__ = id

halcyon trail Nov 27, 2022, 1:58 PM

#

You wouldn't, folks who use address hashing deserve what they get 😛

flat gazelle Nov 27, 2022, 1:58 PM

#

I meant specifically for None, of course object() will have a non-deterministic hash no matter what

grave jolt Nov 27, 2022, 1:58 PM

#

dusk comet `__hash__ = id`

that's literally what lakmatiol said is bad

#

ah

#

I misunderstood then

flat gazelle Nov 27, 2022, 1:58 PM

#

same with user-defined functions

dusk comet Nov 27, 2022, 2:00 PM

#

>>> x = object()
>>> hex(id(x))
'0x189ab5d5070'
>>> hex(hash(x))
'0x189ab5d507'
>>> assert hash(x) == id(x) // 0x10

dusk comet Nov 27, 2022, 2:00 PM

#

grave jolt I misunderstood then

same with classes

halcyon trail Nov 27, 2022, 2:00 PM

#

Java has the same default hash implementation

#

Smh

broken sluice Nov 27, 2022, 2:01 PM

#

id and object.__hash__ are both deterministically calculated from the object's memory address, but not in the same way

grave jolt Nov 27, 2022, 2:02 PM

#

flat gazelle I meant specifically for None, of course object() will have a non-deterministic ...

I guess it could be problematic if CPython wanted to move objects around in memory

#

but that would break like... everything innit

flat gazelle Nov 27, 2022, 2:02 PM

#

yeah, that's probably not on the table

#

considering all of CPython is built on passing pointers around

grave jolt Nov 27, 2022, 2:02 PM

#

your table is very non-deterministic

broken sluice Nov 27, 2022, 2:03 PM

#

Py_hash_t
_Py_HashPointerRaw(const void *p)
{
    size_t y = (size_t)p;
    /* bottom 3 or 4 bits are likely to be 0; rotate y by 4 to avoid
       excessive hash collisions for dicts and sets */
    y = (y >> 4) | (y << (8 * SIZEOF_VOID_P - 4));
    return (Py_hash_t)y;
}

and I think id simply returns the address itself, but I'm not sure
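
A rough Python transcription of the C function above, to make the rotation concrete. It assumes CPython, where `id()` is the object's memory address and the default `object.__hash__` goes through this pointer hash; `pointer_hash` is a hypothetical name, and the final `-1` to `-2` adjustment mirrors what CPython does for hash values in general.

```python
import struct
import sys

POINTER_BITS = struct.calcsize("P") * 8  # 8 * SIZEOF_VOID_P
MASK = (1 << POINTER_BITS) - 1


def pointer_hash(address):
    """Rotate the address right by 4 bits and reinterpret it as a signed integer."""
    y = address & MASK
    y = ((y >> 4) | (y << (POINTER_BITS - 4))) & MASK
    if y >= 1 << (POINTER_BITS - 1):  # emulate the cast to a signed Py_hash_t
        y -= 1 << POINTER_BITS
    return -2 if y == -1 else y       # -1 is reserved as an error marker


obj = object()
matches = pointer_hash(id(obj)) == hash(obj)
print(sys.implementation.name, matches)
```

When the low four bits of the address are zero, the rotation is the same as dividing by 16, which is the `id(x) // 0x10` relationship shown earlier.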

#

@grave jolt : if CPython moved objects in memory it would alter their id and hash under current implementation, thus break correctness

grave jolt Nov 27, 2022, 2:04 PM

#

yeah that's what I meant

#

well, that would be the least of the problems

broken sluice Nov 27, 2022, 2:06 PM

#

In languages where objects are allowed to move, something like id has to be stored inside the object (and even then, taking the initial address as the id is wrong, because something else could be allocated there after the original is moved away)

spark magnet Nov 27, 2022, 2:07 PM

#

broken sluice In languages where objects are allowed to move, something like id has to be stor...

PyPy uses something other than address for id() iirc

dusk comet Nov 27, 2022, 2:13 PM

#

where is hash(None) implemented? i can't find it
tp_hash is 0 here: https://github.com/python/cpython/blob/main/Objects/object.c#L1678-L1692

broken sluice Nov 27, 2022, 2:13 PM

#

It uses the same hash as object, if tp_hash is 0, it will call Py_HashPointer (from what I can tell)

#

So assuming there is merit to my proposal, how am I supposed to actually make it? It got shot down horribly on the forum, maybe because my initial arguments for it were weaker.
On github, a core developer just closed my PR and the issue, and that was it.

I don't know what else can be done at this point

rose schooner Nov 27, 2022, 2:20 PM

#

broken sluice So assuming there is merit to my proposal, how am I supposed to actually make it...

this issue? https://github.com/python/cpython/issues/99540

GitHub

Constant hash value for None · Issue #99540 · python/cpython

Feature or enhancement Fix hash(None) to a constant value. Pitch (Updated 2022.11.18) Under current behavior, the runtime leaks the ASLR offset, since the original address of the None singleton is ...

broken sluice Nov 27, 2022, 2:21 PM

#

Yes

dusk comet Nov 27, 2022, 2:23 PM

#

broken sluice It uses the same hash as object, if tp_hash is 0, it will call Py_HashPointer (f...

found it: https://github.com/python/cpython/blob/main/Python/pyhash.c#L137-L155

spark magnet Nov 27, 2022, 2:24 PM

#

@broken sluice I think the general objection will be that hash() was never intended to be useful beyond the current process. If you need something like that, you need to implement it with the guarantees you want.

broken sluice Nov 27, 2022, 2:25 PM

#

But if that is true, why does the PYTHONHASHSEED feature exist? and it is not deprecated ... and people rely on it for debug/research purposes

#

and as I said in the doc, you can't implement your own hashing strategy in Python

spark magnet Nov 27, 2022, 2:26 PM

#

broken sluice But if that is true, why does the `PYTHONHASHSEED` feature exist? and it is not ...

PYTHONHASHSEED might have been a stop-gap when randomization was introduced, to make dict iteration consistent.

broken sluice Nov 27, 2022, 2:26 PM

#

the builtin containers do not accept a hasher as an outside parameter

spark magnet Nov 27, 2022, 2:27 PM

#

broken sluice the builtin containers do not accept a hasher as an outside parameter

sorry, i haven't read the doc, so I'm not sure what your use case is. I hash data structures, but not for insertion into dicts: https://github.com/nedbat/coveragepy/blob/master/coverage/misc.py#L227-L264

broken sluice Nov 27, 2022, 2:28 PM

#

besides, is it really a counter argument?
I mean sure, Python isn't obliged to support this, but this seems like going out of its way needlessly to break something, at face value

spark magnet Nov 27, 2022, 2:29 PM

#

broken sluice besides, is it really a counter argument? I mean sure, Python isn't obliged to s...

not sure what you mean by "going out of its way": None inherits the hash from object.

broken sluice Nov 27, 2022, 2:29 PM

#

Consider my example:

KeyType1 = tuple[int] | tuple[int, int]

@dataclass(frozen=True)
class KeyType2:
    foo_id: int
    bar_id: Optional[int]

does it make sense that one of these key types hashes deterministically and one doesn't?
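
A sketch of why the `Optional` field matters for the question above. On CPython, the `__hash__` generated for a frozen dataclass hashes a tuple of its fields, so whatever `hash(None)` returns feeds directly into the key's hash. That is an implementation detail, not a documented guarantee.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class KeyType2:
    foo_id: int
    bar_id: Optional[int]


key_with_none = KeyType2(1, None)

# Assumption: the generated __hash__ is equivalent to hashing the field tuple.
same_as_tuple = hash(key_with_none) == hash((1, None))
print(same_as_tuple)
```

A key made only of ints hashes the same way in every process, while a key containing `None` inherits whatever determinism `hash(None)` has on the running interpreter.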

#

well, the doc explains why I think that hashing a monostate variable to a constant is the "common sense" thing to do. That's why I said going out of its way. But in terms of implementation you are right, it simply inherits from object

spark magnet Nov 27, 2022, 2:31 PM

#

generally, changes to Python need to have real-world use cases rather than "it makes sense" arguments. Again, you might have them in the doc, I haven't read it.

feral island Nov 27, 2022, 2:34 PM

#

broken sluice So assuming there is merit to my proposal, how am I supposed to actually make it...

Start a discussion on discuss.python.org, referencing the closed issue and other previous discussion. But I can't guarantee there will be any more enthusiasm for the idea than before

halcyon trail Nov 27, 2022, 2:42 PM

#

spark magnet generally, changes to Python need to have real-world use cases rather than "it m...

The use case for something like this is to make regression tests reproducible

#

I'm a bit confused because this is such a standard technique

#

Regression tests and even unit tests commonly use a randomized seed for say rng which they log. If you have a failure you want to reproduce, you rerun it feeding the logged seed
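
The logged-seed technique described above can be sketched roughly like this. The names `make_rng` and `run_scenario` are hypothetical, and a real test framework would take the seed from a command-line option rather than a function argument.

```python
import os
import random


def make_rng(seed=None):
    """Return (seed, rng); draw a fresh seed from the OS when none is given."""
    if seed is None:
        seed = int.from_bytes(os.urandom(8), "big")
    print(f"random seed: {seed}")  # log it so a failing run can be replayed
    return seed, random.Random(seed)


def run_scenario(rng):
    """Stand-in for a test body that consumes random numbers."""
    return [rng.randint(0, 999) for _ in range(5)]


# First run: the seed is random, but it gets logged.
seed, rng = make_rng()
original = run_scenario(rng)

# Replay: feed the logged seed back in and get the same sequence.
_, replay_rng = make_rng(seed)
replayed = run_scenario(replay_rng)
print(original == replayed)
```

The argument in the thread is that `PYTHONHASHSEED` plays the same role for hashing that `seed` plays here.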

#

Granted that things which use address hashing will not be reproducible regardless, but many things will be. Anything that has a notion of value semantics, structural equality, etc

#

And optional fits into that

#

(rather, none fits into that but that's usually seen in an "optional" context)

spark magnet Nov 27, 2022, 2:56 PM

#

just to be clear, I am not arguing against this proposal. I'm trying to help explain the core dev mindset, to help @broken sluice

halcyon trail Nov 27, 2022, 3:05 PM

#

Sure, I'm just saying, the use case seems pretty clear, no?

#

Reproducibility always matters

#

Salting hashes makes sense for prod, in testing it's ok to salt hashes but you need to be able to reproduce any test run as closely as possible

#

I would have assumed that that's the purpose of pythonhashseed rather than a stopgap

#

Sets for example still don't have defined iteration order, it's easy to write code that accidentally depends on set iteration order. Say you do and your tests usually pass, now one night they happen to fail. It's not great if you can't reproduce that quickly and easily

spark magnet Nov 27, 2022, 3:09 PM

#

I'd be in favor of making the change

feral island Nov 27, 2022, 3:10 PM

#

making only hash(None) reproducible isn't a very thorough solution for that problem though. Hashes of e.g. types and function objects are still nondeterministic

halcyon trail Nov 27, 2022, 3:10 PM

#

Sure, you can't solve that problem completely if you've chosen to rely on identity based hashing

#

But it's pretty debatable to say that None is an identity based type

#

To put it mildly

#

None being a Singleton is an implementation detail

spark magnet Nov 27, 2022, 3:11 PM

#

it's not an implementation detail, it's important that x is None behave the right way.

#

but being a singleton is a good reason for it to have a fixed hash

halcyon trail Nov 27, 2022, 3:12 PM

#

It's a monostate type

spark magnet Nov 27, 2022, 3:12 PM

#

how is that different than singleton?

halcyon trail Nov 27, 2022, 3:13 PM

#

Singleton is an implementation detail of being a monostate type, in most cases.

#

Python chose to make is None idiomatic

spark magnet Nov 27, 2022, 3:13 PM

#

can you explain the difference between singleton and monostate?

halcyon trail Nov 27, 2022, 3:13 PM

#

A monostate is a type with only one value

#

A Singleton is a type with only one instance
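
To make the distinction concrete, here is a small sketch. The `Unit` class is hypothetical and is not part of Python; it only illustrates a type with one value but many possible instances, next to `None`, which has exactly one instance.

```python
class Unit:
    """Monostate sketch: any number of instances, all with the same (empty) state."""
    __slots__ = ()

    def __eq__(self, other):
        return isinstance(other, Unit)

    def __hash__(self):
        return 0x5EED  # arbitrary constant: equal values must hash equally

    def __repr__(self):
        return "Unit()"


a, b = Unit(), Unit()
print(a is b)                      # two distinct instances
print(a == b, hash(a) == hash(b))  # but a single value

# None is a singleton: "constructing" NoneType hands back the same object.
print(type(None)() is None)
```

With value equality and a constant hash, it does not matter how many instances of a monostate type exist, which is the sense in which being a singleton is called an implementation detail above.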

spark magnet Nov 27, 2022, 3:14 PM

#

ok, do we agree that you can't have a Python where None isn't a singleton?

halcyon trail Nov 27, 2022, 3:14 PM

#

Well, because of breaking is None checks, sure

#

Not for any other reason I can think of

#

If python just did == None which is generally what you see, then it would truly be an implementation detail

#

My main point I guess is that None usually shows up as a value as part of an implied Optional semantic

#

Since always having None isn't very interesting

#

And Optional[T] is something that has structural equality if T does, without fail

dusk comet Nov 27, 2022, 3:18 PM

#

broken sluice Consider my example: ``` KeyType1 = tuple[int] | tuple[int, int] @dataclass(fr...

Hashes makes sense only in the same process. If you are saving hash and using it in other process, hashes are not obliged to mean anything useful

halcyon trail Nov 27, 2022, 3:18 PM

#

And structural hashing, generally, as well

grave jolt Nov 27, 2022, 3:18 PM

#

actually, why did is None become idiomatic? pithink

halcyon trail Nov 27, 2022, 3:18 PM

#

Idk. The funny thing is that it was encouraged, but it's arguably pretty terrible

#

Is None cannot be overloaded

#

But I don't think that argument is compelling

flat gazelle Nov 27, 2022, 3:22 PM

#

there are types like numpy arrays which implement == elementwise

#

so probably for those
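
A sketch of why `== None` can misbehave with such types. `Vec` is a made-up stand-in that compares elementwise, loosely in the spirit of a numpy array, so that the example does not need numpy installed.

```python
class Vec:
    """Toy container whose == compares element by element."""

    def __init__(self, *items):
        self.items = list(items)

    def __eq__(self, other):
        # Elementwise comparison against a scalar returns a list, not a bool.
        return [item == other for item in self.items]


v = Vec(1, None, 3)
print(v == None)  # [False, True, False], and a non-empty list is truthy
print(v is None)  # False; identity cannot be overloaded
```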

halcyon trail Nov 27, 2022, 3:23 PM

#

Fair point I suppose

#

I was going to say that performance reasons are the main reason to use identity rather than structural equality for None

broken sluice Nov 27, 2022, 3:35 PM

#

spark magnet how is that different than singleton?

can you explain the difference between singleton and monostate?

A monostate type is a type whose instances can have only a single possible state
In Python since all values are held by references, the only thing that makes sense is to store such a type in a singleton. But that's not generally the case in all other languages.

Regarding hashing, the common sense thing to do when hashing a monostate type is to return a constant. That's what I'm arguing, at least.
I am saying that the Optional "None" type (i.e. a disengaged Optional) is a monotype variable - hence the proposal. Even though in Python, None has other meanings, but we are kind of stuck with None representing a disengaged optional

spark magnet Nov 27, 2022, 3:39 PM

#

broken sluice > can you explain the difference between singleton and monostate? A monostate t...

again, just to give you some perspective on the core devs' mindset: they aren't interested in "other languages do it this way", and using terms from other languages and cultures (like monostate or disengaged optional) aren't likely to convince them either. They want to hear, "Here's a thing I want to do with Python, and it would be much better if we made a change"

broken sluice Nov 27, 2022, 3:39 PM

#

If it were practical to separate the Optional None from the "null reference" None (or whatever other meanings people ascribe to it) that would be a great solution, unfortunately it doesn't seem practical at this point

#

If Python implemented Optional[T] as a Union[T, Unit] and had Unit hash to a constant in the first place, we wouldn't be having this discussion right now
There was a choice to overload None which caused this complication, it didn't have to be this way...

and the concerns of "value types" are universal to programming, I don't think they become irrelevant just because we are writing in Python
even if Python puts less emphasis on these ideas

halcyon trail Nov 27, 2022, 3:44 PM

#

spark magnet again, just to give you some perspective on the core devs' mindset: they aren't ...

That seems unfortunate to me. All languages can learn from one another, both because of intrinsically good ideas and because of things simply becoming consensus across many languages

spark magnet Nov 27, 2022, 3:44 PM

#

@broken sluice I don't know what Unit is in this case, and I don't understand what you mean by "overload None".

spark magnet Nov 27, 2022, 3:45 PM

#

halcyon trail That seems unfortunate to me. All languages can learn from one another, both bec...

corak is trying to convince people. You need to take your audience into account. How you craft a persuasive argument depends on who you are trying to persuade.

broken sluice Nov 27, 2022, 3:46 PM

#

So maybe I can't convince Python devs if that's really the prevailing attitude

#

We say reproducibility is important sometimes, you say it never is, so there's a stalemate

halcyon trail Nov 27, 2022, 3:47 PM

#

spark magnet corak is trying to convince people. You need to take your audience into account...

Sure I don't disagree with that and I understand you're only trying to help

#

Just think the attitude is a bit unfortunate. No language is an island after all.

#

Also I don't think ned said reproducibility never matters? Maybe I missed it

broken sluice Nov 27, 2022, 3:50 PM

#

denball did:
Hashes makes sense only in the same process. If you are saving hash and using it in other process, hashes are not obliged to mean anything useful

which is effectively the same thing, worded differently
if hashes are different, your second run won't be the same as the first. It is enough that iterate a set anywhere in the program, and it will diverge

dusk comet Nov 27, 2022, 3:50 PM

#

broken sluice If Python implemented `Optional[T]` as a `Union[T, Unit]` and had `Unit` hash to...

Creating more None's in the interpreter is a bad idea. It already has None, Ellipsis and NotImplemented singletons/sentinels for different purposes.
If you need one more sentinel, it is better to create it yourself: MISSING=object() instead of adding it to the interpreter
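
For readers unfamiliar with the idiom, a minimal sketch of a module-level sentinel. The `lookup` function is hypothetical and only shows how `MISSING` can separate "no value supplied" from an explicit `None`.

```python
MISSING = object()  # unique sentinel; only ever compared by identity


def lookup(mapping, key, default=MISSING):
    """Like mapping[key], but with an optional default that may itself be None."""
    value = mapping.get(key, MISSING)
    if value is MISSING:
        if default is MISSING:
            raise KeyError(key)
        return default
    return value


config = {"timeout": None}
print(lookup(config, "timeout"))        # None: the key exists and holds None
print(lookup(config, "retries", None))  # None: the key is absent, default used
```

Note that a plain `object()` sentinel still hashes by identity, so on its own it does not address the reproducibility concern raised in this thread.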

broken sluice Nov 27, 2022, 3:51 PM

#

I did that, but it's a lot of trouble for devs to maintain such code
having to say Union[T, Unit] instead of Optional[T] everywhere
having to convert None to Unit and back

#

and all that, for what? what is the gain from that approach over modifying None?

dusk comet Nov 27, 2022, 3:52 PM

#

broken sluice I did that, but it's a lot of trouble for devs to maintain such code having to s...

Why would you need to convert some sentinel to None back?

broken sluice Nov 27, 2022, 3:54 PM

#

let's say you write an optional to JSON. Must convert from Unit to None
it gets read back into None, must convert back to Unit
it is a hassle
You're not being honest if you claim it is trivial to sanitize None out of an entire large program, and keep it that way
and again, is it a good idea to ask people to do that? only because they want reproducible runs?
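
The round trip being described might look roughly like this. Everything here is a hypothetical sketch: `UnitType`, `Unit`, `dump_key` and `load_key` are invented names, and a real code base would need this mapping at every serialization boundary.

```python
import json


class UnitType:
    """Hypothetical stand-in for None with a constant hash."""
    _instance = None

    def __new__(cls):
        if cls._instance is None:
            cls._instance = super().__new__(cls)
        return cls._instance

    def __hash__(self):
        return 0

    def __repr__(self):
        return "Unit"


Unit = UnitType()


def dump_key(foo_id, bar_id):
    # Outbound: JSON has no notion of Unit, so it has to become null (None).
    return json.dumps({"foo_id": foo_id, "bar_id": None if bar_id is Unit else bar_id})


def load_key(text):
    # Inbound: null comes back as None and has to be mapped to Unit again.
    data = json.loads(text)
    bar_id = Unit if data["bar_id"] is None else data["bar_id"]
    return data["foo_id"], bar_id


text = dump_key(1, Unit)
print(text)
print(load_key(text))
```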

spark magnet Nov 27, 2022, 3:54 PM

#

broken sluice let's say you write an optional to JSON. Must convert from Unit to None it gets ...

I understand you are frustrated, but starting to accuse people of dishonesty is not going to help.

broken sluice Nov 27, 2022, 3:57 PM

#

You are right, but like I said, if people really want to think a certain way, nothing I say will ever convince them otherwise

#

and that's what it looks like to me :/

spark magnet Nov 27, 2022, 4:03 PM

#

it might be that they need more help to see the advantages, or how little it costs to make the change.

broken sluice Nov 27, 2022, 4:04 PM

#

so let's discuss the cost ... what is the actual cost?
I mean, apart from CR
Python devs will know more than I do about that for sure

spark magnet Nov 27, 2022, 4:05 PM

#

broken sluice so let's discuss the cost ... what is the actual cost? I mean, apart from CR Pyt...

Is there a pull request that implements the change?

#

it sounds to me like it would be a dozen lines for the change, and perhaps a dozen for a test.

halcyon trail Nov 27, 2022, 4:07 PM

#

Yeah, that makes sense. Seems very low cost, even if the benefits arent going to be seen as huge

halcyon trail Nov 27, 2022, 4:08 PM

#

dusk comet Hashes makes sense only in the same process. If you are saving hash and using it...

I think the main point is not the hashes being "meaningful" across processes, but reproducibility across processes. Which is very much a real thing.

#

See some of the above examples I gave around testing

dusk comet Nov 27, 2022, 4:10 PM

#

Why would you need hash(None)? Are you using it as key in dict? Item of set? Are you hashing tuple with None?

halcyon trail Nov 27, 2022, 4:11 PM

#

A data class with an optional field?

broken sluice Nov 27, 2022, 4:11 PM

#

Scroll up, there is an example where the hash of None injects non determinism into seemingly harmless key types

KeyType1 = tuple[int] | tuple[int, int]

@dataclass(frozen=True)
class KeyType2:
    foo_id: int
    bar_id: Optional[int]

halcyon trail Nov 27, 2022, 4:11 PM

#

Being used as a key

#

Extremely common examples

broken sluice Nov 27, 2022, 4:29 PM

#

Anyway I made a PR for this and it was closed. I think if I try to open another one, it might be viewed as trying to spam. and I can't re-open the closed PR AFAIK

#

Also in the implementation, we need to choose between two options

None hashes to a constant
None hashes to a deterministic function of the hash secret, so it only stays constant if PYTHONHASHSEED is used
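
The difference between the two options can be observed from outside the interpreter. The sketch below is an illustration under assumptions, not the PR's code: option 1 is modelled by a made-up constant, and option 2 by the hash of a fixed string, since CPython string hashes are already derived from the hash secret.

```python
import os
import subprocess
import sys

OPTION_1 = "print(0x1234ABCD)"        # hypothetical fixed constant
OPTION_2 = "print(hash('NoneType'))"  # stand-in for a value derived from the hash secret


def run(code, seed):
    """Run code in a fresh interpreter with PYTHONHASHSEED set to seed."""
    env = dict(os.environ, PYTHONHASHSEED=str(seed))
    result = subprocess.run(
        [sys.executable, "-c", code],
        env=env, capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


# Option 1: the same regardless of the seed.
print(run(OPTION_1, 1) == run(OPTION_1, 2))
# Option 2: reproducible only when the seed is pinned to the same value.
print(run(OPTION_2, 7) == run(OPTION_2, 7))
```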

flat gazelle Nov 27, 2022, 4:31 PM

#

I would go with constant tbh, PYTHONHASHSEED matters for hash collision attacks, which is impossible with None

#

get ready for the which constant bikeshedding

broken sluice Nov 27, 2022, 4:32 PM

#

I put 0xbadcab1e in my PR ... but I don't mind changing that 🙂

#

What of the fact that https://github.com/python/cpython/pull/99541 is closed?

GitHub

gh-99540: Constant hash for _PyNone_Type by yonillasky · Pull Reque...

Issue: gh-99540

halcyon trail Nov 27, 2022, 4:41 PM

#

broken sluice Also in the implementation, we need to choose between two options - None hashes t...

Yeah this is definitely the way to go

#

Also as to why it was closed , did you read the GitHub thread?

#

Looks like it was closed by a bot because you didn't update news (patch notes roughly speaking)

broken sluice Nov 27, 2022, 4:43 PM

#

I did - rhettinger explained himself there

#

No, the bot did not close it; I did update the news, via blurb

#

and the bot had green status on my change
rhettinger then posted the following on the issue

Thanks for the suggestion but this doesn't make sense. The default hash for every object is its object id. There is nothing special about None in this regard. Also, hash randomization was added intentionally for strings and bytes — we're definitely not in business of trying to make hashes constant and we don't want people to come to rely on a particular hash order or value.

and closed my PR

halcyon trail Nov 27, 2022, 4:44 PM

#

Ah sorry

#

That's incredibly lame, fwiw

#

I can respond fwiw not that I'm anybody

#

I can't seem to actually find rhettingers comment

broken sluice Nov 27, 2022, 4:46 PM

#

my opinion doesn't count for anything either
so I don't know, maybe we should leave it for someone who has the cred to push such a change

#

you don't see it there?

#

https://github.com/python/cpython/issues/99540

GitHub

Constant hash value for None · Issue #99540 · python/cpython

Feature or enhancement Fix hash(None) to a constant value. Pitch (Updated 2022.11.18) Under current behavior, the runtime leaks the ASLR offset, since the original address of the None singleton is ...

halcyon trail Nov 27, 2022, 4:48 PM

#

Ah there we go

#

I can respond there, probably won't help. Maybe on a mailing list you need to raise this first? Happy to support you there as well

broken sluice Nov 27, 2022, 4:51 PM

#

I could try that

spark magnet Nov 27, 2022, 5:17 PM

#

broken sluice I could try that

https://discuss.python.org

Discussions on Python.org

Discussions related to the Python Programming Language, Python Community, and Python Software Foundation operations.

broken sluice Nov 27, 2022, 5:18 PM

#

I sent the mail already

spark magnet Nov 27, 2022, 5:23 PM

#

broken sluice I sent the mail already

to which list?

broken sluice Nov 27, 2022, 5:25 PM

#

python-dev@python.org

#

Now also opened another thread in the forum, hopefully I don't get an infraction for that or something

feral island Nov 27, 2022, 7:35 PM

#

broken sluice and the bot had green status on my change rhettinger then posted the following o...

for what it's worth, Raymond is often quite close-happy when he feels a change is not correct. I don't really like it, but he is a very experienced core dev and educator

#

so his opinion should have some weight

broken sluice Nov 27, 2022, 7:40 PM

#

Fair point, however, it's one thing to think this change is wrong, another to say this: "There is nothing special about None in this regard".

He could say "catering to this use case is not worth the effort"
or "this will hinder planned changes x/y/z" or all kinds of other reasons. But saying something like that just makes it seem like he gave it only a moment's thought at best. Even if he in reality has extremely good reason for what he's saying

swift imp Nov 27, 2022, 8:12 PM

#

Are they saying they want the hash of None to be constant throughout multiple sessions? I thought its hash was based on id but it's a singleton so that's constant for the lifetime of the program.

broken sluice Nov 27, 2022, 8:13 PM

#

yes, multiple sessions.

swift imp Nov 27, 2022, 8:15 PM

#

Why

#

I don't understand the issue

#

Or should I say the benefit

#

I'm not seeing it in the issue

broken sluice Nov 27, 2022, 8:35 PM

#

You are debugging a program, or there is a unit test failure.
The test runs a lot of code. Somewhere the program computes a set of keys, then iterates on it and does more things under the loop. Let's say the keys are frozen dataclasses with optionals in them. because of the non-determinism of the hash, the set are organized differently each run. So anything downstream from that point diverges every run. Now it might cause flaky tests, failure to repeat problem cases even if you log the inputs, etc.

#

and all of that for what? No one ever told me who actually benefits from hash(None) changing every run

rose schooner Nov 27, 2022, 8:44 PM

#

well this has been going on for the duration of my sleeping time

pliant tusk Nov 27, 2022, 8:44 PM

#

broken sluice You are debugging a program, or there is a unit test failure. The test runs a lo...

wouldn't iterating over a set be non-deterministic anyways? because sets are unordered?

broken sluice Nov 27, 2022, 8:45 PM

#

In theory, yes. In practice, given the exact same operations history and same hash values, the set will be in a perfectly identical state each run, and thus will iterate its contents the same order. You don't know which order and don't care, but it will be the same

pliant tusk Nov 27, 2022, 8:45 PM

#

my point is that code that relies on deterministic set order is already a bug

broken sluice Nov 27, 2022, 8:46 PM

#

but the code doesn't rely on it

rose schooner Nov 27, 2022, 8:46 PM

#

broken sluice In theory, yes. In practice, given the exact same operations history and same ha...

i have no idea what you're talking about
```py
py -c "print([*{'afsaf', 'blak', 'clf', 'hae', '01s'}])"
['clf', 'afsaf', '01s', 'blak', 'hae']

py -c "print([*{'afsaf', 'blak', 'clf', 'hae', '01s'}])"
['blak', 'hae', 'clf', 'afsaf', '01s']

py -c "print([*{'afsaf', 'blak', 'clf', 'hae', '01s'}])"
['hae', 'afsaf', 'blak', 'clf', '01s']

py -c "print([*{'afsaf', 'blak', 'clf', 'hae', '01s'}])"
['afsaf', 'blak', 'clf', 'hae', '01s']
```

broken sluice Nov 27, 2022, 8:46 PM

#

yes, try that with PYTHONHASHSEED=constant

#

your bug was that the hash values weren't the same

rose schooner Nov 27, 2022, 8:46 PM

#

ok

broken sluice Nov 27, 2022, 8:47 PM

#

read carefully the scenario I mentioned. I did not say the code needs to assume anything about the order in which it will read things from the set. Any order is legal

rose schooner Nov 27, 2022, 8:48 PM

#

well i'm not too careful of a reader or too expert of a developer to understand what's even being talked about here

pliant tusk Nov 27, 2022, 8:49 PM

#

because of the non-determinism of the hash, the set are organized differently each run. So anything downstream from that point diverges every run. Now it might cause flaky tests,
^ that is stating a reliance on set order

broken sluice Nov 27, 2022, 8:49 PM

#

I can just paste a small example then, few mins

rose schooner Nov 27, 2022, 8:49 PM

#

broken sluice yes, try that with PYTHONHASHSEED=constant

ok it works now
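
The same experiment can be scripted. This is a sketch that reruns the one-liner from above in fresh interpreters with `PYTHONHASHSEED` pinned; the helper name `run_with_seed` and the seed value 42 are arbitrary choices for illustration.

```python
import os
import subprocess
import sys

CODE = "print([*{'afsaf', 'blak', 'clf', 'hae', '01s'}])"


def run_with_seed(seed):
    """Run CODE in a new interpreter with the given PYTHONHASHSEED."""
    env = dict(os.environ, PYTHONHASHSEED=str(seed))
    result = subprocess.run(
        [sys.executable, "-c", CODE],
        env=env, capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


first = run_with_seed(42)
second = run_with_seed(42)
print(first)
print("same order on both runs:", first == second)
```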

broken sluice Nov 27, 2022, 8:53 PM

#

code:

from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Key:
    foo_id: int
    bar_id: Optional[int]


set_of_keys = {
    Key(foo_id=i, bar_id=None)
    for i in range(10)
}

for key in set_of_keys:
    # If we perform any downstream logic here based on keys, behavior will diverge
    # between subsequent runs, even though set_of_keys is a constant input
    print(key)

run 1:
Key(foo_id=9, bar_id=None)
Key(foo_id=4, bar_id=None)
Key(foo_id=7, bar_id=None)
Key(foo_id=2, bar_id=None)
Key(foo_id=0, bar_id=None)
Key(foo_id=3, bar_id=None)
Key(foo_id=8, bar_id=None)
Key(foo_id=5, bar_id=None)
Key(foo_id=6, bar_id=None)
Key(foo_id=1, bar_id=None)

run 2:
Key(foo_id=6, bar_id=None)
Key(foo_id=1, bar_id=None)
Key(foo_id=9, bar_id=None)
Key(foo_id=4, bar_id=None)
Key(foo_id=0, bar_id=None)
Key(foo_id=7, bar_id=None)
Key(foo_id=2, bar_id=None)
Key(foo_id=8, bar_id=None)
Key(foo_id=3, bar_id=None)
Key(foo_id=5, bar_id=None)

broken sluice Nov 27, 2022, 8:57 PM

#

pliant tusk > because of the non-determinism of the hash, the set are organized differently ...

how so?
I can write code that iterates the set and does something, and the code can be correct for any order. Right? it happens all the time
and I can still want reproducible behavior when I'm debugging something

#

these things are not at odds with one another

pliant tusk Nov 27, 2022, 8:59 PM

#

im saying that if the code is non-deterministic when running normally, it should either be configured to enforce order explicitly when debugging if that is what you want

broken sluice Nov 27, 2022, 8:59 PM

#

imagine the set is a set of legal choices for some search algorithm
it might be correct to go over them all in any order
but maybe something goes wrong and you want the entire thing to take the exact same steps at each point
it generally makes life easier when you're debugging
I thought it's kind of obvious, but maybe not

pliant tusk Nov 27, 2022, 9:00 PM

#

broken sluice imagine the set is a set of legal choices for some search algorithm it might be ...

in that example, my test would use a list, not a set, as I want exact order

broken sluice Nov 27, 2022, 9:00 PM

#

I know there are workarounds

#

very often people use sets to deduplicate for example

#

now sure they can do such a thing without sets

#

but it's very easy to fall into the trap

pliant tusk Nov 27, 2022, 9:01 PM

#

function_to_produce_consistent_order(list(set(items)))

broken sluice Nov 27, 2022, 9:01 PM

#

and again, for what purpose do you want to make our lives harder
what is the external cost ..

#

If you did list(set(items)) you broke it

#

if your items can be sorted, you can stick a sorted at the end and you're OK. But not all keys are comparable like that
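
A sketch of that limitation, using made-up tuple keys with an optional second element. The order-preserving `dict.fromkeys` idiom at the end is a commonly used alternative for deduplication and was not proposed in the thread; it relies on dicts keeping insertion order.

```python
items = [(1, None), (1, 2), (1, None)]

# sorted(set(...)) needs the keys to be mutually comparable; None and int are not.
try:
    sorted(set(items))
    sort_failed = False
except TypeError:
    sort_failed = True
print("sorting failed:", sort_failed)

# Deduplicate while keeping first-seen order, independent of hash values.
deduped = list(dict.fromkeys(items))
print(deduped)
```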

#

and again, this is placing the burden on researchers who might not be too keen on preserving determinism

pliant tusk Nov 27, 2022, 9:03 PM

#

set order is non-deterministic by design, it is not something that can/should be able to be disabled

broken sluice Nov 27, 2022, 9:03 PM

#

it has undefined order. No requirement on the order. That is not the same thing as non-deterministic

#

It's a crucial point

pliant tusk Nov 27, 2022, 9:03 PM

#

how are those different?

broken sluice Nov 27, 2022, 9:04 PM

#

non-determinism is a behavior, not a requirement

pliant tusk Nov 27, 2022, 9:04 PM

#

If i run code in IronPython, PyPy, or CPython that uses sets, it can act differently

#

by design

broken sluice Nov 27, 2022, 9:04 PM

#

try to create a set of ints, that will show non deterministic behavior, being fed the same data/operations

#

really try it

#

something like, starting from these fixed inputs and these fixed operations, I run it once and get one order, I then run it again and get another order

#

again, i'm not saying anything about what the order is, any requirement at all

#

I don't care if it's different in another Python runtime

#

If all you're doing is debugging something, you don't care about that

pliant tusk Nov 27, 2022, 9:07 PM

#

tbh i dont care enough about this to do any of that, just if you are debugging something that loops through a set like that where a specific order of items breaks something you should build a proper test harness for that code to test your possibilites

#

not depend on the language doing it for you when it explicitly says it doesnt

pliant tusk Nov 27, 2022, 9:09 PM

#

pliant tusk not depend on the language doing it for you when it explicitly says it doesnt

because relying on undefined order means that any update (including minor versions) that changes something about sets can break your tests

broken sluice Nov 27, 2022, 9:12 PM

#

if the tests fail it's an indication the code or the test is wrong
I agree that tests should be extensive and then they'll catch order dependency bugs
and you know what? maybe for UTs this is good enough

but what about running the entire thing on some fixed input? let's say you have an input where the program did something that doesn't make sense and broke on an assert
what if the odds of that happening again are 1:1000, and it takes 30 min of compute time to get to that point

#

I mean, reproducible behavior has value, of course you can get by without it. You can get by without a lot of things

pliant tusk Nov 27, 2022, 9:13 PM

#

then you need better tests lmao

#

and better logs

broken sluice Nov 27, 2022, 9:14 PM

#

look, if you think reproducible behavior has no value at all, the discussion can end there

pliant tusk Nov 27, 2022, 9:16 PM

#

my point is that if you have code that is that complex, and has the potential to fail in odd edgecases, it should be possible to debug post mortem without needing to rerun the code

broken sluice Nov 27, 2022, 9:21 PM

#

I understand that, and maybe if I wrote all the code that I'm responsible for, the situation would be better. It's a lot of code that was written in a hurry, not everyone writing the operations research code we have is even an engineer.
It's a nightmare to debug it, and non determinism isn't helping

Yes, it would be nice if all of the complex behavior was broken down to small component and tested extremely well in isolation
In reality it isn't, though

you can just say, "well sucks to be you" but I am just asking who's actually benefitting from the non-determinism

#

if the answer is no one, then why have it

#

again note that if one day someone makes sets iterate completely at random every time they can. My change does not contain any contractual guarantee for sets. It can break at any time in the future and I am OK with that

pliant tusk Nov 27, 2022, 9:24 PM

#

afaik, the non-determinism of sets is a speed optimization

broken sluice Nov 27, 2022, 9:25 PM

#

no, I don't mean that (sets are still deterministic today!) I meant the non deterministic hash of None

#

again I could be wrong - show me a history of operations on a set that takes in only constant data with constant hashes and ends up iterating its data in a different order every run. That would be non-deterministic sets

pliant tusk Nov 27, 2022, 9:27 PM

#

^ probably possible as hash(obj) relies on id(obj) which is the address of obj which is non-deterministic from python

broken sluice Nov 27, 2022, 9:27 PM

#

but then it is not the set that is non-deterministic, it is the hash

#

that is the point

pliant tusk Nov 27, 2022, 9:28 PM

#

thats like me saying "its not the door that opens, its the door knob"

broken sluice Nov 27, 2022, 9:29 PM

#

No, it is not some philosophical statement...

pliant tusk Nov 27, 2022, 9:30 PM

#

my point was that if a dependency of an operation is non-deterministic, then the operation is non-deterministic

#

the set inherits it

broken sluice Nov 27, 2022, 9:30 PM

#

sets being non-deterministic means you can take things with fixed hashes, say construct the set {1, 2, 42} and iterate them, and let's say it returns: 2, 1, 42. Then you create another set exactly the same way and it iterates them in a different order
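
This claim is easy to probe. The sketch below builds the same set of ints in several fresh CPython processes, deliberately varying `PYTHONHASHSEED`, and collects the distinct iteration orders it sees. The helper name `iteration_order` is made up, and the script does not assume what the order is, only whether it repeats.

```python
import os
import subprocess
import sys

CODE = "print(list({1, 2, 42}))"


def iteration_order(seed):
    """Return the printed iteration order from a fresh interpreter."""
    env = dict(os.environ, PYTHONHASHSEED=seed)
    result = subprocess.run(
        [sys.executable, "-c", CODE],
        env=env, capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


# int hashes do not depend on the hash seed, so every run should agree.
orders = {iteration_order(seed) for seed in ("1", "2", "random")}
print(orders)
print("distinct orders seen:", len(orders))
```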

pliant tusk Nov 27, 2022, 9:31 PM

#

but you can make sets out of things that do not have fixed hashes

#

so yea, a subset of sets can be deterministic, but all sets cannot be deterministic

broken sluice Nov 27, 2022, 9:32 PM

#

right. I agree. then all bets are off.
but I usually avoid doing that. The researchers do too

pliant tusk Nov 27, 2022, 9:32 PM

#

I am trying to point out that what you are asking for is not just a deterministic hash of None

broken sluice Nov 27, 2022, 9:33 PM

#

they tend to use ints enums and such in keys. and if we set PYTHONHASHSEED we are generally ok.
Except when they try to use Optional[int] and then we're not

pliant tusk Nov 27, 2022, 9:33 PM

#

its a deterministic hash of everything

broken sluice Nov 27, 2022, 9:33 PM

#

not everything.

#

that can't be done anyway.

#

what types are used as keys? tuples of ints, strs, maybe bool, enum, maybe even frozensets of those things

pliant tusk Nov 27, 2022, 9:34 PM

#

!e If it is truly just None then just do this locally and call it a day
```py
from fishhook import hook

@hook(type(None))
def __hash__(self):
    return 0xdeadbeef

print(hex(hash(None)))
```

broken sluice Nov 27, 2022, 9:34 PM

#

all can be hashed deterministically, if only optional None didn't screw us over

fallen slateBOT Nov 27, 2022, 9:34 PM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

0xdeadbeef

broken sluice Nov 27, 2022, 9:35 PM

#

I thought of that.
Won't that break the world though?

#

if there is any set or dict anywhere in the Python runtime that was already created based on the default hash of None we are screwed

pliant tusk Nov 27, 2022, 9:35 PM

#

yea it would probably break things

#

you can also use LD_PRELOAD

#

but at that point just define your own class inplace of None and use that

broken sluice Nov 27, 2022, 9:36 PM

#

A C extension to patch the tp_hash descriptor?

pliant tusk Nov 27, 2022, 9:36 PM

#

it would work, but again, its a bad idea

broken sluice Nov 27, 2022, 9:37 PM

#

I guess I might do that. it is better than asking people to use non standard idioms for Optional

#

after all what they mean is Optional

#

not Union[T, SomeMadeUpSentinelJustSoICanHashItToZero]

pliant tusk Nov 27, 2022, 9:38 PM

#

you can use a .pth file to hook Optional and swap out None with your custom class

broken sluice Nov 27, 2022, 9:38 PM

#

but Optional isn't actually a class

#

there is no such type

#

It is in fact, just a T | None

#

I'm warming up to the C extension idea
after all nothing in the code will rely on it. we can always throw it away if we want

pliant tusk Nov 27, 2022, 9:40 PM

#

then hook types.UnionType with fishhook and swap out the None there

broken sluice Nov 27, 2022, 9:41 PM

#

I don't think static type checkers will understand that

#

but I'm not sure

pliant tusk Nov 27, 2022, 9:41 PM

#

broken sluice I don't think static type checkers will understand that

the static typecheckers won't care about whats in a pth file

broken sluice Nov 27, 2022, 9:43 PM

#

let's say your dataclass has a field that says
x: Optional[int] = None

A type checker will see that and consider it legit
and it is

how can your hook transform it into
x: Union[int, SentinelType] = Sentinel

#

seems really difficult to rewrite the program in such manner with no help from the programmer, in the general case at least

#

maybe with descriptors on all fields

#

actually
maybe the solution is to patch the hash function generated for dataclass

#

(but there's also NamedTuple...tuples, etc)

#

C extension is less headache I think

#

In fact, easiest is to just backport my PR into whatever version of Python I'm using

halcyon trail Nov 28, 2022, 1:30 AM

#

pliant tusk tbh i dont care enough about this to do any of that, just if you are debugging s...

This whole line of discussion seems to miss the point behind this. The point isn't to get deterministic behavior generally, the point is to get reproducible behavior when you need it

#

Probably you missed it because it was discussed earlier on, but it's pretty similar to how test frameworks that work with RNG start with a singular seed that may default to being taken from e.g. an OS source of randomness, but can actually be fed in at the command line

#

The idea being that if your tests fail sporadically, you can look at a failing test, look at the logged seed, and feed it back in to reproduce it exactly

#

Nobody should be depending on set iteration order on purpose, the point is that if you accidentally depend on it subtly, when you get a test failing you want to reproduce it exactly

spark magnet Nov 28, 2022, 1:39 AM

#

I just found the discuss.python.org thread about this. It got very heated it looks like

halcyon trail Nov 28, 2022, 1:39 AM

#

Sadly I'm not surprised

#

Do you have the link handy?

spark magnet Nov 28, 2022, 1:39 AM

#

https://discuss.python.org/t/constant-hash-for-none/21110 but the first post is gone

halcyon trail Nov 28, 2022, 1:40 AM

#

Hmm why is it gone?

spark magnet Nov 28, 2022, 1:40 AM

#

¯_(ツ)_/¯

#

i haven't read the whole thread, or know how to tell these things

halcyon trail Nov 28, 2022, 1:42 AM

#

I mean, with respect, people who didn't seem to understand the main ideas got into writing very long posts that were mostly irrelevant, and the conversation just got derailed

#

I think the original proposal needs to be stated in a more tightly focused way

spark magnet Nov 28, 2022, 1:42 AM

#

could be

halcyon trail Nov 28, 2022, 1:42 AM

#

Like the whole discussion of ordered set

#

That's totally irrelevant here. Not sure the original post by corak did a good enough job of steering away from that

spark magnet Nov 28, 2022, 1:47 AM

#

my recommendation generally is to try to say yes as much as you can, and avoid saying no. It will keep the discussion where you want it.

raven ridge Nov 28, 2022, 1:48 AM

#

halcyon trail Nobody should be depending on set iteration order on purpose, the point is that ...

this proposal only helps in the case where the set was full of only elements that have consistent hashes across runs or None's, though. Surely that's quite a rare case

spark magnet Nov 28, 2022, 1:49 AM

#

but i also know it can be very frustrating to try to convince them, and sometimes you just can't.

spark magnet Nov 28, 2022, 1:50 AM

#

raven ridge this proposal only helps in the case where the set was full of only elements tha...

setting the hash seed is a key element of making it deterministic.

raven ridge Nov 28, 2022, 1:53 AM

#

that still doesn't make it deterministic for most objects, just for those from a very small handful of types. So, yes, if you depended on iteration order of a set, and the only things in the set were either things that already have consistent hashes across runs or None's, then this proposal would allow you to reproduce the behavior. But if the set contained any other element that has inconsistent hashes across runs, you've still got the same problem as you had before.

spark magnet Nov 28, 2022, 1:53 AM

#

raven ridge that still doesn't make it deterministic for most objects, just a for those from...

none, int, float, strings would get you a long way.

#

and none is the only non-deterministic hash there.
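One way to check these claims is to compare hashes across fresh interpreter processes. This is only a sketch: `hash_in_fresh_interpreter` is a hypothetical helper, and the result for `None` is printed rather than asserted because it depends on the interpreter version and memory layout.

```py
import os
import subprocess
import sys


def hash_in_fresh_interpreter(expr: str, seed: str) -> str:
    """Evaluate hash(<expr>) in a new interpreter with PYTHONHASHSEED=seed."""
    env = dict(os.environ, PYTHONHASHSEED=seed)
    result = subprocess.run(
        [sys.executable, "-c", f"print(hash({expr}))"],
        env=env, capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


# With a fixed seed, str hashes repeat across interpreter runs.
assert hash_in_fresh_interpreter("'abc'", "123") == hash_in_fresh_interpreter("'abc'", "123")

# int hashes are not salted, so they repeat regardless of the seed.
assert hash_in_fresh_interpreter("42", "1") == hash_in_fresh_interpreter("42", "2")

# No assertion here: whether this value repeats is exactly what is being debated.
print(hash_in_fresh_interpreter("None", "123"))
```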

raven ridge Nov 28, 2022, 1:56 AM

#

those might get you a long way, but they don't get you all the way. Is something that makes a test reproducible only in very specific circumstances really that valuable?

halcyon trail Nov 28, 2022, 1:56 AM

#

raven ridge this proposal only helps in the case where the set was full of only elements tha...

Not really, I don't think it's very small

#

Optional is a member of many value semantic data classes

spark magnet Nov 28, 2022, 1:56 AM

#

raven ridge those might get you a long way, but they don't get you _all_ the way. Is somethi...

for me, the cost to do this is tiny, so if it helps someone, let's do it.

halcyon trail Nov 28, 2022, 1:57 AM

#

spark magnet none, int, float, strings would get you a long way.

Yes, exactly

#

Hettinger is wrong when he says that None is no different than any other type. None is one of a handful of basic building blocks for creating types that people use

#

And it's used very extensively in types with value semantics

#

Just like strings and numeric types

spark magnet Nov 28, 2022, 1:59 AM

#

you tell them I'm in favor! (this means nothing)

halcyon trail Nov 28, 2022, 2:00 AM

#

Obviously if you have classes that use identity semantics as keys things will be non reproducible, but that's not very surprising, and using identity semantic classes as keys is a choice and the impossibility to reproduce results is just one of the downsides

#

Personally I never use identity semantics in hash keys

#

The question I guess is how to reopen this discussion without people getting so upset

#

I don't know why that thread escalated so quickly

rich cradle Nov 28, 2022, 2:04 AM

#

i think conflating set iteration order and making a consistent hash for None blew it up pretty fast

spark magnet Nov 28, 2022, 2:04 AM

#

halcyon trail I don't know why that thread escalated so quickly

yoni went down the ordered set rabbit hole: https://discuss.python.org/t/constant-hash-for-none/21110/14. Better to keep the discussion on track.

halcyon trail Nov 28, 2022, 2:05 AM

#

Yeah that's very unfortunate

#

Online discussions you have to be laser precise

#

Also from the get go it's better to suggest that None has a salted hash the same way strings do. It maximizes security at no real performance or reproducibility cost

spark magnet Nov 28, 2022, 2:06 AM

#

halcyon trail Also from the get go it's better to suggest that None has a salted hash the same...

meh, all the ints and floats have predictable hashes, i don't see why None needs it salted. But ok.

raven ridge Nov 28, 2022, 2:07 AM

#

they do, but that's an implementation detail... I think folks were bristling at the idea of promoting that from implementation detail to supported feature

halcyon trail Nov 28, 2022, 2:08 AM

#

I think simply because there's almost no downside you should salt it. For ints, the performance hit was significant

halcyon trail Nov 28, 2022, 2:08 AM

#

raven ridge they do, _but_ that's an implementation detail... I think folks were bristling a...

It seems honestly pretty obvious to me that languages that salt their hashes should have a reproducibility mechanism built in

spark magnet Nov 28, 2022, 2:09 AM

#

halcyon trail It seems honestly pretty obvious to me that languages that salt their hashes sho...

that's the kind of language you have to avoid though

raven ridge Nov 28, 2022, 2:09 AM

#

Python didn't salt its hashes until very recently, and then they added salting only as a mitigation of a security vulnerability

halcyon trail Nov 28, 2022, 2:09 AM

#

I'm genuinely surprised to see e.g. a cpython core dev refer to "unnamed use cases for reproducibility between runs"

halcyon trail Nov 28, 2022, 2:09 AM

#

spark magnet that's the kind of language you have to avoid though

Yes but this is just between friends 😉

spark magnet Nov 28, 2022, 2:09 AM

#

halcyon trail I'm genuinely surprised to see e.g. a cpython core dev refer to "unnamed use cas...

sounds like it wasn't explained fully enough

halcyon trail Nov 28, 2022, 2:10 AM

#

raven ridge Python didn't salt its hashes until very recently, and then they added salting o...

Salting in general is a security mitigation though. Also, python did actually include a way to make hashing reproducible between runs, right?

#

Not sure what the motivation was, but it seems possible this need had already been recognized

raven ridge Nov 28, 2022, 2:11 AM

#

halcyon trail Salting in general is a security mitigation though. Also, python did actually in...

of the one specific type they added a salt to, yes.

halcyon trail Nov 28, 2022, 2:11 AM

#

I mean presumably if anything else is ever salted it will be using that same value to seed the salt

#

I.e. that environmental variable will always allow for reproducibility of any value semantic hashing

raven ridge Nov 28, 2022, 2:14 AM

#

you don't consider
```py
class SomeClass:
    pass

SINGLETON = SomeClass()
hash(SINGLETON)
```

halcyon trail Nov 28, 2022, 2:17 AM

#

It hasn't defined ==

#

Or hash

raven ridge Nov 28, 2022, 2:17 AM

#

that's correct, but it supports both equality and hash

halcyon trail Nov 28, 2022, 2:18 AM

#

A monostate Singleton is something of a degenerate case

#

That's a big part of why None creates confusion

raven ridge Nov 28, 2022, 2:19 AM

#

I'd call it value-semantic because it does allow == - so if this degenerate case is value-semantic, then no, PYTHONHASHSEED does not make all value-semantic hashes reproducible.
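A small sketch of what "supports both equality and hash" means for a class that defines neither. It reuses `SomeClass` from the message above; the remark about addresses describes CPython behaviour and should be read as an implementation detail.

```py
class SomeClass:
    pass


SINGLETON = SomeClass()
other = SomeClass()

# No __eq__ or __hash__ is defined, yet both operations work:
# object supplies identity-based defaults.
assert SINGLETON == SINGLETON
assert SINGLETON != other
assert hash(SINGLETON) == object.__hash__(SINGLETON)

# The default hash is derived from the object's address (a CPython detail),
# so the value can differ between runs even with PYTHONHASHSEED set.
print(hash(SINGLETON), id(SINGLETON))
```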

halcyon trail Nov 28, 2022, 2:19 AM

#

For a monostate Singleton, all instances are equal and hash the same so value and reference semantics aren't exactly distinguishable

#

I would say it's clearly a corner case but not very likely to come up much

#

It comes up a lot for None in practice because it's part of Optional, in practice

#

And Optional is a pretty common building block in value semantic types

raven ridge Nov 28, 2022, 2:30 AM

#

well, that's probably the most convincing way to formulate this argument.

- Reproducible hashes are useful for reproducing test failures (which is why pytest defaults to printing out PYTHONHASHSEED, for instance)
- The non-reproducibility of hash(None) even when PYTHONHASHSEED is set is the only thing making the hashes of many simple dataclasses non-reproducible.
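To illustrate how `hash(None)` leaks into a dataclass hash, here is a sketch with a hypothetical `Record` type. It assumes the generated `__hash__` hashes the tuple of field values, which is how CPython's `dataclasses` module builds it for frozen classes with `eq=True`.

```py
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Record:  # hypothetical example type
    name: str
    limit: Optional[int] = None


r = Record("a")

# The generated __hash__ hashes the tuple of field values,
# so hash(None) feeds straight into the hash of the record.
assert hash(r) == hash(("a", None))

# Value semantics: equal field values mean equal records and one set entry.
assert r == Record("a")
assert len({r, Record("a")}) == 1
```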

gray galleon Nov 28, 2022, 2:37 AM

#

when will python have symbol type ||have i asked it before||

long isle Nov 28, 2022, 2:50 AM

#

Help me

gray galleon Nov 28, 2022, 2:50 AM

#

long isle Help me

#1035199133436354600

boreal umbra Nov 28, 2022, 2:58 AM

#

@rich cradle I've always thought the evils of inheritance were overstated, but that's probably because I rarely actually make subclasses, and when I do, it's specifically to avoid boilerplate.

rich cradle Nov 28, 2022, 2:59 AM

#

i've just never written code that really needs that kind of structure. i dunno why.

#

i tend to abuse things like protocols though. holdover from using typeclasses in haskell and rust.

#

the entire inheritance model seems strange to me, architecting shared behavior as a tree, but ¯_(ツ)_/¯ i don't actually use it where possible

boreal umbra Nov 28, 2022, 3:02 AM

#

well, that's why we have duck typing bing_shrug

rich cradle Nov 28, 2022, 3:02 AM

#

well, now that i think about it, it's not even a tree, it's a... directed acyclic graph? maybe? which is even more wild.

swift imp Nov 28, 2022, 3:03 AM

#

When you say protocol you mean dunders or structural protocols

rich cradle Nov 28, 2022, 3:03 AM

#

whatever typing.Protocol is, so probably the latter

swift imp Nov 28, 2022, 3:03 AM

#

How do you abuse that

rich cradle Nov 28, 2022, 3:04 AM

#

i use it in places where inheritance probably would be more appropriate, that's all

swift imp Nov 28, 2022, 3:04 AM

#

Oh

#

I mean isn't using a protocol vs an ABC like functional vs OOP

#

Just different paradigms

boreal umbra Nov 28, 2022, 3:05 AM

#

it might be a different take on OOP. it might even be a different paradigm. but if it is, functional isn't the one

swift imp Nov 28, 2022, 3:06 AM

#

Wait

#

Is typing.protocol the one you subclass?

boreal umbra Nov 28, 2022, 3:06 AM

#

!docs typing.Protocol

fallen slateBOT Nov 28, 2022, 3:06 AM

#

typing.Protocol

```py
class typing.Protocol(Generic)
```
Base class for protocol classes. Protocol classes are defined like this:

```py
class Proto(Protocol):
    def meth(self) -> int:
        ...
```
Such classes are primarily used with static type checkers that recognize structural subtyping (static duck-typing), for example...

swift imp Nov 28, 2022, 3:06 AM

#

And then you can type hint saying your callable takes in an instance matching that protocol
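A short sketch of that usage. `Quacker`, `Duck` and `make_noise` are hypothetical names; the point is that `Duck` never mentions the protocol.

```py
from typing import Protocol


class Quacker(Protocol):  # hypothetical protocol
    def quack(self) -> str:
        ...


class Duck:  # note: does not inherit from Quacker
    def quack(self) -> str:
        return "Quack"


def make_noise(thing: Quacker) -> str:
    # A static type checker accepts any object with a matching quack().
    return thing.quack()


print(make_noise(Duck()))
```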

#

Yeah yeah ok

rich cradle Nov 28, 2022, 3:07 AM

#

boreal umbra it might be a different take on OOP. it might even be a different paradigm. but ...

i dunno if it has a name. it's a style that originated (afaik) with haskell typeclasses.

#

well it's different to some extent, but it's the most similar there is in python that i know of

#

well, i think it is. it's been months since i wrote sane python code. the past few months have been largely random things to test one of my projects.

swift imp Nov 28, 2022, 3:09 AM

#

I don't think I write generic enough to really use protocols

#

And while I've used ABCs, it's honestly unnecessary

boreal umbra Nov 28, 2022, 3:13 AM

#

I feel like ABCs are only there to appease people from languages that have them

#

it's more consistent with Python's philosophy to just... not instantiate the class.

halcyon trail Nov 28, 2022, 3:34 AM

#

raven ridge well, that's probably the most convincing way to formulate this argument. - Repr...

Yep, I completely agree. @broken sluice if you haven't given up, i think this may be the purest distillation of the idea I've seen.

halcyon trail Nov 28, 2022, 3:36 AM

#

boreal umbra it's more consistent with Python's philosophy to just... not instantiate the cla...

Not exactly, ABCs in conjunction with mypy also enforce that things are overridden and overridden correctly, e.g. without typos in the name or signature

#

That's very useful and a very easy mistake to make by accident
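A sketch of the typo case, using hypothetical `Shape` and `Square` classes. It shows only the runtime check that `abc` performs at instantiation; the signature checking mentioned above is the type checker's job and is not demonstrated here.

```py
from abc import ABC, abstractmethod


class Shape(ABC):  # hypothetical base class
    @abstractmethod
    def area(self) -> float:
        ...


class Square(Shape):
    def __init__(self, side: float) -> None:
        self.side = side

    def aera(self) -> float:  # typo: this does not override area()
        return self.side ** 2


try:
    Square(2.0)
except TypeError as exc:
    # Instantiation fails because the abstract method area() is still missing.
    print(f"caught: {exc}")
```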

halcyon trail Nov 28, 2022, 3:38 AM

#

rich cradle i dunno if it has a name. it's a style that originated (afaik) with haskell type...

Fwiw, protocols are pretty fundamentally different from type classes or traits because the former are satisfied implicitly. The latter, explicitly

#

Python protocols are more like Go's interfaces

rich cradle Nov 28, 2022, 3:39 AM

#

yes

#

but i think they're the closest usable equivalent we can have in python, at least in my usage of them

halcyon trail Nov 28, 2022, 3:40 AM

#

Idk, they are more or less close depending on what you look at

#

ABCs are explicit, like typeclasses

#

In that sense, ABCs are closer

rich cradle Nov 28, 2022, 3:41 AM

#

right, hence my "usable"

#

you can't add ABCs to random stdlib types or things from other packages

#

...i think

halcyon trail Nov 28, 2022, 3:42 AM

#

I think you can but I'm not sure if mypy recognizes it

raven ridge Nov 28, 2022, 3:43 AM

#

I just see Python's Protocols as formalized duck-typing.

#

Protocol lets you describe what a duck looks like

rich cradle Nov 28, 2022, 3:43 AM

#

that's exactly what they are. but i'm fine with that, personally.

halcyon trail Nov 28, 2022, 3:43 AM

#

Fwiw in 95 percent of cases, ABCs are very easy to use.

#

It's compile time duck typing more or less

#

Which is what structural typing is

#

In that sense it matches well with python

#

But there's a fair amount of criticism (that I agree with) in just implicitly satisfying a constraint because you have the right API

rich cradle Nov 28, 2022, 3:45 AM

#

i absolutely would prefer to tack on a ridiculously powerful type system to python. i just don't think it's fundamentally possible, and would break a lot of things.

halcyon trail Nov 28, 2022, 3:45 AM

#

That's why structural typing is very rare, out of popular static languages mostly just Go uses it

#

It's not really about the power of the type system per se

rich cradle Nov 28, 2022, 3:46 AM

#

isn't it? python has built a lot of its type system by shoving things into the class model, even if they don't necessarily fit.

halcyon trail Nov 28, 2022, 3:47 AM

#

No? A specific choice of the type system isn't the same as it being more powerful

rich cradle Nov 28, 2022, 3:47 AM

#

no, wait. ignore my previous statement. that was another thing i somewhat disagree with, but not what i meant to say.

halcyon trail Nov 28, 2022, 3:48 AM

#

It's just worth trying to use ABCs if you haven't. Using ABCs has very little to do with the inheritance rabbit hole

rich cradle Nov 28, 2022, 3:48 AM

#

i think what i'm really getting at is "i want static typing, and i want language features that only make sense with it," but there's no chance in hell that's happening. that arguably wouldn't even be python anymore.

halcyon trail Nov 28, 2022, 3:49 AM

#

IME most usages of polymorphism don't actually require the loose coupling provided by protocols

#

A class that implements an ABC is explicit about it, which is nice, and it also means you get errors early rather than later, like with protocols

#

Well, sure, but I'm talking about the options available in Python

rich cradle Nov 28, 2022, 3:50 AM

#

right. i probably shouldn't have even brought that up.

halcyon trail Nov 28, 2022, 3:52 AM

#

I forget how

#

But almost sure it can be done

#

https://docs.python.org/3/library/abc.html#abc.ABCMeta.register

#

There ya go

#

https://github.com/python/mypy/issues/2922

GitHub

ABCMeta.register support · Issue #2922 · python/mypy

class A(metaclass=abc.ABCMeta): pass class B: pass A.register(B) a: A = B() # currently E: Incompatible types in assignment (expression has type "B", variable has type "A...

#

Basically for many people this would be the ideal in a static type system

#

Explicit but non intrusive

#

ABCs are explicit and intrusive

#

Protocols are non-intrusive because they're implicit

#

Haskell type classes and rust traits have this nice explicit but non intrusive property
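For reference, a sketch of what `ABCMeta.register` does at runtime, with hypothetical `Drawable` and `ThirdPartyWidget` classes. Per the mypy issue linked above, a static checker may not recognise the registration, so this shows runtime behaviour only.

```py
import abc


class Drawable(abc.ABC):  # hypothetical ABC
    @abc.abstractmethod
    def draw(self) -> str:
        ...


class ThirdPartyWidget:  # stands in for a class you cannot edit
    def draw(self) -> str:
        return "widget"


# Explicit, but the registered class itself is left untouched.
Drawable.register(ThirdPartyWidget)

assert issubclass(ThirdPartyWidget, Drawable)
assert isinstance(ThirdPartyWidget(), Drawable)

# register() only records the relationship: nothing is added to the MRO,
# and it does not verify that draw() actually exists.
assert Drawable not in ThirdPartyWidget.__mro__
```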

gray galleon Nov 28, 2022, 5:49 AM

#

gray galleon when will python have symbol type ||have i asked it before||

.

broken sluice Nov 28, 2022, 7:18 AM

#

https://discuss.python.org/t/hash-none-mk-2/21465/16
if you care at all to write anything there
I think there's no use tbh
I've made my case, both sides are repeating the same arguments

Discussions on Python.org

hash(None) Mk.2

At least you are making some sort of argument that I can respond to. You really should have started with that. If an operation returns a constant result (as can be observed from the source code, which is open), running it by definition confers no information to an attacker. I don’t need to be a security expert to know that. If anything, it is ...

radiant garden Nov 28, 2022, 7:58 AM

#

Feeling odd deja vu here

quick snow Nov 28, 2022, 7:59 AM

#

broken sluice https://discuss.python.org/t/hash-none-mk-2/21465/16 if you care at all to write...

Have you looked into faking /dev/random? That should account for ASLR and anything else, fixing not just your specific usecase, but any hashes of arbitrary objects (I think): https://stackoverflow.com/a/26067735/1016216

Stack Overflow

bypass dev/urandom|random for testing

I want to write a functional test case that tests a program with a known value for random numbers. I have already tested it with mocks during the unit testing. But I would like that for functional

broken sluice Nov 28, 2022, 10:32 AM

#

The source of the non-determinism is the memory location of None (since that is what the hash function is based on). It is not due to input from RNGs

elder blade Nov 28, 2022, 11:18 AM

#

broken sluice https://discuss.python.org/t/hash-none-mk-2/21465/16 if you care at all to write...

~~Can I get a Tl;Dr? hash(None) was X and now they want to change hash(None) to 123?~~

Am I understanding things correctly that hash(None) was the default implementation hash(id(self)) and they want to change it to 123 or whatever?

#

!e print(0xBADCAB1E)

fallen slateBOT Nov 28, 2022, 11:23 AM

#

@elder blade :white_check_mark: Your 3.11 eval job has completed with return code 0.

3135023902

elder blade Nov 28, 2022, 11:24 AM

#

flat gazelle Nov 28, 2022, 11:27 AM

#

elder blade ~~Can I get a Tl;Dr? `hash(None)` was X and now they want to change `hash(None)`...

yup, the goal is to make hashes deterministic for things that make sense to use as keys, using similar reasoning to PYTHONHASHSEED being a thing instead of just leaving it random and not letting the user set it.

rose schooner Nov 28, 2022, 12:27 PM

#

elder blade

1 in 1 chance random

dusk comet Nov 28, 2022, 12:52 PM

#

elder blade ~~Can I get a Tl;Dr? `hash(None)` was X and now they want to change `hash(None)`...

I think OP want this behavior:

# pseudocode
def get_none_hash() -> int:
    if USES_PYTHONHASHSEED:
        return RNG(PYTHONHASHSEED).random() # get random number from seeded RNG
    else:
        return RNG().random() # get random number from RNG seeded by random seed
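A runnable take on that pseudocode, purely as an illustration of the idea and not how CPython computes any hash. `get_none_hash` is hypothetical, and `random.Random` stands in for the unspecified `RNG`.

```py
import os
import random
from typing import Optional


def get_none_hash(seed: Optional[str]) -> int:
    """Hypothetical: derive a hash value for None from the hash seed."""
    if seed is not None and seed != "random":
        rng = random.Random(int(seed))  # seeded RNG: same value on every run
    else:
        rng = random.Random()  # seeded from OS entropy: differs per run
    return rng.getrandbits(61)


# The same seed always yields the same value.
assert get_none_hash("42") == get_none_hash("42")

# Falls back to an unseeded RNG when PYTHONHASHSEED is unset or "random".
print(get_none_hash(os.environ.get("PYTHONHASHSEED")))
```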

elder blade Nov 28, 2022, 1:02 PM

#

That would make more sense, but unfortunately the PR they submitted is weird and the behaviour they seem to want even weirder

broken sluice Nov 28, 2022, 1:03 PM

#

I'm not sure which makes more sense than returning a constant, there are arguments and opinions both ways

#

it does seem to make a lot of sense to me for a monostate type to hash to a constant, but what do I know

halcyon trail Nov 28, 2022, 1:04 PM

#

elder blade That would make more sense, but unfortunately the PR they submitted is weird and...

What's weird about the behavior they want?

broken sluice Nov 28, 2022, 1:04 PM

#

I couldn't quite figure it out of all the memes

halcyon trail Nov 28, 2022, 1:25 PM

#

broken sluice I couldn't quite figure it out of all the memes

I posted in that thread trying to support it, fwiw

swift imp Nov 28, 2022, 1:57 PM

#

The biggest argument against it, is the false premise that set iteration is dependent purely on hashes and not history of the set, Steve D provided that counter example in first reply of OG thread

native flame Nov 28, 2022, 1:58 PM

#

the history of the set doesnt change over multiple runs

swift imp Nov 28, 2022, 1:59 PM

#

After thinking about it more, I get what the OP wants but the reason for their wanting it, is just wrong and could be found for any number of classes, even after fixing None

native flame Nov 28, 2022, 1:59 PM

#

like what though

swift imp Nov 28, 2022, 1:59 PM

#

Pick virtually any object whose hash is based on id and u r back to square one

native flame Nov 28, 2022, 2:00 PM

#

the argument is that all other objects commonly used as dict keys dont do that

swift imp Nov 28, 2022, 2:01 PM

#

That's weak imo

native flame Nov 28, 2022, 2:01 PM

#

str, bool, float, int, tuples of those

swift imp Nov 28, 2022, 2:01 PM

#

Str do not give constant hash

native flame Nov 28, 2022, 2:01 PM

#

they do if you set the seed with the flag

feral island Nov 28, 2022, 2:04 PM

#

type objects are reasonable choices as dict keys

#

and they also have non-reproducible hashes

swift imp Nov 28, 2022, 2:04 PM

#

Exactly

native flame Nov 28, 2022, 2:04 PM

#

fair

swift imp Nov 28, 2022, 2:04 PM

#

I've used registry patterns where the key is the type
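A sketch of such a registry, with hypothetical names throughout. Lookups by key do not depend on the hash values themselves; only order-sensitive uses of those hashes (for example iterating a set of types) could vary between runs.

```py
from typing import Any, Callable, Dict

# Hypothetical registry mapping a type to a handler for values of that type.
HANDLERS: Dict[type, Callable[[Any], str]] = {}


def register(tp: type):
    """Decorator that stores the decorated function under the given type."""
    def decorator(func: Callable[[Any], str]) -> Callable[[Any], str]:
        HANDLERS[tp] = func
        return func
    return decorator


@register(int)
def handle_int(value: int) -> str:
    return f"int:{value}"


@register(str)
def handle_str(value: str) -> str:
    return f"str:{value!r}"


def dispatch(value: Any) -> str:
    return HANDLERS[type(value)](value)


assert dispatch(3) == "int:3"
assert dispatch("x") == "str:'x'"

# The hash of a type object is identity-based, so this may differ per run.
print(hash(int))
```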

#

I don't want to beat a dead horse, I just think time would be better spent refactoring their need to iterate a set in specific order

#

Pretty sure the help or repl hashes types, into a set no less. I've gotten weird errors when messing up custom hash implementations, and all of a sudden repr broke in the repl

flat gazelle Nov 28, 2022, 2:11 PM

#

swift imp After thinking about it more, I get what the OP wants but the reason for their w...

you could use the same argument against providing PYTHONHASHSEED, and yet it exists, despite being more complex

swift imp Nov 28, 2022, 2:14 PM

#

flat gazelle you could use the same argument against providing PYTHONHASHSEED, and yet it exi...

Didn't someone show a version of python compiled without the randomization and None was providing a consistent hash?

flat gazelle Nov 28, 2022, 2:15 PM

#

None uses an id-based hash. Which is mostly going to be stable due to the way modern OSs work with memory. But it is nevertheless a non-deterministic value

halcyon trail Nov 28, 2022, 2:32 PM

#

swift imp The biggest argument against it, is the false premise that set iteration is depe...

That's not an argument against it at all

#

Just a misunderstanding

#

Because that was never a premise

#

The point is reproducibility, reproducibility just requires eliminating actual sources of randomness

#

Whether the set's history affects iteration order doesn't matter, because it's not magically randomized

#

None's hash is randomized. Just like a string's hash is. The latter provides a way to make it non-random. The former doesn't.

halcyon trail Nov 28, 2022, 2:38 PM

#

feral island type objects are reasonable choices as dict keys

That's probably the best example I've seen of a reasonable use case for identity based hashing in python, one I admit I've actually used

#

The example should be kept in mind but to use it as a basis to reject improvements in reproducibility would IMHO be Nirvana fallacy

broken sluice Nov 28, 2022, 2:45 PM

#

Set's history affects iteration order and that's perfectly fine, because the next time you run the program on the same input it will create the same set in the same way
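A sketch of that point using ints, whose hashes are the same on every run. `build` is a hypothetical helper; the differing iteration order is a CPython implementation detail, so it is printed rather than asserted.

```py
def build(order):
    """Create a set by adding the items in the given order."""
    s = set()
    for item in order:
        s.add(item)
    return s


a = build([0, 8, 16])
b = build([16, 8, 0])

# Equal sets, but colliding elements can land in different slots depending
# on insertion order, so iteration order may differ between a and b.
assert a == b
print(list(a), list(b))

# Replaying the same operations on the same input gives the same order.
assert list(build([0, 8, 16])) == list(a)
```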

#

It's a misconception they keep repeating over and over

#

the only thing the set itself can do to break reproducibility is if it reorganizes its internal structure in a manner that isn't deterministic by its input commands

#

for example, a set that has a thread that concurrently rehashes it or something

#

I don't know of a single programming language that offers only non-deterministic sets. It's a nightmare honestly, and concurrent hashmaps are used only for super high perf applications, that probably don't want to use Python anyway

gray galleon Nov 28, 2022, 2:56 PM

#

is it me or @ is the most underused operator in python
its only use case is in numpy for matrix multiplications
even then dot gives the same functionality

umbral plume Nov 28, 2022, 4:10 PM

#

gray galleon is it me or `@` is the most underused operator in python its only use case is in...

the @ binary operator goes literally unused in the stdlib AFAIK, it really was just added to greatly help out all the numerical libraries like numpy and such, https://peps.python.org/pep-0465/ lists out a bunch of motivations for adding it to the language

PEP 465 – A dedicated infix operator for matrix multiplication | pe...

Python Enhancement Proposals (PEPs)

#

also, it's recommended to use @ over np.dot when possible, since expressions then map onto the mathematical equations much more clearly (plus a little boost in performance i think)

quick snow Nov 28, 2022, 4:16 PM

#

gray galleon is it me or `@` is the most underused operator in python its only use case is in...

Unary @ is used a lot.
Less used operators, IMO: ~ and + (unary)

gray galleon Nov 28, 2022, 4:16 PM

#

unary @?
you mean decorators?

quick snow Nov 28, 2022, 4:26 PM

#

Yes

#

I guess it's not an operator there

dusk comet Nov 28, 2022, 4:53 PM

#

quick snow Unary `@` is used a lot. Less used operators, IMO: `~` and `+` (unary)

also: not not x

#

and __divmod__

quick snow Nov 28, 2022, 4:56 PM

#

dusk comet also: `not not x`

Doesn't count, otherwise I nominate not + ~ + + - ~x

feral island Nov 28, 2022, 5:03 PM

#

quick snow Unary `@` is used a lot. Less used operators, IMO: `~` and `+` (unary)

https://github.com/python/cpython/blob/main/Modules/_datetimemodule.c#L2173

fallen slateBOT Nov 28, 2022, 5:03 PM

#

Modules/_datetimemodule.c line 2173

/* Could optimize this (by returning self) if this isn't a

dapper lily Nov 28, 2022, 5:06 PM

#

time for a PR

deft horizon Nov 28, 2022, 7:06 PM

#

Makes me want to implement date @ time -> datetime, it's a bad idea but would be cute.

#

I'd also nominate @ as the function composition operator (by exact analogy to matrix multiplication!) but we don't even have functools.compose(), so.
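Since plain functions don't support `@`, one way to try the idea is a small wrapper class. `Composable` is a hypothetical name, not something from the standard library.

```py
from typing import Any, Callable


class Composable:
    """Hypothetical wrapper: f @ g builds the function x -> f(g(x))."""

    def __init__(self, func: Callable[[Any], Any]) -> None:
        self.func = func

    def __call__(self, arg: Any) -> Any:
        return self.func(arg)

    def __matmul__(self, other: Callable[[Any], Any]) -> "Composable":
        # Apply `other` first, then this function, as in mathematical f ∘ g.
        return Composable(lambda arg: self.func(other(arg)))


double = Composable(lambda x: x * 2)
increment = Composable(lambda x: x + 1)

assert (double @ increment)(3) == 8   # double(increment(3))
assert (increment @ double)(3) == 7   # increment(double(3))
```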

dusk comet Nov 28, 2022, 7:10 PM

#

You can fishhook FunctionType.__matmul__

grave jolt Nov 28, 2022, 7:23 PM

#

wait... unary @?

#

oh you mean decorator syntax?

paper echo Nov 28, 2022, 7:32 PM

#

deft horizon I'd also nominate `@` as the function composition operator (by exact analogy to ...

i've been wanting this for a long time. i think calling @ "matmul" was a huge mistake, they should have just left it open as a "do what you want" operator

grave jolt Nov 28, 2022, 7:50 PM

#

btw, is it used anywhere outside of numpy?

#

and the cursed emails thing

paper echo Nov 28, 2022, 7:56 PM

#

grave jolt btw, is it used anywhere outside of numpy?

i think other mathematical/array libraries support it now

#

yeah xarray has it

halcyon trail Nov 28, 2022, 7:57 PM

#

My guess is that they probably don't want people randomly abusing operators just to get infix

paper echo Nov 28, 2022, 7:57 PM

#

probably true, "here's a random operator have fun" would be pretty un-pythonic

halcyon trail Nov 28, 2022, 7:57 PM

#

In 99 percent of cases if an operator isn't already familiar in a context then overloading it is the wrong call

#

(pun intended?)

grave jolt Nov 28, 2022, 8:24 PM

#

/ for paths and urls was kinda strange tbh

#

but I think I got used to it

#

Haskell libraries used all kinds of custom operators with urls

halcyon trail Nov 28, 2022, 8:40 PM

#

It's not really strange

#

It's the character separator and it's what you type in to combine paths in bash
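For comparison, this is how the `/` operator reads with `pathlib` (using `PurePosixPath` so the result is the same on any platform):

```py
from pathlib import PurePosixPath

base = PurePosixPath("/usr")

# Joining with / mirrors how the path is written out.
assert base / "local" / "bin" == PurePosixPath("/usr/local/bin")

# A plain string on the left works too, via the reflected operator.
assert "/opt" / PurePosixPath("tools") == PurePosixPath("/opt/tools")
```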

#

It's certainly less strange than + for strings.

#

C++ also uses operator /, just like python

#

Another reality here is that standard library just has more leeway. They can pick something semi reasonable and everyone will learn it pretty fast. For a third party library it's more annoying really to use operators in obscure ways

grave jolt Nov 28, 2022, 9:01 PM

#

halcyon trail It's certainly less strange than + for strings.

Julia moment 🥴

dusk comet Nov 28, 2022, 9:12 PM

#

grave jolt `/` for paths and urls was kinda strange tbh

It is very natural to me

halcyon trail Nov 28, 2022, 9:25 PM

#

grave jolt Julia moment 🥴

What does Julia do?

grave jolt Nov 28, 2022, 9:31 PM

#

halcyon trail What does Julia do?

* for concatenation, ^ for repeating

#

#

something something monoid

halcyon trail Nov 28, 2022, 9:33 PM

#

Yeah I've seen people make this argument before

#

It's ridiculous

#

Ironically, * was of course used for multiplying reals, integers etc long before matrices

#

Which is commutative

#

* was selected for matrix multiplication because it's conceptually similar to multiplication

sacred yew Nov 28, 2022, 9:36 PM

#

grave jolt

???

flat gazelle Nov 28, 2022, 9:36 PM

#

they could have quite literally picked any operator in existence, it's julia, they support all of latex.

halcyon trail Nov 28, 2022, 9:37 PM

#

Mathematicians weren't slaves to the fact that they were using a previously commutative operator for a non commutative operation

flat gazelle Nov 28, 2022, 9:37 PM

#

but eh, if someone wants to go all math nerd for string concat, sure

sacred yew Nov 28, 2022, 9:37 PM

#

integer subtraction is noncommutative
guess * should be for subtraction then

halcyon trail Nov 28, 2022, 9:38 PM

#

When you put higher value on formalism than concepts relative to mathematicians you know you're in bad shape

flat gazelle Nov 28, 2022, 9:38 PM

#

well, subtraction is not associative

#

the convention that * is for associative operations is a fairly new one

dusk comet Nov 28, 2022, 9:54 PM

#

grave jolt

Following this logic, string repetition should be +, because it is commutative operation: 5+'abc' == 'abc'+5

rose schooner Nov 28, 2022, 10:08 PM

#

grave jolt `*` for concatenation, `^` for repeating

^ is power isn't it?

#

power is right-associative and non-commutative though?

halcyon trail Nov 28, 2022, 10:14 PM

#

I think intuitively it's pretty clear that conceptually string concatenation is like adding. Each item present in each of the arguments shows up exactly once in the final string

#

And with multiplication the things in the collection are multiplied

#

In other words, if z = x + y, then len(z) = len(x) + len(y)

#

And the same relationship holds for string multiplication
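Spelled out for Python strings, where `+` concatenates and `*` repeats:

```py
x, y = "abc", "de"

# Concatenation adds lengths.
z = x + y
assert len(z) == len(x) + len(y)

# Repetition multiplies them.
n = 3
assert len(x * n) == len(x) * n
```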

swift imp Nov 28, 2022, 11:37 PM

#

paper echo i've been wanting this for a long time. i think calling `@` "matmul" was a huge ...

Cannot agree more. I would like function composition too

quick snow Nov 29, 2022, 6:42 AM

#

halcyon trail Ironically, * was of course used for multiplying reals, integers etc long before...

Wait, what does Julia use for scalar multiplication then? •?

quick snow Nov 29, 2022, 8:01 AM

#

How could they use *, that denotes a noncommutative operation, while scalar multiplication is commutative! Should have used + for multiplication, clearly

deft horizon Nov 29, 2022, 8:06 AM

#

swift imp Cannot agree more. I would like function composition too

In fairness matrix multiplication is very common in the sciences! Fortunately it's also just a special case of function composition (where the functions are linear transformations), so that makes sense. And like (matrix) multiplication, function composition is often represented by an infix dot operator 🙂

dusk comet Nov 29, 2022, 9:28 AM

#

__matmul__ is not always a matrix multiplication
__add__ is not always an addition
__truediv__ is not always a division (for example pathlib.Path)

So, "matmul" is not a bad name for that dunder. Different operators have different names (+ addition, - subtraction, * multiplication, / division, @ matrix multiplication), and the operator names are irrelevant to what the operators do. So I don't see it (matmul being a name for the operator) as a problem

#

There is also the z3 lib (iirc), which has placeholder variables, so X+Y is not the result of an addition but an expression object that can be evaluated at given X and Y

#

Also there is some lib (i forgot the name) that has a "magic" var (iirc it is stored under a lambda or phi symbol name), and var+1 becomes lambda x: x+1

#

So, you can do whatever you want with operators as long as you and your users understand what's happening
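
A minimal sketch of the "X+Y becomes an expression object" idea. The `Var` and `Sum` classes and the `evaluate` method are made up for illustration; this is not the z3 or sympy API:

```py
class Var:
    """A named placeholder; hypothetical, for illustration only."""

    def __init__(self, name):
        self.name = name

    def __add__(self, other):
        # Build an expression node instead of computing a number.
        return Sum(self, other)

    def evaluate(self, **env):
        return env[self.name]

    def __repr__(self):
        return self.name


class Sum:
    """An unevaluated addition of two sub-expressions."""

    def __init__(self, left, right):
        self.left = left
        self.right = right

    def __add__(self, other):
        return Sum(self, other)

    def evaluate(self, **env):
        return self.left.evaluate(**env) + self.right.evaluate(**env)

    def __repr__(self):
        return f"({self.left!r} + {self.right!r})"


X, Y = Var("X"), Var("Y")
expr = X + Y                      # no arithmetic happens here
value = expr.evaluate(X=2, Y=3)   # arithmetic happens only now
```

Here `+` still reads like addition, but what it returns is a description of an addition.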

sacred yew Nov 29, 2022, 2:33 PM

#

sympy?

tall surge Nov 29, 2022, 7:24 PM

#

For the dunder methods __getattr__, __setattr__, and __delattr__, there's a difference between them: __getattr__ is called only if looking up the attribute in the object's dictionary fails, but __setattr__ and __delattr__ are called regardless of whether the attribute is present in the object's dictionary. Why is __getattr__ handled differently from __setattr__ and __delattr__, rather than all three being handled the same way?

feral island Nov 29, 2022, 7:25 PM

#

tall surge For the dunder methods `__getattr__`, `__setattr__`, and `__delattr__` a differe...

__getattribute__ does get called for all attributes

grave jolt Nov 29, 2022, 7:25 PM

#

I guess this is a naming issue then 😄

#

__setattribute__ when

feral island Nov 29, 2022, 7:26 PM

#

I think the default __getattr__ behavior is useful because you often want it as a fallback for attributes that aren't explicitly defined, while attributes that are defined normally can just use the normal system

#

Yeah the naming isn't great, probably a historical accident

tall surge Nov 29, 2022, 7:28 PM

#

thank you!

radiant garden Nov 29, 2022, 7:50 PM

#

feral island Yeah the naming isn't great, probably a historical accident

mixing up getattr, getattribute, getitem and get

quick snow Nov 29, 2022, 8:16 PM

#

I like this asymmetry. When you want to customize item access, you define __setitem__/__getitem__. When you want to customize attribute access you define __setattr__/__getattr__, almost always. When you define __getattribute__, you have to be extremely careful, you almost never want it.

grave jolt Nov 29, 2022, 8:35 PM

#

get and set are extremely ambiguous and overused words

#

well, in this context it's probably appropriate

static bluff Nov 29, 2022, 10:55 PM

#

Question about Python dictionaries

#

I'm learning about hashing in my data structures class

#

I just learned about how, as the load factor of the hash table approaches 1, the time to find an unoccupied slot (or the time to perform an unsuccessful search) approaches linear time

#

Generally speaking. But I've heard for most of my python-using life that dictionaries are lightning fast, at least in Python terms, and that dictionary operations are in more or less constant time

#

How does Python handle this? Amortized resizing of the table?

feral island Nov 29, 2022, 10:58 PM

#

Python automatically resizes dictionaries as they grow yes

#

Not too familiar with the details, but that should keep access times amortized constant

#

It is possible to get bad behavior if you have many keys that happen to hash to the same bucket
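
A sketch of that worst case, using a made-up key class whose instances all share one hash value. The dict stays correct, but every insert has to compare against the keys already in the way:

```py
class Colliding:
    eq_calls = 0  # class-level counter of __eq__ invocations

    def __init__(self, value):
        self.value = value

    def __hash__(self):
        return 1  # every instance follows the same probe sequence

    def __eq__(self, other):
        Colliding.eq_calls += 1
        return isinstance(other, Colliding) and self.value == other.value


table = {Colliding(i): i for i in range(200)}
found = table[Colliding(150)]  # still works, just slowly
comparisons = Colliding.eq_calls
```

With 200 keys the comparison count is in the thousands, where well-distributed hashes would need only a handful.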

static bluff Nov 29, 2022, 10:59 PM

#

XD There's no winning, is there?

feral island Nov 29, 2022, 11:00 PM

#

It's fairly unlikely in practice. String hashes are randomized to avoid DoS attacks where many keys map to the same bucket

#

It's probably still possible with ints (which have very predictable hashes) but that's rarely relevant in practice

feral cedar Nov 29, 2022, 11:01 PM

#

i think they get resized by a factor of 9/8 when they get 2/3 full or something like that

#

To avoid slowing down lookups on a near-full table, we resize the table when
it's USABLE_FRACTION (currently two-thirds) full.
load factor ^

Currently set to used*3.
how much to expand by ^

i think 9/8 is for lists
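
One rough way to watch the resizing happen, assuming CPython, where `sys.getsizeof` on a dict reflects its allocated table. The exact sizes and thresholds are implementation details and vary between versions:

```py
import sys

d = {}
sizes = []
for i in range(100):
    d[i] = None
    sizes.append(sys.getsizeof(d))

# Indices where the reported size changed, i.e. where a resize occurred.
growth_points = [i for i in range(1, len(sizes)) if sizes[i] != sizes[i - 1]]
print(growth_points)
```

The size stays flat for long stretches and jumps occasionally, which is why inserts are constant time on average.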

tacit hawk Nov 29, 2022, 11:11 PM

#

Is the hash() of int and floats constant? If yes is it stable or just an implementation detail?

feral cedar Nov 29, 2022, 11:12 PM

#

it's an implementation detail. the only thing that must be satisfied is that equal ints/floats have the same hash

spark magnet Nov 29, 2022, 11:46 PM

#

tacit hawk Is the hash() of int and floats constant? If yes is it stable or just an impleme...

is this just for curiosity?

halcyon trail Nov 29, 2022, 11:51 PM

#

Ints hash to themselves

#

Floats have to be compatible with that, I think

#

Although as a general rule of thumb you just don't want to use floats near hash tables

feral cedar Nov 29, 2022, 11:52 PM

#

^ integral floats hash to an int equal to themselves, but non-integral floats hash to...something

halcyon trail Nov 29, 2022, 11:52 PM

#

Statically typed languages almost all just disallow using floats as keys or for lookup

#

(at least by default)

raven ridge Nov 29, 2022, 11:54 PM

#

halcyon trail Ints hash to themselves

!e Not all of them.
```py
print(hash(-1))
```

fallen slateBOT Nov 29, 2022, 11:54 PM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

-2

spark magnet Nov 29, 2022, 11:54 PM

#

mostly ints hash to themselves 🙂

halcyon trail Nov 29, 2022, 11:54 PM

#

Interesting

#

Is this a general thing for negatives?

spark magnet Nov 29, 2022, 11:54 PM

#

no, just -1

feral cedar Nov 29, 2022, 11:54 PM

#

no, it's a funny implementation detail of the hash function, lol

halcyon trail Nov 29, 2022, 11:54 PM

#

Please tell me you guys know why this happens

feral island Nov 29, 2022, 11:55 PM

#

it's because returning -1 indicates an error

spark magnet Nov 29, 2022, 11:55 PM

#

also, hash(i) == (i % 2**61) (I think)

halcyon trail Nov 29, 2022, 11:55 PM

#

🤦‍♂️

feral island Nov 29, 2022, 11:55 PM

#

spark magnet also, hash(i) == (i % 2**61) (I think)

with a -1 in there somewhere
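
A sketch of what CPython actually does for ints, using `sys.hash_info.modulus` rather than a hard-coded constant (it is `2**61 - 1` on typical 64-bit builds). All of this is implementation behaviour, not a language guarantee, apart from equal numbers hashing equally:

```py
import sys

M = sys.hash_info.modulus  # the "-1 in there somewhere": 2**61 - 1 on 64-bit CPython

small = hash(12345)       # small non-negative ints hash to themselves
wrapped = hash(M + 5)     # larger ints are reduced modulo M
minus_one = hash(-1)      # -1 is reserved as the C-level error sentinel
minus_two = hash(-2)      # so -1 and -2 end up sharing a hash
same = hash(1.0) == hash(1)  # equal numbers must hash equally
```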

halcyon trail Nov 29, 2022, 11:55 PM

#

Under what circumstance does hashing return an error

feral island Nov 29, 2022, 11:56 PM

#

def __hash__(self): 1/0

raven ridge Nov 29, 2022, 11:56 PM

#

https://docs.python.org/3/c-api/typeobj.html#c.PyTypeObject.tp_hash

The value -1 should not be returned as a normal return value; when an error occurs during the computation of the hash value, the function should set an exception and return -1.

halcyon trail Nov 29, 2022, 11:56 PM

#

That doesn't answer my question though

rose schooner Nov 29, 2022, 11:57 PM

#

raven ridge https://docs.python.org/3/c-api/typeobj.html#c.PyTypeObject.tp_hash > The value ...

it's guaranteed isn't it
```py
>>> class A:
...     def __hash__(self):
...         return -1
...
>>> a = A()
>>> hash(a)
-2
```

raven ridge Nov 29, 2022, 11:57 PM

#

I'd have to check the data model docs, but I doubt that's guaranteed...

halcyon trail Nov 29, 2022, 11:58 PM

#

If the user hash throws, then I suppose you can just propagate it. I don't see any legitimate reason for it to throw though, so I'm surprised that they reserved a sentinel for this

rose schooner Nov 29, 2022, 11:58 PM

#

rose schooner it's guaranteed isn't it ```py >>> class A: ... def __hash__(self): ... ...

maybe only for user-created classes

halcyon trail Nov 29, 2022, 11:58 PM

#

I've never seen hashing reserve an error channel in another language, I think

feral island Nov 29, 2022, 11:58 PM

#

halcyon trail If the user hash throws.then I suppose you can just propagate it. I don't see an...

__hash__ can be Python code and Python code can always throw

halcyon trail Nov 29, 2022, 11:58 PM

#

Yes, it can, but you can also propagate that exception

raven ridge Nov 29, 2022, 11:59 PM

#

rose schooner it's guaranteed isn't it ```py >>> class A: ... def __hash__(self): ... ...

there's no note about -1 in https://docs.python.org/3/reference/datamodel.html#object.__hash__ - so it doesn't seem to be guaranteed.

feral island Nov 29, 2022, 11:59 PM

#

halcyon trail Yes, it can, but you can also propagate that exception

the CPython C API generally uses sentinel return values to indicate that an error occurred

halcyon trail Nov 29, 2022, 11:59 PM

#

The point is that I don't see why it would be a use case to care about, so I don't see why to do it any favors

raven ridge Nov 29, 2022, 11:59 PM

#

efficiency

halcyon trail Nov 29, 2022, 11:59 PM

#

Efficiency in the error path shouldn't be a consideration here

feral island Nov 29, 2022, 11:59 PM

#

otherwise you'd have to call PyErr_Occurred() after every call to tp_hash

#

so the non-error path would be slow

halcyon trail Nov 30, 2022, 12:00 AM

#

Is it more efficient in the happy path?

raven ridge Nov 30, 2022, 12:00 AM

#

halcyon trail Efficiency in the error path shouldn't be a consideration here

no, efficiency in the non-error path.

halcyon trail Nov 30, 2022, 12:00 AM

#

Ugh

#

That makes me sad

raven ridge Nov 30, 2022, 12:00 AM

#

why? It's an implementation detail

feral island Nov 30, 2022, 12:00 AM

#

and it doesn't really affect you if you're working at the Python level

#

unless you go out of your way to check the value of hash(-1) or something

rose schooner Nov 30, 2022, 12:01 AM

#

raven ridge there's no note about `-1` in https://docs.python.org/3/reference/datamodel.html...

seems like the slot for __hash__ turns -1 into -2 if there's no error https://github.com/python/cpython/blob/main/Objects/typeobject.c#L8125-L8141

GitHub

cpython/typeobject.c at main · python/cpython

The Python programming language. Contribute to python/cpython development by creating an account on GitHub.

halcyon trail Nov 30, 2022, 12:01 AM

#

raven ridge why? It's an implementation detail

I can't be sad for things happening in the implementation?

raven ridge Nov 30, 2022, 12:01 AM

#

Python code thinks that __hash__ returns a Python int. For efficiency, the actual C code doesn't use Python ints, it uses int64_t's. So there's always got to be some conversion happening to get from one to the other.

halcyon trail Nov 30, 2022, 12:02 AM

#

feral island and it doesn't really affect you if you're working at the Python level

If my hash returns -1 though, how does that work? Probably I don't fully understand where the sentinel is checked

feral island Nov 30, 2022, 12:02 AM

#

check the code cereal just linked

raven ridge Nov 30, 2022, 12:02 AM

#

rose schooner seems like the slot for `__hash__` turns -1 into -2 if there's no error https://...

that happens here

#

if your __hash__ returns -1 as a Python int, the code that turns that into an int64_t (aka Py_hash_t) returns -2

feral island Nov 30, 2022, 12:03 AM

#

wait no that's the other direction, https://github.com/python/cpython/blob/main/Objects/typeobject.c#L7500 is for when a C tp_hash throws

fallen slateBOT Nov 30, 2022, 12:03 AM

#

Objects/typeobject.c line 7500

```c
wrap_hashfunc(PyObject *self, PyObject *args, void *wrapped)
```

raven ridge Nov 30, 2022, 12:05 AM

#

I doubt that microoptimization saves very much, honestly - but it lets the error handling path be
```c
if (hash_code == -1) {
    // propagate exception
}
```
instead of
```c
if (hash_code == -1 && error_occurred()) {
    // propagate exception
}
```

#

allowing -1 as a hash that isn't just a sentinel for an error would mean that everything that hashes to -1 needs an extra check to see if -1 is or isn't an error indicator.

#

I don't think it improves the performance of things that hash to the other 2^64-1 values, though - so maybe in practice it doesn't save too much.

halcyon trail Nov 30, 2022, 12:10 AM

#

I wonder if any other mainstream GC language has something like this, and I just didn't know about it

raven ridge Nov 30, 2022, 12:10 AM

#

my gut feeling is that this is probably a microoptimization that makes pretty little difference given today's branch predictors.

#

regardless, it's an implementation detail that's invisible to everyone except for those working at the C API level, so 🤷

halcyon trail Nov 30, 2022, 12:11 AM

#

If there was no sentinel then error_occured would always have to be called right? That's what was discussed previously

raven ridge Nov 30, 2022, 12:11 AM

#

right

halcyon trail Nov 30, 2022, 12:12 AM

#

error_occured is expensive, it was alleged

raven ridge Nov 30, 2022, 12:12 AM

#

but some functions in the C API do return -1 as both a sentinel and a real return value - if they return -1 you need to check the error-occurred function as well

halcyon trail Nov 30, 2022, 12:12 AM

#

So the branch predictor wouldn't really save you

#

Yes, this sort of thing makes me wince

rose schooner Nov 30, 2022, 12:13 AM

#

raven ridge my gut feeling is that this is probably a microoptimization that makes pretty li...

shouldn't it get optimized to the equivalent of
```c
if (hash_code != -1 || !PyErr_Occurred()) {
    goto resume_path;
}
/* exception */
resume_path:
/* continue */
```
or does that require __builtin_expect

raven ridge Nov 30, 2022, 12:13 AM

#

halcyon trail So the branch predictor wouldn't really save you

not if you unconditionally called the error-occurred function, only if you conditionally called it to confirm or deny a sentinel

halcyon trail Nov 30, 2022, 12:13 AM

#

I was comparing to not using a sentinel at all

raven ridge Nov 30, 2022, 12:14 AM

#

rose schooner shouldn't it get optimized to the equivalent of ```c if (hash_code != -1 || !P...

that's literally what the code would be - we can't talk about what it would be optimized to by showing C code

halcyon trail Nov 30, 2022, 12:14 AM

#

I still don't think I understand though, Jelle s example doesn't return -1, it just throws

#

So there must be some C code that catches that exception, and then returns -1

raven ridge Nov 30, 2022, 12:15 AM

#

yes

#

and then other C code that sees that -1 and propagates the exception

rose schooner Nov 30, 2022, 12:16 AM

#

halcyon trail error_occured is expensive, it was alleged

seems like it's just a few function calls and attribute accesses though

halcyon trail Nov 30, 2022, 12:16 AM

#

So this is written this way based on being old and/or wanting to be efficient on older machines

#

On x86 64 returning a two word trivial object has been basically free for ages

#

Or 32 bit, I suppose

raven ridge Nov 30, 2022, 12:17 AM

#

rose schooner shouldn't it get optimized to the equivalent of ```c if (hash_code != -1 || !P...

if we had the sentinel but you needed to confirm it by calling PyErr_Occurred, then in practice the branch predictor would assume that the return value wasn't -1, and a misprediction would mean it would need to flush the pipeline and then call PyErr_Occurred to decide whether to go down the branch it had originally assumed it would go down or the exception handling branch

raven ridge Nov 30, 2022, 12:18 AM

#

halcyon trail So this is written this way based on being old and/or wanting to be efficient on...

that's my guess, yeah. I doubt this makes much difference on modern CPUs.

halcyon trail Nov 30, 2022, 12:19 AM

#

Yeah. But anyhow you can see why I wince, having a sentinel that overlaps the legit range for something never feels good

#

It's like the C functions that parse strings to int

#

0 to indicate an error

#

An error sentinel which is probably also the most common legitimate output 😛

raven ridge Nov 30, 2022, 12:23 AM

#

rose schooner shouldn't it get optimized to the equivalent of ```c if (hash_code != -1 || !P...

we can't really talk about optimizations like branch prediction by showing C code, because the magic of branch prediction is things happening speculatively and then potentially needing to be undone, and we can't illustrate that using C code.

feral island Nov 30, 2022, 12:23 AM

#

yes, it's an ugly area of the C API. There's been talk of a new C API that wouldn't use this sort of sentinel, e.g. https://github.com/markshannon/New-C-API-for-Python/blob/main/DesignRules.md#all-functions-have-an-error-out-parameter-or-return-the-error

halcyon trail Nov 30, 2022, 12:27 AM

#

I guess that there's like zero chance of moving to C++?

#

For implementation details

raven ridge Nov 30, 2022, 12:27 AM

#

they just moved to C99 😄

#

like, this year.

halcyon trail Nov 30, 2022, 12:27 AM

#

That makes sense to me

#

I mean when did msvc start supporting c99

#

Like 6 months ago 😛

raven ridge Nov 30, 2022, 12:28 AM

#

yeah. not long ago.

halcyon trail Nov 30, 2022, 12:34 AM

#

Gcc did the C to C++ move continuously, so it's not unthinkable.

#

But definitely not easy

tacit hawk Nov 30, 2022, 12:35 AM

#

spark magnet is this just for curiosity?

I thought about using hash of tuples as a key for faster lookup of records stored on a database, this requires the hash to be constant for all python startups

raven ridge Nov 30, 2022, 12:36 AM

#

definitely don't do that.

spark magnet Nov 30, 2022, 12:36 AM

#

tacit hawk I thought about using hash of tuples as a key for faster lookup of records store...

right, you don't want hash() for that

spark magnet Nov 30, 2022, 12:37 AM

#

tacit hawk I thought about using hash of tuples as a key for faster lookup of records store...

don't try to outsmart a database. it's good at what it does.
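
If a key that is stable across interpreter runs is genuinely needed, one common alternative to `hash()` is a digest from `hashlib`. A minimal sketch, assuming the record is a tuple of JSON-serialisable values; the `stable_key` helper is made up for illustration:

```py
import hashlib
import json


def stable_key(record):
    """Return a hex digest that is the same in every Python process.

    Hypothetical helper: serialises the tuple as JSON, then hashes the bytes.
    """
    payload = json.dumps(list(record), separators=(",", ":")).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


key_a = stable_key(("alice", 42))
key_b = stable_key(("alice", 42))
key_c = stable_key(("bob", 42))
```

Unlike `hash()`, this does not depend on `PYTHONHASHSEED` or the interpreter version. In most cases the database's own indexes make it unnecessary, though.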

halcyon trail Nov 30, 2022, 12:38 AM

#

If you outsmart the database you become the database

#

It's like beating up the bouncer

gray galleon Nov 30, 2022, 6:30 AM

#

is there a pep for symbol type in python?

flat gazelle Nov 30, 2022, 6:36 AM

#

what would that entail? If you mean symbols in the style of erlang et al, thats pretty much the sentinel objects PEP.

gray galleon Nov 30, 2022, 6:40 AM

#

i mean interned names like in lisp and ruby
wait python already has interned strings

#

it doesn’t look like symbols but nice

flat gazelle Nov 30, 2022, 7:02 AM

#

yeah, python just uses strings in those places. Thanks to the interning, the comparison is fast enough even without the identity, though unlike symbols they aren't namespaced. But for sentinel-style stuff, you use None or object()
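
A small sketch of explicit interning with `sys.intern`. The strings are built at runtime so they start out as separate objects in practice, though whether they do is an implementation detail:

```py
import sys

# Build two equal strings at runtime rather than writing literals.
a = "".join(["sym", "bol"])
b = "".join(["sym", "bol"])

equal = a == b  # True: same contents

# Interning maps equal strings to one shared object,
# so an identity check is enough afterwards.
ia = sys.intern(a)
ib = sys.intern(b)
identical = ia is ib
```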

long isle Nov 30, 2022, 12:30 PM

#

?

grave jolt Nov 30, 2022, 12:44 PM

#

long isle ?

If you have a question, please see #❓｜how-to-get-help

unkempt rock Nov 30, 2022, 3:19 PM

#

!pep 638

fallen slateBOT Nov 30, 2022, 3:19 PM

#

**PEP 638 - Syntactic Macros**

Link

Status

Draft

Created

24-Sep-2020

Type

Standards Track

unkempt rock Nov 30, 2022, 3:19 PM

#

This could be interesting to see in an interpreted language

boreal umbra Nov 30, 2022, 3:29 PM

#

I thought that PEP was rejected a few years ago for fear that it would fracture the ecosystem

native flame Nov 30, 2022, 3:47 PM

#

thoughts on this? https://mail.python.org/archives/list/typing-sig@python.org/thread/Q3OBZFFEJOTALGQS47JMVCGPSVDMHZNZ/

#

i kinda hate the idea

#

both for any and callable

feral cedar Nov 30, 2022, 3:48 PM

#

that's kinda cursed tbh

quick snow Nov 30, 2022, 3:48 PM

#

Eww

#

any could be synonymous to Union instead

#

Then we could also finally have all (for hypothetical Intersection)

#

any[str, int]. all[Indexable, Sized]

radiant garden Nov 30, 2022, 3:51 PM

#

or an even worse alternative, all for Never

native flame Nov 30, 2022, 3:51 PM

#

i dislike the idea of "reusing" functions for annotations at all tbh

radiant garden Nov 30, 2022, 3:51 PM

#

if Any is any, BABAXD

quick snow Nov 30, 2022, 3:54 PM

#

I promise I will start using type hints when you can do arithmetics with them. Sequence - str, ~int, ...

radiant garden Nov 30, 2022, 3:54 PM

#

hell, might as well reuse iter for iterable

native flame Nov 30, 2022, 3:55 PM

#

map for Mapping

gray galleon Nov 30, 2022, 3:56 PM

#

radiant garden if Any is any, <:BABAXD:734139981974732881>

why use typing.Any when there is object 😎

radiant garden Nov 30, 2022, 3:56 PM

#

serious answer: because different semantics

#

real answer: because fewer characters to type

feral cedar Nov 30, 2022, 3:57 PM

#

gray galleon why use `typing.Any` when there is `object` 😎

they're different. Any means literally any type, basically turning type checking off. object means the operations all objects support (which is not that many)

native flame Nov 30, 2022, 3:57 PM

#

heres a fix error article™️ https://decorator-factory.github.io/typing-tips/faq/object-vs-any/

feral cedar Nov 30, 2022, 3:58 PM

#

@grave jolt

Whatever you have — a strnig, a number, a function, a chess piece — it's an object.
strnig 😔

gray galleon Nov 30, 2022, 3:59 PM

#

feral cedar they're different. Any means literally any type, basically turning type checking...

i thought object on annotations means object and subclasses
which includes every object

radiant garden Nov 30, 2022, 3:59 PM

#

rename TypeGuard to isinstance

grave jolt Nov 30, 2022, 4:00 PM

#

feral cedar <@461097636791844865> > Whatever you have — a strnig, a number, a function, a ...

Pull request when

feral cedar Nov 30, 2022, 4:01 PM

#

gray galleon i thought `object` on annotations means `object` and subclasses which includes e...

subclasses could be passed to a function asking for object, but you wouldn't be able to use those different methods, because not all objects that you pass would support those different methods

grave jolt Nov 30, 2022, 4:01 PM

#

gray galleon i thought `object` on annotations means `object` and subclasses which includes e...

Yeah that's what it is. So if you have a x: object, you can't do x.foo() because not every object has a foo method.
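
A sketch of the difference as a type checker would see it. The function names are made up, and the comments describe typical checker behaviour such as mypy's. At runtime both annotations are ignored:

```py
from typing import Any


def describe(x: object) -> str:
    # Only operations that every object supports are accepted here.
    # Writing x.upper() would be flagged by a type checker.
    return repr(x)


def shout(x: Any) -> str:
    # A type checker accepts any operation on Any without complaint,
    # so mistakes only show up at runtime.
    return x.upper()


ok = shout("hi")
try:
    shout(3)  # passes type checking, fails when run
    failed = False
except AttributeError:
    failed = True
```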

grave jolt Nov 30, 2022, 4:02 PM

#

native flame heres a fix error article™️ https://decorator-factory.github.io/typing-tips/faq/...

Yeah read the article, I'm very good friends with the author

gray galleon Nov 30, 2022, 4:02 PM

#

feral cedar subclasses could be passed to a function asking for `object`, but you wouldn't b...

~~just ignore linter errors~~

quick snow Nov 30, 2022, 4:03 PM

#

feral cedar subclasses could be passed to a function asking for `object`, but you wouldn't b...

Doesn't the same logic apply to Any? Genuine question.

native flame Nov 30, 2022, 4:03 PM

#

typecheckers will let you do anything with Any

#

it isnt treated like normal types

grave jolt Nov 30, 2022, 4:05 PM

#

Yeah it's special

quick snow Nov 30, 2022, 4:06 PM

#

I see

gray galleon Nov 30, 2022, 4:06 PM

#

btw how do i make recursive types
smth like this
```py
@dataclass(frozen=True)
class LinkedList:
    first: object
    rest: LinkedList # will throw an error
```

feral island Nov 30, 2022, 4:07 PM

#

gray galleon btw how do i make recursive types smth like this ```py @dataclass(frozen=True) c...

put the second "LinkedList" in quotes

gray galleon Nov 30, 2022, 4:13 PM

#

hmm

feral cedar Nov 30, 2022, 4:26 PM

#

quick snow Doesn't the same logic apply to `Any`? Genuine question.

i think in the ~lingo~ it's a "bottom type" and a "top type". it's a subtype of every type and a supertype of every type

feral island Nov 30, 2022, 4:26 PM

#

(or use from __future__ import annotations, or PEP 649 in the future)
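
A sketch of the quoted-annotation fix applied to that example. The `Optional` and the `None` default are additions here so that the list can end somewhere:

```py
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class LinkedList:
    first: object
    # The quotes defer evaluation, so the name need not exist yet.
    rest: Optional["LinkedList"] = None


cells = LinkedList(1, LinkedList(2))
```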

feral island Nov 30, 2022, 4:26 PM

#

feral cedar i think in the ~lingo~ it's a "bottom type" and a "top type". it's a subtype of ...

yes, it's both at once. Never is a bottom type, object is a top type, Any is both

halcyon trail Nov 30, 2022, 4:27 PM

#

It's pretty confusing in a way since Any is usually used as a name for the top type

#

and to me at least that's also what the name implies, verbally

feral cedar Nov 30, 2022, 4:27 PM

#

yeah i think c++ does that

halcyon trail Nov 30, 2022, 4:27 PM

#

C++ doesn't have a top type

#

it has a library type called any that can hold anything though

feral island Nov 30, 2022, 4:28 PM

#

typescript's any is like Python Any

halcyon trail Nov 30, 2022, 4:29 PM

#

are you sure? then there's a mistake on the wikipedia page

feral island Nov 30, 2022, 4:30 PM

#

https://www.typescriptlang.org/play?#code/FAMwrgdgxgLglgewgAhACgB4C5kEMICeAlMgN7DKXIYB0A7gBa4wCmAbiwE4C2LMDCACZw6+GGiIBuYAF8gA

TS Playground - An online editor for exploring TypeScript and JavaS...

The Playground lets you write TypeScript or JavaScript online in a safe and sharable way.

halcyon trail Nov 30, 2022, 4:30 PM

#

yeah you're right

#

does TS not have a proper top type?

#

unknown maybe?

#

yeah, seems like unknown is the top type

feral island Nov 30, 2022, 4:31 PM

#

hm seems to behave a little differently https://www.typescriptlang.org/play?#code/FAMwrgdgxgLglgewgAhACgB4C5kEMICeANMgTpANYQIDuEJAXjhGALYBGApgE4CUyAb2DIRyDADoaAC1wxOANx6tOMKQgAmcGvhhpeAbmGiCkmXMXdlqjVp17Do5A1OyFSlWs3aIug8AC+QA

halcyon trail Nov 30, 2022, 4:31 PM

#

those names feel backwards to me

feral island Nov 30, 2022, 4:31 PM

#

unknown gives an error when you use it, but there's no error for accessing an arbitrary method

halcyon trail Nov 30, 2022, 4:31 PM

#

Any to me feels like a known type, that could just happen to be anything
Unknown is a type we don't know anything about, and we're opting out of type checking

#

weird

#

at least in python object is a pretty typical top-type name

feral island Nov 30, 2022, 4:32 PM

#

they do call unknown the top type: https://www.typescriptlang.org/docs/handbook/release-notes/typescript-3-0.html#new-unknown-top-type

Documentation - TypeScript 3.0

TypeScript 3.0 Release Notes

halcyon trail Nov 30, 2022, 4:36 PM

#

feral island `unknown` gives an error when you use it, but there's no error for accessing an ...

yeah I mean the error is just occurring even earlier

#

Since there are no legal operations on unknown, it immediately errors when an unknown variable is mentioned, in a context where its type hasn't been narrowed in some way

#

it's not really conceptually different though

#

Doing it this way is a little "extra" strict, because x = y is always a legal operation in python and AFAIK JS

#

similarly, if you have a my_list: List[object], then my_list.append(y) is legal even if y is of type object

#

so error'ing immediately when it's mentioned is very odd. I'd have it as a warning.

#

but I can see the benefits practically

dusk comet Nov 30, 2022, 4:59 PM

#

native flame map for Mapping

compile for types.CodeType
isinstance for TypeGuard
getattr[cls, 'attr'] for declaring same type that cls.attr is (i guess there is a PEP or some proposal about that feature)
globals['varname'] - same as getattr, but using global vars instead of attrs of some type
locals['varname'] - same as globals, but using local vars
sorted for Iterable[T] where T is SupportsLT
sum for Iterable[T] where T is SupportsAdd
vars for dict[str, any] (commonly used for arbitrary namespaces, globals() and json dicts have this type, for example)
hash for Hashable
iter for Iterable
~~bool for Boolable~~
aiter for AsyncIterable
len for Sized
reversed for Reversible
open for SupportsFSPath
abs for SupportsAbs
round for SupportsRound
dir for SupportsDir
divmod for SupportsDivMod
format for SupportsFormat
pow for SupportsPow

#

x: property[int] for read-only var or attr (UPD: i realized it is almost equivalent to x: Final[int])

boreal umbra Nov 30, 2022, 6:57 PM

#

native flame i kinda hate the idea

I don't like it, either. any and typing.Any represent two different concepts that aren't even guaranteed to share the same word in every human language. And I think Python should limit its already preferential status for English.

#

I actually didn't know about the callable builtin. I wish it had been named is_callable.

feral cedar Nov 30, 2022, 7:01 PM

#

why is it even a built-in

halcyon trail Nov 30, 2022, 7:09 PM

#

it's a bit of a weird choice because Any is not an annotation you want to be using that often

#

I almost never use it. Usually it enters into it implicitly, because your code depends on first or third party code that was written without annotations, so Any is the "default" when annotations aren't present

grave jolt Nov 30, 2022, 7:20 PM

#

dusk comet `compile` for `types.CodeType` `isinstance` for `TypeGuard` `getattr[cls, 'attr'...

print for wx.Printer brainmon

#

and if it's not installed, it prints the mypy output

halcyon trail Nov 30, 2022, 7:21 PM

#

the concept of built-ins is a bit strange to me; or maybe I'm just attaching more significance to it than it deserves, because of the name

grave jolt Nov 30, 2022, 7:21 PM

#

on a real printer

#

hmm

halcyon trail Nov 30, 2022, 7:21 PM

#

i prefer to just think of functions/classes that are imported by default

grave jolt Nov 30, 2022, 7:22 PM

#

yeah

halcyon trail Nov 30, 2022, 7:22 PM

#

seems to be how it works in Kotlin, Rust

#

is there any meaningful distinction though between that, and a python "built in" ?

grave jolt Nov 30, 2022, 7:22 PM

#

Haskell has a "prelude" which is basically a library star-imported by default. IIRC you can even replace it with a different one

halcyon trail Nov 30, 2022, 7:23 PM

#

i don't know if it's replaceable in kotlin or rust

grave jolt Nov 30, 2022, 7:23 PM

#

I don't think Rust even has "built-ins"

#

ah, some macros

#

yeah it does, I misremember

#

https://doc.rust-lang.org/std/prelude/index.html

halcyon trail Nov 30, 2022, 7:24 PM

#

it does

#

fn main() {
    let x: Vec<_> = vec!(1,2,3);
}

#

a valid rust program

grave jolt Nov 30, 2022, 7:25 PM

#

yeah yeah

#

it was a brain fart

feral island Nov 30, 2022, 7:25 PM

#

halcyon trail is there any meaningful distinction though between that, and a python "built in"...

that's basically how Python works, there's a builtins module that is basically part of the global scope by default

grave jolt Nov 30, 2022, 7:25 PM

#

yeah I don't think there's something sacred about builtins

halcyon trail Nov 30, 2022, 7:25 PM

#

in kotlin/rust you can actually have far bigger preludes, because of the ability to scope things to classes, without them being members

#

like, all of itertools for example, is iirc in the preludes of both Rust and Kotlin

#

in python this would be extremely annoying and people would rightfully complain

grave jolt Nov 30, 2022, 7:26 PM

#

hmmmm

#

well, it's kinda different

#

unless I misunderstand you

halcyon trail Nov 30, 2022, 7:26 PM

#

it's different because it's member-scoped

grave jolt Nov 30, 2022, 7:27 PM

#

yeah

halcyon trail Nov 30, 2022, 7:27 PM

#

yeah, that's what I said

grave jolt Nov 30, 2022, 7:27 PM

#

oh, you mean traits and extension methods

halcyon trail Nov 30, 2022, 7:27 PM

#

yes

#

you can have all of itertools in Kotlin or Rust, as part of the prelude/"builtins", because they're just available via member functions syntax that are only available on suitable types

#

groupBy in Kotlin is an extension on Iterable<T>, in python groupby is just a free function

#

so with the latter, you'd have issues with shadowing and such if you define any function, class, or variable called "groupby"

#

I have lost track of how many times I've tried naming a variable "input" in python

#

only to get yelled at by my IDE, sigh, and change it

grave jolt Nov 30, 2022, 7:31 PM

#

remove the IDE check brainmon

#

I have used id liberally

#

and filter

halcyon trail Nov 30, 2022, 7:32 PM

#

I mean yeah 99% of the time it won't matter but I have actually, a couple of times, hit a really confusing bug caused by shadowing

#

I think it's just best practice to not use those names

#

it starts off okay, then other programmers see that id is the idiomatic name for some variable that comes up in your business logic, and they start using it

#

soon there are local variables called id everywhere and then eventually something bad happens

#

it's annoying but there is no real technical justification not to just pick a different name

#

student_id or foo_id or whatever

grave jolt Nov 30, 2022, 7:34 PM

#

my usual fuckup scenario is when I format that id somewhere into a string

#

and then get something like Foo(id=<built-in function id>)
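
#

A rough reproduction of that scenario, assuming a hypothetical `Foo` dataclass and helper names invented for this sketch. The bare name `id` never raises `NameError`, because the lookup falls through to the builtin:

```python
from dataclasses import dataclass

@dataclass
class Foo:
    id: object

def describe_buggy(record_id):
    # Bug: the argument is called record_id, but the body uses the bare
    # name "id", which quietly resolves to the builtin function.
    return repr(Foo(id=id))

def describe_fixed(record_id):
    # Using the actual argument gives the intended output.
    return repr(Foo(id=record_id))

print(describe_buggy(42))  # Foo(id=<built-in function id>)
print(describe_fixed(42))  # Foo(id=42)
```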

halcyon trail Nov 30, 2022, 7:34 PM

#

nice

grave jolt Nov 30, 2022, 7:35 PM

#

another solution might be banning certain builtins altogether in a linter, like id

#

that would also catch such mistakes

halcyon trail Nov 30, 2022, 7:38 PM

#

id would also be a good example of something that could just be an extension.
I actually never thought too much of that benefit of extensions; not polluting the global namespace. kinda cool.

grave jolt Nov 30, 2022, 7:45 PM

#

I think id should've just been part of sys tbh

#

I don't think I've actually used it besides the REPL

#

I guess it's useful if you want to include the id in the __repr__, like the default __repr__ but with some extra stuff.

#

But that's a very niche use case, hence it could be moved to sys
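
#

A small sketch of that `__repr__` use case, assuming a hypothetical `Session` class. It mimics the default `<... object at 0x...>` form but adds one field:

```python
class Session:
    def __init__(self, user):
        self.user = user

    def __repr__(self):
        # id(self) formatted as hex, similar to the default object repr.
        return f"<{type(self).__name__} user={self.user!r} at {id(self):#x}>"

s = Session("ann")
print(repr(s))  # something like <Session user='ann' at 0x7f...>
```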

halcyon trail Nov 30, 2022, 8:19 PM

#

yeah I agree

#

input is probably my personal worst offender. very useful variable name and I almost never use the function.

raven ridge Nov 30, 2022, 10:47 PM

#

halcyon trail is there any meaningful distinction though between that, and a python "built in"...

Python uses "built in" to refer to two different things in different contexts. It either refers to a function or type implemented in native code rather than Python bytecode, or it refers to an attribute of the builtins module, which is implicitly in scope for all lookups

#

!e print(type("".split))

fallen slateBOT Nov 30, 2022, 10:48 PM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

<class 'builtin_function_or_method'>

raven ridge Nov 30, 2022, 10:48 PM

#

that's "builtin" in the first sense
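
#

For contrast, a short sketch that checks both senses side by side. The first check relies on CPython behavior (str methods being implemented in native code), so treat it as an assumption about the interpreter rather than a language guarantee:

```python
import builtins
import types

def py_func():
    pass

# Sense 1: implemented in native code (true for str.split on CPython).
native_split = isinstance("".split, types.BuiltinFunctionType)
native_py_func = isinstance(py_func, types.BuiltinFunctionType)

# Sense 2: an attribute of the builtins module, implicitly in scope everywhere.
len_in_builtins = hasattr(builtins, "len")
split_in_builtins = hasattr(builtins, "split")

print(native_split, native_py_func)        # True False on CPython
print(len_in_builtins, split_in_builtins)  # True False
```

So `"".split` is "built in" in the first sense only, while `len` is "built in" in both.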

#

almost as annoying as the fact that "package" means at least 2 different things in Python

grave jolt Nov 30, 2022, 10:49 PM

#

coroutine 😔

raven ridge Nov 30, 2022, 10:49 PM

#

yeah, that's another one.

#

it's moderately interesting that you can monkeypatch in new (or replacement) builtins in Python

#

!e ```py
import builtins
builtins.hello = lambda a: print(f"Hello, {a}!")
hello("World")
```

fallen slateBOT Nov 30, 2022, 10:52 PM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

Hello, World!

raven ridge Nov 30, 2022, 10:54 PM

#

Sick of your linter complaining that you've shadowed builtins.id? Just del it! hyperlemon

halcyon trail Nov 30, 2022, 11:02 PM

#

Idk about rust but in Kotlin you can shadow the prelude

#

No warning either

#

It's just a lot more explicit since it can only be done by an import and not by a local variable I don't believe

raven ridge Nov 30, 2022, 11:33 PM

#

If you modify the builtins module in Python, all modules will see your change, since all modules share a reference to the same builtins module
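
#

A sketch of that sharing, using a throwaway module object instead of a real file on disk (the module name `other` and the `hello` builtin are made up for illustration). The function defined in `other` never imports `hello`, yet it sees the patched builtin:

```python
import builtins
import types

# Build a separate module whose code refers to a name it never defines.
other = types.ModuleType("other")
exec("def greet():\n    return hello('World')", other.__dict__)

# Patch the shared builtins module, call into the other module, then clean up.
builtins.hello = lambda name: f"Hello, {name}!"
try:
    result = other.greet()
finally:
    del builtins.hello

print(result)  # Hello, World!
```

Once `builtins.hello` is deleted again, calling `other.greet()` raises `NameError`, which is the same sharing seen from the other direction.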

sacred yew Dec 1, 2022, 2:24 AM

#

grave jolt coroutine 😔

wait whats the 2nd meaning

#

aside from the async function one

feral cedar Dec 1, 2022, 2:25 AM

#

the function itself is called a coroutine, and the thing such a function returns is also called a coroutine

sacred yew Dec 1, 2022, 2:25 AM

#

ah

feral cedar Dec 1, 2022, 2:26 AM

#

same with generators. the "correct" name would be "coroutine function" and "generator function" but no one actually says that

sacred yew Dec 1, 2022, 2:26 AM

#

ackshully its "generator" and "generator iterator" (src: https://docs.python.org/3/glossary.html#index-19)
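
#

The function-versus-returned-object distinction can be checked with `inspect`. A minimal sketch, with `fetch` and `count` as placeholder names:

```python
import inspect

async def fetch():
    return 1

def count():
    yield 1

coro = fetch()  # calling the async function returns a coroutine object
gen = count()   # calling the generator function returns a generator iterator

print(inspect.iscoroutinefunction(fetch), inspect.iscoroutine(fetch))  # True False
print(inspect.iscoroutinefunction(coro), inspect.iscoroutine(coro))    # False True
print(inspect.isgeneratorfunction(count), inspect.isgenerator(count))  # True False
print(inspect.isgeneratorfunction(gen), inspect.isgenerator(gen))      # False True

coro.close()  # avoid the "coroutine was never awaited" warning
```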

#internals-and-peps

python pseudocode; added .last_hashed_size field