glass mulch Jul 23, 2024, 6:59 PM

#

Not at all, IMO. Probably not worth the change.

uneven raptor Jul 24, 2024, 3:29 AM

#

CC @grave jolt, since we discussed this earlier

what would be the best way to hash an "arbitrary object" that has the same value between programs? i was told that using hash() is a bad idea, because the hash is not guaranteed to be the same (which is true, but it's not exactly my problem -- as in, if a user implements an odd __hash__, there's nothing i can really do about that).

i ended up writing a function that looks like this:

def _hash(self, value: Hashable, size: int) -> tuple[int, int]:
    if isinstance(value, str):
        # String hashes are not retained between programs
        hashed_str = int(
            hashlib.sha1(value.encode("utf-8")).hexdigest(), 16
        )
        return hashed_str, hashed_str % size

    hashed = hash(value)
    index = (hashed & 0x7FFFFFFF) % size
    return hashed, index

is there something inherently wrong with this?

#

i'm opposed to writing my own stable_hash protocol that's guaranteed to always be the same, because what's the point if there's already an existing __hash__? is it really that common for objects to have different hashes between interpreters?

feral island Jul 24, 2024, 3:35 AM

#

uneven raptor i'm opposed to writing my own `stable_hash` protocol that's guaranteed to always...

Hashes for some objects are randomized, meaning they will be different from one run of the interpreter to the next.

$ python -c 'print(hash(("x",)))'
4734399606021899668
$ python -c 'print(hash(("x",)))'
7577783400188628811
$ python -c 'print(hash(("x",)))'
8033416648646465101

#

Your comment indicates you're aware of that for strings, but checking for strings only at the top level isn't enough, because many objects compute their hash by combining the hash values of the objects they contain
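A rough way to see this for yourself is to run the same expression in fresh interpreters started with different `PYTHONHASHSEED` values. This is only a sketch; the helper name `hash_in_fresh_interpreter` is made up for illustration.

```python
import os
import subprocess
import sys

def hash_in_fresh_interpreter(expr: str, seed: str) -> int:
    """Evaluate hash(<expr>) in a new interpreter started with the given PYTHONHASHSEED."""
    env = {**os.environ, "PYTHONHASHSEED": seed}
    out = subprocess.run(
        [sys.executable, "-c", f"print(hash({expr}))"],
        env=env, capture_output=True, text=True, check=True,
    )
    return int(out.stdout)

# A tuple of ints does not depend on the seed...
print(hash_in_fresh_interpreter("(1, 2)", "1") == hash_in_fresh_interpreter("(1, 2)", "2"))
# ...but a tuple containing a str inherits the str's seeded hash.
print(hash_in_fresh_interpreter("('x',)", "1") == hash_in_fresh_interpreter("('x',)", "2"))
```

The str check at the top level of `_hash` never sees the `"x"` inside the tuple, which is why the second comparison is the problematic one.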

uneven raptor Jul 24, 2024, 3:41 AM

#

feral island Your comment indicates you're aware of that for strings, but checking for string...

ok, so i could special case some collections to deal with the strings inside them, is that reasonable?

feral island Jul 24, 2024, 3:41 AM

#

no

#

what if it's a dataclass

#

you essentially end up having to know the internal structure of every object you're trying to hash

uneven raptor Jul 24, 2024, 3:42 AM

#

feral island what if it's a dataclass

i’m unaware of how they hash, are they randomized?

feral island Jul 24, 2024, 3:42 AM

#

they combine the hashes of the values in the dataclass

#

also, the hashes of some objects (e.g., types) are based on the memory address, so they also won't stay the same across runs

uneven raptor Jul 24, 2024, 3:43 AM

#

ah. i thought about setting PYTHONHASHSEED, but that only works on interpreter startup

uneven raptor Jul 24, 2024, 3:44 AM

#

feral island also, the hashes of some objects (e.g., types) are based on the memory address, ...

FWIW, i’m technically not doing this for “any object,” just types that are serializable by pydantic (which doesn’t include types, i think)

raven ridge Jul 24, 2024, 4:55 AM

#

why not do something like serialize to JSON and then take the md5 of the JSON?

#

that's not particularly fast, but it is stable

uneven raptor Jul 24, 2024, 5:53 AM

#

raven ridge why not do something like serialize to JSON and then take the md5 of the JSON?

that’s probably what i’ll end up going with, it’s just nice to support user-defined __hash__ methods

quick snow Jul 24, 2024, 9:50 AM

#

Forces you to use a session.

#

(httpx has a sync interface which doesn't, so for quick interactive requests you type less)

jade raven Jul 24, 2024, 10:11 AM

#

is there a "simple" way of implementing ast.unparse in python <= 3.8?

#

!d ast.unparse

rose schooner Jul 24, 2024, 10:32 AM

#

jade raven is there a "simple" way of implementing `ast.unparse` in python <= 3.8?

PyPI's astunparse?

#

!pip astunparse

fallen slateBOT Jul 24, 2024, 10:32 AM

#

astunparse v1.6.3

An AST unparser for Python

Released on December 22, 2019.

halcyon trail Jul 24, 2024, 2:10 PM

#

uneven raptor that’s probably what i’ll end up going with, it’s just nice to support user-defin...

Be aware if you do this, you have to be very careful as it's very easy for python objects that are "equal" to serialize to different json

#

serialization/deserialization protocols aren't generally interested in making a guarantee that "equal objects serialize to the exact same thing". They're interested in the guarantee that the value is preserved when it round trips.

#

Sets don't care about order at all or guarantee anything, dicts have some guarantees around ordering but they are still "equal" if their ordering is different.
when you turn these things into json, you'll potentially get differently-ordered json objects/arrays, and thus different md5

raven ridge Jul 24, 2024, 2:22 PM

#

Mm, true.

uneven raptor Jul 24, 2024, 2:47 PM

#

that’s a very good point. maybe i could do some extra check to force the JSON to have a certain order?

halcyon trail Jul 24, 2024, 3:07 PM

#

uneven raptor that’s a very good point. maybe i could do some extra check to force the JSON to...

well....

#

it gets a little bit tricky

#

for dicts, the keys have to be strings, so you could force them to be in key sorted order - that's relatively easy

#

and having the same key twice is pretty questionable anyway so you don't really have to worry about ties

#

the issue is lists

#

to sort lists you'll need to define a sorting order over json values, which is... annoying

#

well.... but then, you won't want to always sort the lists. just sometimes.

#

so it gets pretty messy. you'll basically need to define your own json serialization.

feral island Jul 24, 2024, 3:10 PM

#

json.dumps() has a sort_keys=True option, for what it's worth

halcyon trail Jul 24, 2024, 3:10 PM

#

e.g. you would want lists to simply go into a json array in the same order - but for sets you would need to perform sorting

feral island Jul 24, 2024, 3:10 PM

#

Agree that that doesn't fix all problems though

halcyon trail Jul 24, 2024, 3:10 PM

#

yeah, sort_keys will fix the dict issue

#

(which is the easier issue)

#

the real headache is the json arrays
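one possible shape for this, as a rough sketch: let `sort_keys=True` handle dicts, keep lists in their own order, and turn sets into sorted lists through the `default` hook of `json.dumps`. the names `canonical_json` and `stable_digest`, and the choice to sort set members by their own JSON text, are assumptions for illustration, not something settled in this discussion.

```python
import hashlib
import json

def _encode_extra(obj):
    """`default` hook: json.dumps calls this only for types it cannot encode itself."""
    if isinstance(obj, (set, frozenset)):
        # Sets have no order, so impose one: sort members by their canonical JSON text.
        return sorted(obj, key=canonical_json)
    raise TypeError(f"cannot canonicalize {type(obj).__name__}")

def canonical_json(obj) -> str:
    # sort_keys fixes dict ordering; compact separators remove whitespace variation.
    return json.dumps(obj, sort_keys=True, separators=(",", ":"), default=_encode_extra)

def stable_digest(obj) -> str:
    return hashlib.md5(canonical_json(obj).encode("utf-8")).hexdigest()

print(stable_digest({"a": 1, "b": 2}) == stable_digest({"b": 2, "a": 1}))
print(stable_digest([1, 2]) == stable_digest([2, 1]))
```

note that in this sketch a set and the equivalent sorted list produce the same digest, so a type tag would be needed if that distinction matters.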

uneven raptor Jul 24, 2024, 3:16 PM

#

honestly, the best solution seems to be messing with PYTHONHASHSEED

#

that removes the string randomization from all objects

unkempt rock Jul 24, 2024, 3:16 PM

#

What is python ?

feral island Jul 24, 2024, 3:17 PM

#

uneven raptor that removes the string randomization from all objects

$ python -c 'print(hash(None))'
5883826190306
$ python -c 'print(hash(None))'
5905208968162

#

(and that's not affected by PYTHONHASHSEED)

uneven raptor Jul 24, 2024, 3:17 PM

#

feral island ```$ python -c 'print(hash(None))' 5883826190306 $ python -c 'print(hash(None))'...

is None affected by PYTHONHASHSEED?

#

damn it

feral island Jul 24, 2024, 3:18 PM

#

it's the memory address

#

(maybe not directly)

uneven raptor Jul 24, 2024, 3:18 PM

#

i would have assumed the None hash was just zero

unkempt rock Jul 24, 2024, 3:18 PM

#

Why no one program on the 1 and 0 ?

grave jolt Jul 24, 2024, 3:18 PM

#

That's the thing: you should not assume things about hash 🙂

uneven raptor Jul 24, 2024, 3:19 PM

#

not disagreeing, but there's not really much else i can do, is there?

feral island Jul 24, 2024, 3:20 PM

#

you could disable ASLR I guess

#

(please don't)

unkempt rock Jul 24, 2024, 3:21 PM

#

Every program project 40% stealing 40% ai generated 10% eating 10 % actual work

urban sandal Jul 24, 2024, 3:21 PM

#

you really just need a stable hash function and a stable conversion from whatever object to bytes. trying to just use json and python's __hash__ is probably the wrong call

feral island Jul 24, 2024, 3:21 PM

#

More seriously it just doesn't seem like hash() is the right tool for what you want

#

You'll have to define your own hashing mechanism

uneven raptor Jul 24, 2024, 3:22 PM

#

probably. i lose potential support for any objects that support __hash__ though

feral island Jul 24, 2024, 3:23 PM

#

Right, but as we've been discussing, support for such objects is likely to create bugs for you

unkempt rock Jul 24, 2024, 3:23 PM

#

I can't understand this chat

feral island Jul 24, 2024, 3:23 PM

#

Because there's a good chance their __hash__ depends on hashing a string or None or something else with an unstable hash

unkempt rock Jul 24, 2024, 3:23 PM

#

What is hash ?

grave jolt Jul 24, 2024, 3:24 PM

#

uneven raptor CC @grave jolt, since we discussed this earlier what would be the bes...

You can define a custom_hash for types like list, dict, int, str, None etc., and for compound types you can iterate over the fields in a predefined order. Like:

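import hashlib

def custom_hash(obj) -> bytes:
    if isinstance(obj, int):
        # Fixed-width signed encoding; ints outside the 64-bit range would need more bytes
        return obj.to_bytes(8, byteorder='little', signed=True)
    elif isinstance(obj, str):
        return hashlib.md5(obj.encode("utf-8")).digest()
    elif obj is None:
        return b"\x00"
    elif isinstance(obj, list):
        # Combine the item hashes in order
        return hashlib.md5(b"".join(custom_hash(x) for x in obj)).digest()
    # ... handle dict, etc.
    else:
        # Assumes a pydantic-style object that exposes __fields__
        field_names = sorted(obj.__fields__)
        values = [getattr(obj, k) for k in field_names]
        return custom_hash([type(obj).__name__, *values])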

unkempt rock Jul 24, 2024, 3:24 PM

#

Grinding is hard on this one

urban sandal Jul 24, 2024, 3:25 PM

#

For a similar case, I have:

return xxhash.xxh64_digest(msgspec.msgpack.encode(payload), seed=0)

which is limited to types msgspec knows how to encode to msgpack (it will do so recursively), and reliant on 2 libraries (xxhash and msgspec), but you can do basically anything similar.

uneven raptor Jul 24, 2024, 3:26 PM

#

feral island Because there's a good chance their `__hash__` depends on hashing a string or No...

theoretically speaking (i'm pretty much convinced by now to not use hash()), could one set PYTHONHASHSEED, and then monkeypatch None.__hash__ to return 0 or something?

unkempt rock Jul 24, 2024, 3:26 PM

#

How to code guys ?

feral island Jul 24, 2024, 3:29 PM

#

uneven raptor theoretically speaking (i'm pretty much convinced by now to not use `hash()`), c...

depends on the value of "theoretically". You'd need to use very bad hacks, like using ctypes to modify NoneType's methods

uneven raptor Jul 24, 2024, 3:29 PM

#

yeah, that's what i mean

halcyon trail Jul 24, 2024, 4:16 PM

#

uneven raptor honestly, the best solution seems to be messing with `PYTHONHASHSEED`

I mean that's a "global" thing so I really don't suggest doing that

uneven raptor Jul 24, 2024, 4:17 PM

#

you would have to change it back afterwards

halcyon trail Jul 24, 2024, 4:17 PM

#

what's actually the thing you want to do here

uneven raptor Jul 24, 2024, 4:17 PM

#

grave jolt You can define a `custom_hash` for types like `list`, `dict`, `int`, `str`, `Non...

this, probably

feral island Jul 24, 2024, 4:17 PM

#

uneven raptor you would have to change it back afterwards

pretty sure the effect remains for the lifetime of the process?

halcyon trail Jul 24, 2024, 4:17 PM

#

associate some kind of shorter, representative string, to an arbitrary python value?

#

not "representative" in the sense of debug information, but something like a UUID or hash or whatever

uneven raptor Jul 24, 2024, 4:19 PM

#

feral island pretty sure the effect remains for the lifetime of the process?

actually, i take that back, i don't think you would have to set anything. from some experimenting, setting PYTHONHASHSEED does nothing once the interpreter has started. would setting PYTHONHASHSEED, and then spinning up a subinterpreter work? it shouldn't affect the current proc

feral island Jul 24, 2024, 4:19 PM

#

uneven raptor actually, i take that back, i don't think you would have to set anything. from s...

I haven't checked the code but I'd assume it's read only at process startup

#

If so, a subinterpreter wouldn't be enough

halcyon trail Jul 24, 2024, 4:20 PM

#

a process pool should work

uneven raptor Jul 24, 2024, 4:20 PM

#

looks like it's done here https://github.com/python/cpython/blob/e9681211b9ad11d1c1f471c43bc57cac46814779/Python/initconfig.c#L1509

fallen slateBOT Jul 24, 2024, 4:20 PM

#

Python/initconfig.c line 1509

config_init_hash_seed(PyConfig *config)

halcyon trail Jul 24, 2024, 4:20 PM

#

create a process pool of size one with PYTHONHASHSEED set appropriately

feral island Jul 24, 2024, 4:20 PM

#

and don't use fork to create the processes

halcyon trail Jul 24, 2024, 4:20 PM

#

why?

uneven raptor Jul 24, 2024, 4:20 PM

#

you know more than me here, is config_init_hash_seed called at process startup, or interpreter startup?

halcyon trail Jul 24, 2024, 4:21 PM

#

either way, why take the chance

feral island Jul 24, 2024, 4:21 PM

#

uneven raptor you know more than me here, is `config_init_hash_seed` called at process startup...

not sure, sorry

uneven raptor Jul 24, 2024, 4:21 PM

#

halcyon trail either way, why take the chance

i'm just curious at this point

uneven raptor Jul 24, 2024, 4:21 PM

#

feral island not sure, sorry

i traced the top level function to PyConfig_Read, what about that?

feral island Jul 24, 2024, 4:22 PM

#

uneven raptor i traced the top level function to `PyConfig_Read`, what about that?

I got that far too but I don't know. Haven't looked at this part of the interpreter much

feral island Jul 24, 2024, 4:22 PM

#

halcyon trail why?

Forked processes would not re-execute the Python startup code, so they won't read the value of the env var

uneven raptor Jul 24, 2024, 4:26 PM

#

feral island I got that far too but I don't know. Haven't looked at this part of the interpre...

it does not :(

import os

import _xxsubinterpreters as _interpreters

print(hash("123"))
os.environ["PYTHONHASHSEED"] = "0"

interp = _interpreters.create()
_interpreters.run_string(interp, "print(hash('123'))")  # prints the same thing

feral island Jul 24, 2024, 4:27 PM

#

that makes sense. Note that changing the environment after process startup is inherently thread-unsafe

halcyon trail Jul 24, 2024, 4:30 PM

#

@uneven raptor i think I got it, fwiw

uneven raptor Jul 24, 2024, 4:30 PM

#

got what? runtime modification of PYTHONHASHSEED?

halcyon trail Jul 24, 2024, 4:31 PM

#

with ProcessPoolExecutor(max_workers=1, mp_context=multiprocessing.get_context("spawn")) as ppe:
    print(ppe.submit(hash, "123").result())

#

if you put this in your program after the os.environ call (and obviously add the needed imports)

#

you should see different values

#

You can spawn the PPE once at top level, and just have a convenience function that takes the ppe and the object to be hashed and computes the hash. so the overhead won't be too bad

uneven raptor Jul 24, 2024, 4:32 PM

#

unfortunately that's a very expensive operation just for calling hash()

halcyon trail Jul 24, 2024, 4:33 PM

#

how many hashes are you calling? Keep in mind that you just do this for the "top" level hash you need to compute

uneven raptor Jul 24, 2024, 4:33 PM

#

a lot :D

#

i'm just gonna write my own function

halcyon trail Jul 24, 2024, 4:34 PM

#

takes about 50 ms

#

though most of that time is simply waiting to get back the message, the CPU time is more like 1ms

uneven raptor Jul 24, 2024, 4:34 PM

#

well, this would be in a function that gets called quite a bit

halcyon trail Jul 24, 2024, 4:35 PM

#

yeah. It's better not to mess with this stuff anyway. It really depends on just how arbitrary of a python object you want this to work on.

#

if you're dealing with reasonably constrained set of objects, then it's not that bad

regal glen Jul 25, 2024, 12:49 PM

#

<@&831776746206265384>

#

(in multiple channels. Just search up what they posted)

glass mulch Jul 25, 2024, 1:00 PM

#

So, is there any fundamental reason PyREPL doesn't support command history in Windows, or would it be OK to add?

glass mulch Jul 26, 2024, 10:00 PM

#

glass mulch So, is there any fundamental reason PyREPL doesn't support command history in Wi...

I've submitted a PR to add history support for PyREPL in Windows. Anyone want to give it a try?

uneven raptor Jul 26, 2024, 10:02 PM

#

is it possible for cpython to be built without _socket? types.CapsuleType relies on that

torpid ember Jul 27, 2024, 5:26 AM

#

uneven raptor is it possible for cpython to be built without `_socket`? `types.CapsuleType` re...

did you try?

uneven raptor Jul 27, 2024, 5:27 AM

#

torpid ember did you try?

no, i looked through configure's options and didn't see anything

#

did i miss some blatant option somewhere? 😅

torpid ember Jul 27, 2024, 5:29 AM

#

uneven raptor no, i looked through `configure`'s options and didn't see anything

printf '*disabled*\n_socket\n' > Modules/Setup.local
./configure --with-pydebug && make -j

uneven raptor Jul 27, 2024, 5:30 AM

#

interesting. is there a better option for exposing a CapsuleType?

feral island Jul 27, 2024, 5:30 AM

#

I don't think we need to support that sort of configuration

torpid ember Jul 27, 2024, 5:30 AM

#

uneven raptor did i miss some blatant option somewhere? 😅

You're right, there is nothing in our documentation that mentions this feature. IIRC, there is only a mention in the changelog, which is quite difficult to find.

feral island Jul 27, 2024, 5:31 AM

#

You can do it if you really want to but we shouldn't need to cater to that sort of thing in the rest of the implementation

uneven raptor Jul 27, 2024, 5:31 AM

#

feral island I don't think we need to support that sort of configuration

sure, i'm just speculating. i saw that in types and was wondering if it would be problematic

torpid ember Jul 27, 2024, 5:34 AM

#

oh.. we do have a types.CapsuleType.... I thought it wasn't exposed in the types module..

feral island Jul 27, 2024, 5:34 AM

#

it's pretty recent

torpid ember Jul 27, 2024, 5:34 AM

#

yeah, it was added 10 months ago

rose schooner Jul 27, 2024, 5:35 AM

#

uneven raptor is it possible for cpython to be built without `_socket`? `types.CapsuleType` re...

it does, but types doesn't rely on its existence

uneven raptor Jul 27, 2024, 5:36 AM

#

rose schooner it does, but `types` doesn't rely on its existence

CapsuleType is defined as type(_socket.CAPI). what happens if _socket isn't built?

torpid ember Jul 27, 2024, 5:37 AM

#

uneven raptor `CapsuleType` is defined as `type(_socket.CAPI)`. what happens if `_socket` isn'...

>>> import types
>>> types.CapsuleType()
Traceback (most recent call last):
  File "<python-input-1>", line 1, in <module>
    types.CapsuleType()
    ^^^^^^^^^^^^^^^^^
  File "/home/eclips4/programming-languages/cpython/Lib/types.py", line 336, in __getattr__
    import _socket
ModuleNotFoundError: No module named '_socket'

#

as expected

uneven raptor Jul 27, 2024, 5:37 AM

#

so it does rely on it

rose schooner Jul 27, 2024, 5:37 AM

#

uneven raptor `CapsuleType` is defined as `type(_socket.CAPI)`. what happens if `_socket` isn'...

hmm yea wait

torpid ember Jul 27, 2024, 5:37 AM

#

uneven raptor so it does rely on it

yes it does

rose schooner Jul 27, 2024, 5:37 AM

#

oh

feral island Jul 27, 2024, 5:37 AM

#

it doesn't break import of the module, though

rose schooner Jul 27, 2024, 5:37 AM

#

types doesn't rely on it either way

#

https://github.com/python/cpython/blob/3.13/Lib/types.py#L334-L338

fallen slateBOT Jul 27, 2024, 5:37 AM

#

Lib/types.py lines 334 to 338

def __getattr__(name):
    if name == 'CapsuleType':
        import _socket
        return type(_socket.CAPI)
    raise AttributeError(f"module {__name__!r} has no attribute {name!r}")

torpid ember Jul 27, 2024, 5:37 AM

#

The capsules are ... it's a complex thing.

rose schooner Jul 27, 2024, 5:37 AM

#

it's a dynamic get-attribute

#

so if _socket doesn't exist, it won't break the entire thing
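the same pattern can be tried on a throwaway module, as a small sketch. `demo`, `OptionalThing` and `_module_that_does_not_exist` are made-up names standing in for `types`, `CapsuleType` and a missing `_socket`.

```python
import types

# Build a throwaway module so the example is self-contained.
demo = types.ModuleType("demo")

def _module_getattr(name):
    # Only reached when normal attribute lookup on the module fails.
    if name == "OptionalThing":
        import _module_that_does_not_exist  # hypothetical missing extension module
        return _module_that_does_not_exist.THING
    raise AttributeError(f"module 'demo' has no attribute {name!r}")

demo.__getattr__ = _module_getattr
demo.VERSION = "1.0"

print(demo.VERSION)          # ordinary attributes are unaffected
try:
    demo.OptionalThing       # the import is attempted only at this point
except ModuleNotFoundError as exc:
    print("lazy import failed:", exc)
```

so the failure only shows up for code that actually touches the lazy attribute, not for everything that imports the module.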

uneven raptor Jul 27, 2024, 5:38 AM

#

are all extension modules optional? or just ones like _socket

feral island Jul 27, 2024, 5:38 AM

#

depends on what you mean by "optional"

uneven raptor Jul 27, 2024, 5:38 AM

#

IIRC _datetime has a capsule, if that's more stable than _socket it might be worth moving to that

torpid ember Jul 27, 2024, 5:38 AM

#

uneven raptor are all extension modules optional? or just ones like `_socket`

probably most of them

torpid ember Jul 27, 2024, 5:39 AM

#

uneven raptor IIRC `_datetime` has a capsule, if that's more stable than `_socket` it might be...

i think there's no difference between _datetime and _socket

feral island Jul 27, 2024, 5:39 AM

#

there are certain extension modules in the stdlib that rely on the presence of third-party dependencies that may or may not be present (e.g., tkinter, gdbm)

#

those are really optional, and it's not unusual to encounter a system that lacks some of them

#

then there are those that are always built by default, but you could massage the build system to remove them if you try hard enough, such as _socket. I don't think those are "optional" in a meaningful sense.

#

And then there are modules that are really built deeply into the interpreter, such as sys. If you tried hard enough I guess you could build CPython without sys, but it would be a lot of work.

uneven raptor Jul 27, 2024, 5:41 AM

#

ah -- i was wondering if _socket was similar to _tkinter, in the sense that it relies on an external dependency

merry bramble Jul 27, 2024, 10:19 AM

#

I think PyPy doesn't have a _socket module. And ideally the pure-Python parts of the stdlib should be written so that they work out of the box for all Python implementations, not just CPython. (That's impossible to achieve fully, but we do the best we can.)

glass mulch Jul 27, 2024, 11:04 AM

#

merry bramble I think PyPy doesn't have a `_socket` module. And ideally the pure-Python parts ...

It does have a _socket module, but it lacks _socket.CAPI in 3.10, while it's present in CPython 3.10. But it does have a notion of capsules.

uneven raptor Jul 27, 2024, 4:59 PM

#

does pypy even have capsules in the first place?

raven ridge Jul 27, 2024, 5:22 PM

#

pypy supports most of the CPython C API

uneven raptor Jul 28, 2024, 8:52 PM

#

what does os._exit do, specifically? the source seems to call os__exit_impl, but i wasn’t able to find the definition of that function

raven ridge Jul 28, 2024, 8:54 PM

#

uneven raptor what does `os._exit` do, specifically? the source seems to call `os__exit_impl`,...

it calls https://man7.org/linux/man-pages/man2/exit.2.html

rose schooner Jul 28, 2024, 8:55 PM

#

uneven raptor what does `os._exit` do, specifically? the source seems to call `os__exit_impl`,...

call _exit()

#

https://github.com/python/cpython/blob/main/Modules/posixmodule.c#L6684-L6690

fallen slateBOT Jul 28, 2024, 8:56 PM

#

Modules/posixmodule.c lines 6684 to 6690

static PyObject *
os__exit_impl(PyObject *module, int status)
/*[clinic end generated code: output=116e52d9c2260d54 input=5e6d57556b0c4a62]*/
{
    _exit(status);
    return NULL; /* Make gcc -Wall happy */
}
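A small sketch of what that means in practice: because `os._exit` goes straight to `_exit()`, the interpreter's normal shutdown steps (such as `atexit` handlers) do not run. The child script below is purely illustrative.

```python
import subprocess
import sys

child_code = """
import atexit, os
atexit.register(lambda: print("atexit handler ran"))
print("buffered output")   # usually still sitting in the buffer when stdout is a pipe
os._exit(3)                # exits immediately: no atexit handlers, no flush
"""

result = subprocess.run([sys.executable, "-c", child_code], capture_output=True, text=True)
print(result.returncode)     # the status passed to os._exit
print(repr(result.stdout))   # the atexit message never shows up here
```

Whether the `"buffered output"` line is lost as well depends on how stdout is buffered in the child (for example, `PYTHONUNBUFFERED` changes it).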

uneven raptor Jul 28, 2024, 8:56 PM

#

fallen slate `Modules/posixmodule.c` lines 6684 to 6690 ```c static PyObject * os__exit_impl(...

oh, there it is. gh search wasn’t bringing it up

rose schooner Jul 28, 2024, 8:57 PM

#

this internal module also covers nt for some reason

#

despite being named posixmodule.c

glass mulch Jul 30, 2024, 1:04 PM

#

Docs for compile() and ast.parse():

This function raises SyntaxError if the compiled source is invalid, and ValueError if the source contains null bytes.

Behavior since 3.12 (it does raise ValueError in 3.11):

>>> compile("\x00", "lambda.txt", "exec")
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
SyntaxError: source code string cannot contain null bytes

Does it look like a valid doc issue I should file, or am I doing something wrong?

feral island Jul 30, 2024, 1:07 PM

#

glass mulch Docs for `compile()` and `ast.parse()`: ``` This function raises SyntaxError if ...

Good catch, definitely something needs fixing. Possibly it should be changed back to raise ValueError for compatibility. Would be good to bisect to see why it was changed

glass mulch Jul 30, 2024, 1:12 PM

#

Wouldn't that break code that has adapted to the 3.12 and greater behavior? I was thinking about updating the docs, but if you think a behavior change is warranted I can run a bisect.

feral island Jul 30, 2024, 1:13 PM

#

It would, but that change is undocumented and it's still early in the life of 3.12. It might still be better to keep the change since it's been released and nobody appears to have complained, though.
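Code that has to run on both sides of the change can sidestep the question by catching both exception types. A minimal sketch, where `try_compile` is a hypothetical helper name:

```python
def try_compile(source: str, filename: str = "<demo>"):
    """Return a code object, or None if the source is rejected."""
    try:
        return compile(source, filename, "exec")
    except (SyntaxError, ValueError):
        # Null bytes raise ValueError on 3.11 and SyntaxError on 3.12+, as noted above.
        return None

print(try_compile("\x00"))
print(try_compile("x = 1") is not None)
```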

glass mulch Jul 30, 2024, 1:18 PM

#

Ah, it's a docs issue, the change is in whatsnew for 3.12:
https://github.com/python/cpython/blob/d1a1bca1f0550a4715f1bf32b1586caa7bc4487b/Doc/whatsnew/3.12.rst?plain=1#L600-L602

fallen slateBOT Jul 30, 2024, 1:18 PM

#

Doc/whatsnew/3.12.rst?plain=1 lines 600 to 602

* :func:`ast.parse` now raises :exc:`SyntaxError` instead of :exc:`ValueError`
  when parsing source code containing null bytes. (Contributed by Pablo Galindo
  in :gh:`96670`.)

feral island Jul 30, 2024, 1:30 PM

#

glass mulch Ah, it's a docs issue, the change is in whatsnew for 3.12: https://github.com/py...

If you send a PR, request review from me and I'll look at it tonight

glass mulch Jul 30, 2024, 1:45 PM

#

Thanks, I pinged you: https://github.com/python/cpython/pull/122462

GitHub

gh-122461: Document that compile() and ast.parse() raise SyntaxErro...

Update docs for compile() and ast.parse() because they raise SyntaxError instead of ValueError for null bytes since #97594.

Issue: gh-122461

📚 Documentation preview 📚: https://cpython-preview...

uneven raptor Jul 30, 2024, 3:51 PM

#

a PR of mine has one of the docs jobs failing due to check-warnings.py missing. is that my fault, or is there something wrong with CI?

feral island Jul 30, 2024, 4:08 PM

#

uneven raptor a PR of mine has one of the docs jobs failing due to `check-warnings.py` missing...

hm we may have been changing that recently, I think there's some security project. Maybe try merging in main into your PR branch

uneven raptor Jul 30, 2024, 4:14 PM

#

looks like that did it, thanks!

native wave Jul 30, 2024, 7:43 PM

#

Hey, not sure if this is the right channel to ask. But I noticed that IDLE (for python 3.12.4) very often freezes, like the Not Responding thing on windows, when massive amount of data is being printed, for example after running dis.dis(...) with a very large code object which has a lot of bytecodes. Is it supposed to be like that?

icy canyon Jul 31, 2024, 5:37 PM

#

Who know how to make minecraft cheats

winged sphinx Jul 31, 2024, 5:37 PM

#

icy canyon Who know how to make minecraft cheats

That's quite offtopic here. Maybe start in #python-discussion and be aware of #rules against helping with malicious stuff.

icy canyon Jul 31, 2024, 5:38 PM

#

winged sphinx That's quite offtopic here. Maybe start in #python-discussion and be aware of...

Thx

glass mulch Jul 31, 2024, 7:41 PM

#

Hmm, a ChatGPT generated PR.

glass mulch Jul 31, 2024, 8:39 PM

#

There are just so many ways of exiting the new REPL with errors... This one looks like a "then don't do it", but I will report it in case someone wants to make pyrepl bulletproof...

import builtins
builtins.__import__ = lambda x, y, z, a, b: None
# -> Exception ignored in the internal traceback machinery, exits interpreter

lambda x: None exits with a different error. Both "work" in basic REPL as in it doesn't exit (but imports are obviously borked).

uneven raptor Aug 1, 2024, 3:31 AM

#

if something is deprecated on 3.14, is that something that should be reflected on typeshed? or does typeshed only add types for versions that are past the feature freeze

feral island Aug 1, 2024, 3:37 AM

#

uneven raptor if something is deprecated on 3.14, is that something that should be reflected o...

we sometimes tend to wait until the feature freeze because things can change still

#

I'd probably accept a PR marking new deprecations though

uneven raptor Aug 1, 2024, 3:49 AM

#

ok, good to know

uneven raptor Aug 1, 2024, 5:36 AM

#

thanks for merging 🎉

shy mango Aug 1, 2024, 6:04 PM

#

Why is Python3.13 so much slower than Python3.12.4?

feral island Aug 1, 2024, 6:21 PM

#

shy mango Why is Python3.13 so much slower than Python3.12.4?

what makes you say that?

shy mango Aug 1, 2024, 6:21 PM

#

feral island what makes you say that?

Performance tests
#1268631691002380369 message

dusk comet Aug 2, 2024, 2:53 AM

#

cool, but off-topic
also rule 6

final totem Aug 2, 2024, 2:54 AM

#

dusk comet cool, but off-topic also rule 6

Thanks, I am new here. I am so sorry.

glass mulch Aug 2, 2024, 10:50 PM

#

shy mango Performance tests https://discord.com/channels/267624335836053506/12686316910023...

@shy mango It seems you found a real regression in performance in python.org mac builds, thank you for your investigation and for filing the issue 🙂

shy mango Aug 3, 2024, 12:27 AM

#

glass mulch @shy mango It seems you found a real regression in performance in pyt...

I'm glad to be able to help. I had always imagined that the Python repo would have so many issues and that if I ever reported anything it would just get pushed to the bottom and never viewed. But it seems that I was wrong.
Knowing this, I will likely be quicker to report issues I find in the future.

Thank you for listening and helping out.

boreal umbra Aug 3, 2024, 1:17 AM

#

I have 48 hours to train a model before the VM where it's running goes down for maintenance, so I asked ChatGPT to make a context manager that breaks out of the context after a fixed amount of time (to save the model in its then-present state right before the VM goes offline). The (human-refactored) solution is here: https://paste.pythondiscord.com/UEOA

I was surprised that it worked for my test cases. It depends on signal.SIGALRM. Though I wonder in what ways this approach sets one up for failure.

jade raven Aug 3, 2024, 1:25 AM

#

boreal umbra I have 48 hours to train a model before the VM where it's running goes down for ...

You will need a thread for your use case, as time.sleep is blocking

#

Or rather processes since your training will be blocking and CPU intensive

raven ridge Aug 3, 2024, 2:30 AM

#

boreal umbra I have 48 hours to train a model before the VM where it's running goes down for ...

did your test cases actually call the library that you'll be using for training? Whether or not that works depends on whether the code that gets interrupted by the SIGALRM is actually prepared to handle that exception by finishing what it's doing and immediately propagating the TimeoutException upwards. time.sleep() is prepared to do that, but your ML library might not be

#

I'd generally expect that to play pretty poorly with C extension modules that drop into native code for long periods of time. They'd need to poll PyErr_CheckSignals to make it work.
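For reference, a SIGALRM-based timeout usually looks roughly like the sketch below. This is not the code from the paste; `time_limit` is a made-up name, and it assumes Unix and that it runs in the main thread.

```python
import signal
import time
from contextlib import contextmanager

@contextmanager
def time_limit(seconds: float):
    """Raise TimeoutError inside the with-block after `seconds` (Unix, main thread only)."""
    def _on_alarm(signum, frame):
        raise TimeoutError(f"time limit of {seconds}s exceeded")

    previous = signal.signal(signal.SIGALRM, _on_alarm)
    signal.setitimer(signal.ITIMER_REAL, seconds)
    try:
        yield
    finally:
        signal.setitimer(signal.ITIMER_REAL, 0)      # cancel any pending alarm
        signal.signal(signal.SIGALRM, previous)      # restore the old handler

try:
    with time_limit(0.2):
        time.sleep(5)        # time.sleep is interruptible, so this is cut short
except TimeoutError as exc:
    print("interrupted:", exc)
```

The Python-level handler only runs once the interpreter gets a chance to process pending signals, which is exactly what a long stretch of native code can delay.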

boreal umbra Aug 3, 2024, 2:40 AM

#

raven ridge did your test cases actually call the library that you'll be using for training?...

Nope. (And I'm also not actually using the context manager.)

raven ridge Aug 3, 2024, 2:43 AM

#

well, then - be warned that whether or not this can actually interrupt running code depends on what that code is doing

south kayak Aug 3, 2024, 4:04 AM

#

Is there any news on that lock file pep that Brett's been rewriting?

uneven raptor Aug 3, 2024, 5:05 AM

#

south kayak Is there any news on that lock file pep that Brett's been rewriting?

!pep 665 this one?

fallen slateBOT Aug 3, 2024, 5:05 AM

#

**PEP 665 - A file format to list Python dependencies for reproducibility of an application**

Status

Rejected

Created

29-Jul-2021

Type

Standards Track

uneven raptor Aug 3, 2024, 5:06 AM

#

oh wait, i didn't see the supersede, oops

uneven raptor Aug 3, 2024, 5:06 AM

#

south kayak Is there any news on that lock file pep that Brett's been rewriting?

!pep 751 looks like you're looking for this

fallen slateBOT Aug 3, 2024, 5:06 AM

#

**PEP 751 - A file format to list Python dependencies for installation reproducibility**

Status

Draft

Created

24-Jul-2024

Type

Standards Track

raven ridge Aug 3, 2024, 5:34 AM

#

raven ridge well, then - be warned that whether or not this can actually interrupt running c...

Actually @boreal umbra: a reasonable proxy for whether or not this will interrupt whatever's running is whether or not a ctrl-c can interrupt that thing and raise a KeyboardInterrupt. It should work in almost exactly the same set of cases
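
For reference, a minimal sketch of the kind of SIGALRM-based timeout being discussed. The original context manager isn't shown in this thread, so the `time_limit` name and its body are assumptions; only `TimeoutException` is named above. It is Unix-only and has to run in the main thread.

```python
import signal
import time
from contextlib import contextmanager


class TimeoutException(Exception):
    """Raised from the SIGALRM handler when the time limit expires."""


@contextmanager
def time_limit(seconds: float):
    # Hypothetical helper. The handler only runs when the interpreter gets a
    # chance to check for pending signals, so native code that never polls
    # PyErr_CheckSignals will not be interrupted by this.
    def _handler(signum, frame):
        raise TimeoutException(f"timed out after {seconds} seconds")

    previous = signal.signal(signal.SIGALRM, _handler)
    signal.setitimer(signal.ITIMER_REAL, seconds)
    try:
        yield
    finally:
        signal.setitimer(signal.ITIMER_REAL, 0)  # cancel any pending alarm
        signal.signal(signal.SIGALRM, previous)  # restore the old handler


def interrupted_sleep() -> bool:
    """Return True if time.sleep() was cut short by the timeout."""
    try:
        with time_limit(0.1):
            time.sleep(5)
    except TimeoutException:
        return True
    return False
```

time.sleep() is the easy case here; whether a training loop inside a C extension reacts the same way is exactly the open question, and the ctrl-c test is a cheap way to find out.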

uneven raptor Aug 3, 2024, 3:11 PM

#

is there a reason there's a Core and Builtins as well as a Core_and_Builtins directory for the NEWS entries

#

the former has way more entries than the latter

#

same goes for C API and C_API

feral island Aug 3, 2024, 3:12 PM

#

uneven raptor is there a reason there's a `Core and Builtins` as well as a `Core_and_Builtins`...

We're transitioning from one to the other

#

the ones with spaces will eventually go away

uneven raptor Aug 3, 2024, 3:13 PM

#

oh ok, cool

#

does blurb put it in the underscore version now?

feral island Aug 3, 2024, 3:16 PM

#

yes, the newest release does

south kayak Aug 3, 2024, 5:32 PM

#

uneven raptor !pep 751 looks like you're looking for this

Mayhaps

thick hemlock Aug 3, 2024, 6:39 PM

#

south kayak Is there any news on that lock file pep that Brett's been rewriting?

there's a new draft yeah

#

with 2 types of locking this time!

wanton flame Aug 3, 2024, 10:00 PM

#

uneven raptor is there a reason there's a `Core and Builtins` as well as a `Core_and_Builtins`...

More info here https://discuss.python.org/t/new-blurb-1-2-please-upgrade/59159

uneven raptor Aug 3, 2024, 10:01 PM

#

yup, i've already upgraded my blurb

dreamy abyss Aug 4, 2024, 2:38 PM

#

Hello Everyone,I am Planning To DSA in PYTHON if Any body Interested let 's Connect!

dusk comet Aug 4, 2024, 2:50 PM

#

DSA = Data Science & AI ?

faint river Aug 4, 2024, 4:00 PM

#

dusk comet DSA = Data Science & AI ?

Data Structures and Algorithms

faint river Aug 4, 2024, 4:00 PM

#

dreamy abyss Hello Everyone,I am Planning To DSA in PYTHON if Any body Interested let 's Conn...

Wrong channel

glass mulch Aug 4, 2024, 6:57 PM

#

Having a file called _pyrepl.py in current directory makes the interpreter silently unable to start (or rather: it automatically exits). Having a file called runpy.py also makes the interpreter unable to start, but prints a helpful text about there being a file with that name shadowing the one from the stdlib. If we add an import from _pyrepl to main.c, having a file called _pyrepl.py will also display the helpful message about shadowing. Would that be worth it?

#

This only affects the new REPL.

#

Could not access _pyrepl.__PYREPL_MARKER
AttributeError: module '_pyrepl' has no attribute '__PYREPL_MARKER' (consider renaming '~\PycharmProjects\cpython\_pyrepl.py' since it has the same name as the standard library module named '_pyrepl' and the import system gives it precedence)

dusk comet Aug 4, 2024, 9:26 PM

#

why are you trying to kill new REPL so hard?

glass mulch Aug 4, 2024, 9:31 PM

#

I get anxious thinking it will break by itself and be considered a bad idea. I love the new REPL, so I'd like it to be robust.

thick hemlock Aug 4, 2024, 9:39 PM

#

I find what you're doing really cool

#

I was also kind of scared by how it was introduced without a non-default period (I know it was in PyPy before, but still)

#

Felt like it might end up a lot less robust and people are not gonna find many bugs before release

#

Really appreciate how you're testing this so thoroughly!

glass mulch Aug 4, 2024, 10:10 PM

#

thick hemlock Really appreciate how you're testing this so thoroughly!

Wow, thanks, that means a lot, I'll try even harder to find issues now 😄

halcyon trail Aug 4, 2024, 11:00 PM

#

just curious, is there actually a reason to use this new REPL over ipython, other than not having to or being able to install ipython?

feral island Aug 4, 2024, 11:01 PM

#

halcyon trail just curious, is there actually a reason to use this new REPL over ipython, othe...

I think that's the main reason. I hear that's a big one for educators though; installing ipython is a lot more steps for students to get up and running than just installing Python

halcyon trail Aug 4, 2024, 11:02 PM

#

I see. isn't it trivial to install ipython using pip? (I don't really use pip)

feral island Aug 4, 2024, 11:02 PM

#

trivial if you already know what you're doing 🙂

raven ridge Aug 4, 2024, 11:02 PM

#

the biggest disadvantage might be that IPython has quite a few dependencies, which might conflict with your own application's needs

#

on top of the obvious one that, if you need to install IPython into every venv, the user experience is much worse

halcyon trail Aug 4, 2024, 11:03 PM

#

i suppose so. I have no idea what setup educators are using; I guess maybe if they're just depending on the system python and not installing anything, and it saves them having to ask sysadmins to install one package, or something

feral island Aug 4, 2024, 11:03 PM

#

there might not be a sysadmin; e.g. if you're teaching students and they all bring their own laptops

halcyon trail Aug 4, 2024, 11:04 PM

#

raven ridge the biggest disadvantage might be that IPython has quite a few dependencies, whi...

I agree this could be an issue in theory, though I've never actually heard someone claim this. I'm not sure I understand the second point really, you'd just make ipython one of the packages in your dependencies.txt or however you're doing it, wouldn't you?

raven ridge Aug 4, 2024, 11:04 PM

#

that assumes the existence of a requirements.txt

#

beginners are vanishingly unlikely to have one

#

beginners mostly create a venv and pip install what they need into it one thing at a time, as they discover a new library that they want to use

halcyon trail Aug 4, 2024, 11:05 PM

#

feral island there might not be a sysadmin; e.g. if you're teaching students and they all bri...

I feel like in that scenario it should actually be easier, as the harder part is actually installing python, isn't it? using pip should be as simple as pip install ipython or maybe sudo pip install ipython - but then, that's what I'm asking. I've barely used pip in years.

halcyon trail Aug 4, 2024, 11:05 PM

#

raven ridge beginners are vanishingly unlikely to have one

but beginners probably aren't going to use venv at all, let alone multiple venvs?

raven ridge Aug 4, 2024, 11:06 PM

#

they're often forced to - it's impossible to install system-wide on lots of systems these days

feral island Aug 4, 2024, 11:06 PM

#

halcyon trail I feel like in that scenario it should actually be easier, as the harder part is...

I don't have personal experience with this but I imagine the more commands people have to run, the more opportunities there are for people to get confused

halcyon trail Aug 4, 2024, 11:06 PM

#

but basically what I'm hearing is - I should continue to tell people to use ipython, and to try to get ipython installed if they have even a smattering of experience

raven ridge Aug 4, 2024, 11:07 PM

#

raven ridge they're often forced to - it's impossible to install system-wide on lots of syst...

but the same point applies even with system-wide deployments. Better defaults are nice because people don't need to know that there's something better out there that they could instead be using and take the time to install it

halcyon trail Aug 4, 2024, 11:07 PM

#

if they are just using python for a 3-4 month course then obviously it doesn't matter as much

#

this makes me wonder just how bad ipython's dependencies are, would be nice if it was just packaged with python.

thick hemlock Aug 4, 2024, 11:08 PM

#

I think that in a Python class you're not even getting to explaining pip and what are packages until a pretty advanced part

raven ridge Aug 4, 2024, 11:08 PM

#

I do think IPython is nicer than pyrepl, FWIW

thick hemlock Aug 4, 2024, 11:08 PM

#

thick hemlock I think that in a Python class you're not even getting to explaining pip and wha...

While a REPL is probably day 1

raven ridge Aug 4, 2024, 11:08 PM

#

but pyrepl is much better than the old default REPL

halcyon trail Aug 4, 2024, 11:08 PM

#

wasn't the old default REPL called IDLE

#

or am I getting confused

thick hemlock Aug 4, 2024, 11:09 PM

#

raven ridge I do think IPython is nicer than pyrepl, FWIW

IPython is amazing!

thick hemlock Aug 4, 2024, 11:09 PM

#

halcyon trail wasn't the old default REPL called IDLE

no that's a different thing

raven ridge Aug 4, 2024, 11:09 PM

#

IDLE is a text editor

halcyon trail Aug 4, 2024, 11:09 PM

#

It was just a wild moment for me when I heard about the default repl getting syntax highlighting and block support a few months ago and I did a bit of a double take

raven ridge Aug 4, 2024, 11:09 PM

#

the old default REPL didn't have a name that I know of. It's just called the "basic repl" now that we need a name for it to distinguish it from the new pyrepl

halcyon trail Aug 4, 2024, 11:10 PM

#

ipython has been stable and widely used and had those features for around a decade

raven ridge Aug 4, 2024, 11:10 PM

#

the old default repl is what you get if you just run "python3" in a shell

halcyon trail Aug 4, 2024, 11:10 PM

#

oh damn

#

yeah the default repl is/was truly terrible

thick hemlock Aug 4, 2024, 11:10 PM

#

btw on servers with thin docker images you probably won't have IPython

raven ridge Aug 4, 2024, 11:11 PM

#

yeah. IPython is great, but better defaults benefit everyone

halcyon trail Aug 4, 2024, 11:11 PM

#

raven ridge IDLE is a text editor

it seems like IDLE has an interpreter built-in though, it's not just a text editor

raven ridge Aug 4, 2024, 11:11 PM

#

IDE, then, if you like, I guess

halcyon trail Aug 4, 2024, 11:12 PM

#

raven ridge yeah. IPython is great, but better defaults benefit everyone

well, this is an exaggeration tbh :-). For sure, it will benefit some people. It hasn't affected me in the last 10 years, and I suspect it will not benefit me in the future. But I'm happy someone will benefit!

feral island Aug 4, 2024, 11:12 PM

#

halcyon trail it seems like IDLE has an interpreter built-in though, it's not just a text edit...

I think the interpreter has IDLE built into it 😛

halcyon trail Aug 4, 2024, 11:13 PM

#

i was just confused when he said IDLE was a text editor because I could have sworn I had a distant memory of a window with IDLE written on it, and an interpreter, and I did a google search to make sure I wasn't having a senior moment

feral island Aug 4, 2024, 11:13 PM

#

but yes, it's meant to be an IDE, not just a text editor

halcyon trail Aug 4, 2024, 11:14 PM

#

thick hemlock btw on servers with thin docker images you probably won't have IPython

idk about probably, I suspect that it's so little space that if you like ipython there isn't really much reason not to just deploy it

#

I'm actually curious now to see how much space ipython + dependencies actually uses

#

i wonder if ipython installs all the notebook stuff too by default, or if you can just get the ipython interpreter by itself

raven ridge Aug 4, 2024, 11:15 PM

#

it does install the notebook stuff by default

halcyon trail Aug 4, 2024, 11:16 PM

#

So, I don't have pip handy, but with micromamba, an environment with just python is 180M. When I install ipython, it goes to 217M

halcyon trail Aug 4, 2024, 11:17 PM

#

raven ridge it does install the notebook stuff by default

I'm not sure if it does, at least on mamba/conda, all the notebook stuff has moved to the "jupyter" moniker

raven ridge Aug 4, 2024, 11:17 PM

#

ooh, indeed - I think I'm wrong about that, and the notebook stuff got split out

#

here's what it installed:

Installing collected packages: wcwidth, pure-eval, ptyprocess, traitlets, six, pygments, prompt-toolkit, pexpect, parso, executing, decorator, matplotlib-inline, jedi, asttokens, stack-data, IPython

halcyon trail Aug 4, 2024, 11:17 PM

#

  + pickleshare          0.7.5  py_1003       conda-forge     Cached
  + decorator            5.1.1  pyhd8ed1ab_0  conda-forge     Cached
  + exceptiongroup       1.2.2  pyhd8ed1ab_0  conda-forge       20kB
  + pygments            2.18.0  pyhd8ed1ab_0  conda-forge     Cached
  + traitlets           5.14.3  pyhd8ed1ab_0  conda-forge     Cached
  + typing_extensions   4.12.2  pyha770c72_0  conda-forge       40kB
  + executing            2.0.1  pyhd8ed1ab_0  conda-forge     Cached
  + pure_eval            0.2.3  pyhd8ed1ab_0  conda-forge       17kB
  + wcwidth             0.2.13  pyhd8ed1ab_0  conda-forge     Cached
  + ptyprocess           0.7.0  pyhd3deb0d_0  conda-forge     Cached
  + parso                0.8.4  pyhd8ed1ab_0  conda-forge     Cached
  + six                 1.16.0  pyh6c4a22f_0  conda-forge     Cached
  + matplotlib-inline    0.1.7  pyhd8ed1ab_0  conda-forge     Cached
  + prompt-toolkit      3.0.47  pyha770c72_0  conda-forge      271kB
  + pexpect              4.9.0  pyhd8ed1ab_0  conda-forge     Cached
  + jedi                0.19.1  pyhd8ed1ab_0  conda-forge     Cached
  + asttokens            2.4.1  pyhd8ed1ab_0  conda-forge     Cached
  + stack_data           0.6.2  pyhd8ed1ab_0  conda-forge     Cached
  + ipython             8.26.0  pyh707e725_0  conda-forge      599kB

#

not quite a 1:1 match

#

i wonder why it's different

#

but yeah, I do think the "weight"/space reason isn't much of a reason not to install ipython, at least now with jupyter split out - it will add a trivial amount to the size of your venv/docker/etc

raven ridge Aug 4, 2024, 11:20 PM

#

50 MB, it seems - small, but not trivial

halcyon trail Aug 4, 2024, 11:24 PM

#

On a raspberry pi or something like that not trivial

#

On a typical server probably trivial

#

(it was 37 M for me and in a real project it would be even less, as some things would be amortized by other dependencies)

raven ridge Aug 4, 2024, 11:28 PM

#

fwiw, pyrepl is adapted from pypy's repl

#

it's not a brand new repl being built from scratch for CPython, it's an existing one being incorporated

thick hemlock Aug 4, 2024, 11:30 PM

#

halcyon trail So, I don't have pip handy, but with micromamba, an environment with just python...

not that small tbh

#

Alpine itself is just 50MB or so

raven ridge Aug 4, 2024, 11:32 PM

#

a Python install is ~100 MB, if IPython is ~50 MB (du says 48.5 MB in the fresh venv I just installed it into) that's a ~50% size increase for an app with no other dependencies

thick hemlock Aug 4, 2024, 11:33 PM

#

very big!

#

obviously you're not gonna be running that many REPLs on your servers anyway

#

but still

raven ridge Aug 4, 2024, 11:34 PM

#

all the more reason why it's not great to be paying the cost for a heavier REPL by default

halcyon trail Aug 4, 2024, 11:35 PM

#

I think this is pretty theoretical, the vast majority of people I talk to have far bigger deployments than that

#

But yes, along with students, people for whom < 50 megs on a deployment is make or break are another major beneficiary here

#

But most people can easily have access to ipython everywhere if they choose to

raven ridge Aug 4, 2024, 11:36 PM

#

when we talk about every install of the interpreter getting larger by X%, that has quite a broad impact. A server that could have hosted N apps can now only host 2/3N

#

at the extremes, granted - but there are a lot of apps that have few or no dependencies outside the stdlib
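
The arithmetic behind the ~50% and 2/3 figures, as a quick sketch. The two sizes are the rough numbers quoted in this conversation, not measurements of any particular build.

```python
python_mb = 100.0  # approximate size of a bare Python install, as quoted above
ipython_mb = 48.5  # du figure quoted above for IPython plus its dependencies

# Relative growth of each install when IPython is added.
increase = ipython_mb / python_mb

# Fraction of the original number of installs that fits in the same space.
capacity = python_mb / (python_mb + ipython_mb)

print(f"size increase per install: {increase:.1%}")
print(f"installs that still fit: {capacity:.2f} N")
```

This only applies to apps with no other dependencies; the more an app already pulls in, the smaller the relative increase.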

halcyon trail Aug 4, 2024, 11:37 PM

#

I don't think being limited by disk space is a common scenario 🤷‍♂️

#

Also if you are really that disk space constrained, and deploying that many environments, you should use something that can reuse storage

thick hemlock Aug 4, 2024, 11:38 PM

#

halcyon trail I think this is pretty theoretical, the vast majority of people I talk to have f...

Well, it does depend on who you talk to... The Python community is very very broad

raven ridge Aug 4, 2024, 11:38 PM

#

halcyon trail I don't think being limited by disk space is a common scenario 🤷‍♂️

at the level of infrastructure providers it's a huge one, AFAIU

halcyon trail Aug 4, 2024, 11:39 PM

#

Conda/mamba use hard links extensively so that N environments will not take up even close to Nx as much disk space
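
A small sketch of why hard links save space: two directory entries point at the same inode, so the data is stored once. The file names are made up for illustration, and this assumes a filesystem that supports hard links; it is not a description of how conda lays out its package cache.

```python
import os
import tempfile

with tempfile.TemporaryDirectory() as tmp:
    # Hypothetical file names standing in for the same library in two environments.
    original = os.path.join(tmp, "env_a_libfoo.so")
    linked = os.path.join(tmp, "env_b_libfoo.so")

    with open(original, "wb") as fh:
        fh.write(b"\0" * 4096)

    os.link(original, linked)  # second name for the same data, no copy made

    stat_a, stat_b = os.stat(original), os.stat(linked)
    same_inode = stat_a.st_ino == stat_b.st_ino
    link_count = stat_a.st_nlink

print(same_inode, link_count)
```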

thick hemlock Aug 4, 2024, 11:39 PM

#

That doesn't work with containers though, right?

halcyon trail Aug 4, 2024, 11:39 PM

#

Not sure

thick hemlock Aug 4, 2024, 11:39 PM

#

I mean containers have different file systems

#

They're, by design, isolated

halcyon trail Aug 4, 2024, 11:40 PM

#

But like, you can't simultaneously care so much about disk space that this is a big deal but also pick such an inefficient solution to begin with, it seems to me

thick hemlock Aug 4, 2024, 11:41 PM

#

I don't know, I think if you look at solutions offered by cloud providers it's not uncommon to see an app deployed on 100+ slim containers

halcyon trail Aug 4, 2024, 11:41 PM

#

Anyway, I'm certainly willing to bet this is quite niche - I've yet to encounter someone who actually said they wanted ipython on a server, considered adding it, but felt they couldn't because of disk constraints

#

Are you my first? 😛

#

Everyone else I talked to said they just didn't bother, or didn't know what ipython was, or already had it everywhere

thick hemlock Aug 4, 2024, 11:42 PM

#

Again, the python community is very very vast

halcyon trail Aug 4, 2024, 11:42 PM

#

I'll take that as a no

thick hemlock Aug 4, 2024, 11:42 PM

#

The people I talk to are different than who you talk to

thick hemlock Aug 4, 2024, 11:42 PM

#

halcyon trail I'll take that as a no

I have talked to people that really cared about container sizes

#

And I have been on containers where I wished I had IPython

halcyon trail Aug 4, 2024, 11:43 PM

#

That's not what I asked, but good to know!

thick hemlock Aug 4, 2024, 11:43 PM

#

I mean, it's just not a discussion I really have

#

I don't know what answers I'd get

thick hemlock Aug 4, 2024, 11:44 PM

#

halcyon trail That's not what I asked, but good to know!

not sure where you asked something

halcyon trail Aug 4, 2024, 11:44 PM

#

Look for the question mark I guess

thick hemlock Aug 4, 2024, 11:44 PM

#

Oh

#

Yeah me personally I wouldn't mind the size

#

on apps I work on

#

I would add IPython to all of them probably if I got around to it

faint river Aug 5, 2024, 5:52 PM

#

<@&831776746206265384> joined to advertise, they put this same message in 3 channels

boreal umbra Aug 6, 2024, 6:35 PM

#

What are the chances that type statements could be extended to support keyword arguments?

type Vector = list[float, size=x]

This example doesn't seem very useful, but for the types that DS/AI people often use, it would be helpful to encode promises like what columns a given dataframe would have or the number of dimensions an array would have.

feral island Aug 6, 2024, 6:37 PM

#

!pep 637

fallen slateBOT Aug 6, 2024, 6:37 PM

#

**PEP 637 - Support for indexing with keyword arguments**

Status

Rejected

Python-Version

3.10

Created

24-Aug-2020

Type

Standards Track

feral island Aug 6, 2024, 6:38 PM

#

boreal umbra What are the chances that type statements could be extended to support keyword a...

I don't think this should have anything to do with type statements, but I think a case could be made for revisiting that PEP

boreal umbra Aug 6, 2024, 6:39 PM

#

Yeah, I remember that PEP. I liked it at the time, but I think limiting the new behavior to only type statements would address the concerns that the council had.

feral island Aug 6, 2024, 6:39 PM

#

I don't think so. The right-hand side of a type statement is just an expression

urban sandal Aug 6, 2024, 6:39 PM

#

limiting it to the type statement would complicate the type statement further because currently ^

boreal umbra Aug 6, 2024, 6:40 PM

#

I see

urban sandal Aug 6, 2024, 6:41 PM

#

I think it's more compelling now than it was when it was rejected, but I'm personally not a fan of keyword arguments in indexing

feral island Aug 6, 2024, 6:42 PM

#

I think PEP 696 fits nicely with this syntax, it's nice to be able to name defaulted type parameters

boreal umbra Aug 6, 2024, 6:42 PM

#

!pep 696

fallen slateBOT Aug 6, 2024, 6:42 PM

#

**PEP 696 - Type Defaults for Type Parameters**

Status

Accepted

Python-Version

3.13

Created

14-Jul-2022

Type

Standards Track

boreal umbra Aug 6, 2024, 6:47 PM

#

how bad would a change such as this be for parsing efficiency? (where keyed_getitem is some imagined new pattern)

type_alias:
    | "type" NAME [type_params] '=' (expression | keyed_getitem)

feral island Aug 6, 2024, 6:48 PM

#

boreal umbra how bad would a change such as this be for parsing efficiency? (where `keyed_get...

that's fine as far as the parser goes, but I don't think it would make for a good user experience. For example, you could write type X = list[int, a=3] but not type X = list[int, a=3] | set[int, a=3]

boreal umbra Aug 6, 2024, 6:51 PM

#

feral island that's fine as far as the parser goes, but I don't think it would make for a goo...

I see. and when it's just "type" NAME [type_params] '=' expression, nothing extra needs to be done to make it recursive.
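
A quick way to check the current state of things, as a sketch: subscriptions accept positional items and compose like any other expression, but a keyword inside the brackets is a syntax error today. Plain assignments are used instead of `type` statements so this also runs on versions before 3.12; `parses` is just a helper for this example.

```python
import ast


def parses(source: str) -> bool:
    """Return True if `source` is syntactically valid Python."""
    try:
        ast.parse(source)
    except SyntaxError:
        return False
    return True


# Positional items: the index is simply the tuple (int, 3).
positional_ok = parses("X = list[int, 3]")

# Keyword inside a subscript: what PEP 637 proposed, currently rejected.
keyword_ok = parses("X = list[int, a=3]")

# Subscriptions are ordinary expressions, so they nest under operators like |.
union_ok = parses("X = list[int] | set[int]")

print(positional_ok, keyword_ok, union_ok)
```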

peak spoke Aug 7, 2024, 12:25 AM

#

Is there a way of setting an exception's __cause__ without having to raise with a from or is that the only mechanism that sets it?

quick snow Aug 7, 2024, 6:27 AM

#

peak spoke Is there a way of setting an exception's `__cause__` without having to raise wit...

You mean other than manually setting it?

#

!e

i = IndexError("oh no")
z = ZeroDivisionError("no!")
z.__cause__ = i
raise z

fallen slateBOT Aug 7, 2024, 6:28 AM

#

quick snow !e ```py i = IndexError("oh no") z = ZeroDivisionError("no!") z.__cause__ = i ra...

:x: Your 3.12 eval job has completed with return code 1.

IndexError: oh no

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/home/main.py", line 4, in <module>
    raise z
ZeroDivisionError: no!
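
For comparison, a small sketch showing that assigning `__cause__` by hand leaves the exception in the same state as `raise ... from`. Both also set `__suppress_context__` to True, which is why the traceback says "direct cause". The function names are made up for this example.

```python
def chain_with_from() -> ZeroDivisionError:
    """Build the chain with `raise ... from` and hand back the exception."""
    try:
        raise ZeroDivisionError("no!") from IndexError("oh no")
    except ZeroDivisionError as exc:
        return exc


def chain_manually() -> ZeroDivisionError:
    """Build the same chain by assigning the attribute directly."""
    exc = ZeroDivisionError("no!")
    exc.__cause__ = IndexError("oh no")  # the attribute `raise ... from` sets
    return exc


a, b = chain_with_from(), chain_manually()
print(repr(a.__cause__), a.__suppress_context__)
print(repr(b.__cause__), b.__suppress_context__)
```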

glass mulch Aug 7, 2024, 1:42 PM

#

From https://pyfound.blogspot.com/2024/06/python-language-summit-2024-pyrepl-new-default-repl-for-python.html:

Emily Morehouse, speaking as a Steering Council member added that the Steering Council has requested an informational PEP on the new REPL. "Hearing concerns about how [the new REPL] might be rolled out... it sounds like we might need something that's more compatible and an easier rollout", leaving the final discussions to the 3.13 release manager, Thomas Wouters. Carol replied that she believes "we could do it in documentation".

Does anybody know whether the final plan is to create a PEP or just documentation?

uneven raptor Aug 7, 2024, 4:22 PM

#

i had a conversation with someone the other day regarding the object structure, and it seems ob_refcnt_split is undocumented. is that something that should be?

feral island Aug 7, 2024, 4:23 PM

#

uneven raptor i had a conversation with someone the other day regarding the object structure, ...

Possibly in internal documentation, definitely not publicly

#

i.e. it should be documented for people who want to hack on CPython, not as something to rely on for users of the C API

peak spoke Aug 8, 2024, 9:24 AM

#

quick snow You mean other than manually setting it?

Yeah I was looking for something that doesn't use the dunder, as manipulating them directly usually is a bad idea, but that seems to be working fine so far

harsh atlas Aug 8, 2024, 5:27 PM

#

guys iam still at the beginning of python am i in the right channel or what

#

i still need guidance

heady solar Aug 8, 2024, 5:28 PM

#

harsh atlas guys iam still at the beginning of python am i in the right channel or what

You can ask questions here https://discord.com/channels/267624335836053506/267624335836053506 or make a thread in https://discord.com/channels/267624335836053506/1035199133436354600

zenith wadi Aug 9, 2024, 6:10 PM

#

I've been looking at this issue the last few days https://github.com/python/typeshed/issues/6347
I think no small amount of the problem is the byzantine implementation of lru_cache.
Does it need to be a bunch of nested functions and closured variables?
I'd like to simplify it to a plain-old class.

GitHub

functools._lru_cache_wrapper should be a descriptor class, not a ca...

I'm not entirely sure this is a bug in the type stub. It depends on the interpretation of ParamSpec when used with methods. This is related to the discussion here and this bug report filed in t...

feral island Aug 9, 2024, 6:14 PM

#

zenith wadi I've been looking at this issue the last few days https://github.com/python/type...

I don't think changes to the runtime implementation of lru_cache can affect how typeshed describes it. If you make a change to the runtime that makes the stubs materially different, you've probably made a backwards-incompatible change to the runtime.

zenith wadi Aug 9, 2024, 6:15 PM

#

agreed =/

#

still it seems unnecessarily obtuse

uneven raptor Aug 9, 2024, 10:46 PM

#

i'm thoroughly impressed with nogil, i've been stress testing some of my extensions that use threads and it's been able to run them without any changes

spark verge Aug 10, 2024, 11:44 AM

#

uneven raptor i'm thoroughly impressed with nogil, i've been stress testing some of my extensi...

You have c extensions that use threads?

uneven raptor Aug 10, 2024, 1:13 PM

#

spark verge You have c extensions that use threads?

yeah, via C11's thrd_t

boreal umbra Aug 10, 2024, 3:57 PM

#

Earlier I mentioned my "p-string" idea, where p"some/path" is equivalent to pathlib.Path("some/path"). And this idea is a non-starter because pathlib isn't implemented in C, and pathlib depends on several stdlib modules that also aren't implemented in C.

Though I wonder: would it be possible for the presence of a p-string in the code to trigger the importing of pathlib? does import do this with importlib?

uneven raptor Aug 10, 2024, 4:01 PM

#

you can import modules from the C API, yeah

feral island Aug 10, 2024, 4:05 PM

#

it's possible yes but there is no existing precedent for it in the language core

#

well, I suppose defining a generic does implicitly import typing

boreal umbra Aug 10, 2024, 4:07 PM

#

feral island well, I suppose defining a generic does implicitly import `typing`

how expensive is typing to import relative to something like pathlib and its dependencies?
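
One way to get a rough answer, as a sketch: run a fresh interpreter with `-X importtime` and read the cumulative time reported for each module. `cumulative_import_us` is a hypothetical helper, the parsing assumes the three-column `import time: self | cumulative | name` layout, and the numbers vary a lot between machines and runs.

```python
import subprocess
import sys


def cumulative_import_us(module: str) -> int:
    """Return the cumulative import time in microseconds reported for `module`."""
    proc = subprocess.run(
        [sys.executable, "-X", "importtime", "-c", f"import {module}"],
        capture_output=True,
        text=True,
        check=True,
    )
    # -X importtime writes one line per imported module to stderr.
    for line in proc.stderr.splitlines():
        if not line.startswith("import time:"):
            continue
        columns = line[len("import time:"):].split("|")
        # Nested imports are indented in the last column, hence the strip().
        if len(columns) == 3 and columns[2].strip() == module:
            return int(columns[1].strip())
    raise LookupError(f"no import time reported for {module!r}")


for name in ("typing", "pathlib"):
    print(name, cumulative_import_us(name), "us")
```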

feral island Aug 10, 2024, 4:07 PM

#

I don't know, not sure it really matters for this question

#

relatedly are you aware of

#

!pep 750

fallen slateBOT Aug 10, 2024, 4:08 PM

#

**PEP 750 - Tag Strings For Writing Domain-Specific Languages**

Status

Draft

Python-Version

3.14

Created

08-Jul-2024

Type

Standards Track

boreal umbra Aug 10, 2024, 4:47 PM

#

feral island relatedly are you aware of

I am not--thank you for bringing it to my attention

faint river Aug 10, 2024, 6:08 PM

#

oh I was thinking exactly p-strings when I saw this pep

#

import pathlib
from typing import Decoded

def p(path: Decoded) -> pathlib.Path:
    return pathlib.Path(path.raw)

print(p"some/path")

something like that?

dusk comet Aug 10, 2024, 6:25 PM

#

faint river ```py import pathlib from typing import Decoded def p(path: Decoded) -> pathlib...

what about p'C:/Users/{username}/blabla'? your tag function does not accept that

#

that makes me think that a helper to produce a string with all inline fields evaluated will be used pretty often

faint river Aug 10, 2024, 6:26 PM

#

dusk comet what about `p'C:/Users/{username}/blabla'`? your tag function does not accept th...

I don't believe it should accept that

#

of course it should probably have a good error message to deal with it tho

dusk comet Aug 10, 2024, 6:28 PM

#

from ... import make_string

def p(*parts: Decoded | ?) -> Path:
    return Path(make_string(parts))

#

the PEP says:

Tag functions accept prepared arguments and return a string:

does it mean that returning Path will raise an exception?

#

or it is a typo

faint river Aug 10, 2024, 6:28 PM

#

that's gotta be a typo or an error

#

they even show examples of not returning a string

faint river Aug 10, 2024, 6:29 PM

#

dusk comet ```py from ... import make_string def p(*parts: Decoded | ?) -> Path: retur...

do you want to do interpolation or just want the raw {whatever} in there?

dusk comet Aug 10, 2024, 6:30 PM

#

faint river do you want to do interpolation or just want the raw `{whatever}` in there?

i want it to behave just like an f-string with tag applied on top of it

faint river Aug 10, 2024, 6:32 PM

#

perhaps the PEP should introduce a function which takes the received arguments from tagging and uses them as an f-string would

dusk comet Aug 10, 2024, 6:33 PM

#

faint river perhaps the PEP should introduce a function which takes the received arguments f...

and another that would return just a raw content from it

faint river Aug 10, 2024, 6:34 PM

#

dusk comet and another that would return just a raw content from it

real raw content or applying escape sequences?

dusk comet Aug 10, 2024, 6:35 PM

#

well, i guess none of it makes sense
you always can do p(''), p(r''), p(f'') to be precise instead of p''

faint river Aug 10, 2024, 6:35 PM

#

true

dusk comet Aug 10, 2024, 6:35 PM

#

tags should be used only if you intend to do something special with interpolation parts
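
Spelled out as a sketch that runs on current Python: with an ordinary function, the caller picks plain, raw, or f-string semantics at the call site, so no tag is needed. `p` here is a hypothetical helper and `username` is an illustrative value.

```python
from pathlib import Path


def p(path: str) -> Path:
    """Hypothetical helper: an ordinary function standing in for a `p` tag."""
    return Path(path)


username = "alice"  # illustrative value

plain = p("some/path")
raw = p(r"C:\Users\alice")                       # backslashes kept literally
interpolated = p(f"C:/Users/{username}/blabla")  # f-string is evaluated before the call

print(plain, raw, interpolated)
```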

faint river Aug 10, 2024, 6:36 PM

#

i think the main reason people would want p-strings is to be able to use them without explicitly importing pathlib, because otherwise it's just mildly shorter syntax

#

like you would be able to just use them without any hassle whatsoever

#

just realized I don't think it's possible to write a 100% accurate raw_str tag

#

because of the = feature of f-strings

dusk comet Aug 10, 2024, 6:56 PM

#

and also p'a{x:{fmt}}b'

faint river Aug 10, 2024, 7:23 PM

#

tru

thick hemlock Aug 10, 2024, 7:48 PM

#

that was a really quick merge Jelle

#

thank you lol

shy grove Aug 10, 2024, 9:30 PM

#

faint river oh I was thinking exactly p-strings when I saw this pep

from .tags import p

path = p"some/path"

from pathlib import Path

path = Path("some/path")

I'm not really convinced that the former is really an improvement over the latter

#

The thread mentions the idea of stdlib including some pre-defined tags and that would make it much more appealing to me

#

There was a thread about wanting pathlib.Path.realpath() which basically did .expanduser().resolve(), I can see that getting added as a tag function in stdlib as a cool little use of this pep
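
As a plain function, that combination would look roughly like this. `realpath` is a hypothetical helper for illustration; `pathlib.Path` has no such method today, and the example path is made up.

```python
from pathlib import Path


def realpath(path: str) -> Path:
    """Hypothetical helper: expand a leading ~ and resolve to an absolute path."""
    return Path(path).expanduser().resolve()


# The path does not need to exist; resolve() is non-strict by default.
home_notes = realpath("~/docs/../notes.txt")
print(home_notes)
```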

jade raven Aug 10, 2024, 9:37 PM

#

shy grove ```py from .tags import p path = p"some/path" ``` ```py from pathlib import Pat...

the Path pre tag was proposed as its own separate PEP i believe, maybe this is a workaround to make it more general and easier to digest

shy grove Aug 10, 2024, 9:38 PM

#

not aware of a separate pep

#

I was just talking about pep 750

uneven raptor Aug 10, 2024, 10:58 PM

#

jade raven the Path pre tag was proposed as its own separate PEP i believe, maybe this is ...

yeah, i think PEP 750 is the better solution

spark magnet Aug 10, 2024, 11:19 PM

#

shy grove The thread mentions the idea of stdlib including some pre-defined tags and that ...

That means having p pre-defined in builtins? That sounds terrible.

jade raven Aug 10, 2024, 11:19 PM

#

uneven raptor yeah, i think PEP 750 is the better solution

i can't seem to find the original PEP however

uneven raptor Aug 10, 2024, 11:23 PM

#

jade raven i can't seem to find the original PEP however

was it a discourse thread instead of a PEP?

jade raven Aug 10, 2024, 11:24 PM

#

uneven raptor was it a discourse thread instead of a PEP?

i seem to remember it being an actual PEP, i could be wrong though

uneven raptor Aug 10, 2024, 11:24 PM

#

i don't remember ever seeing a PEP for that

alpine rose Aug 11, 2024, 3:43 AM

#

something about the no space function call is weird to me. i liked steve's suggestion of "have an i-string" and then you do regex(i"my_escaped_{word}") or whatever. solves the dotted name / namespace problem, is more minimal syntax, avoids weird things that beginners may run into like print"asdf", etc

shy grove Aug 11, 2024, 5:45 AM

#

spark magnet That means having `p` pre-defined in builtins? That sounds terrible.

I was thinking more of an import like from stdlibtags import regexstring

#

And with slightly more descriptive names

#

I also like the i-string suggestion fwiw

radiant garden Aug 11, 2024, 8:11 AM

#

callable juxtaposition is definitely a learnability issue waiting to manifest, it sounds fantastic for #esoteric-python though

shy grove Aug 11, 2024, 11:17 AM

#

alpine rose something about the no space function call is weird to me. i liked steve's sugge...

The more I think about it, the more appealing this sounds to me

grave jolt Aug 11, 2024, 12:55 PM

#

alpine rose something about the no space function call is weird to me. i liked steve's sugge...

Will be extra fun with soft keywords. Like match"foo":

glass mulch Aug 11, 2024, 2:14 PM

#

This works in current implementation:

def greet(*args):
    """Uppercase and add exclamation."""
    salutation = args[0].upper()
    return f"{salutation}!"

print(greet"Hello")
__builtins__.__dict__["raise"] = greet
raise"Well that's novel"

Outputs (in the playground):

HELLO!

"WELL THAT'S NOVEL!"

winged sphinx Aug 11, 2024, 2:39 PM

#

greet"Hello" is lazily evaluated, right? Is it similar to returning a partial of greet(...)? I guess I need to read the full PEP.

swift imp Aug 11, 2024, 3:27 PM

#

I really don't get the tags PEP and it frustrates me, like I'm not seeing the power or how it can help with making a dsl (only dsl I'm really familiar with is Jinja or Jenkins declarative pipeline syntax). I don't see the power in it or any benefits

uneven raptor Aug 11, 2024, 4:47 PM

#

swift imp I really dont get the tags PEP and it frustrates me, like I'm not seeing the pow...

think of them like user-defined f strings. instead of adding more string prefixes (such as p, as suggested above) to the interpreter itself, the user can make them

dusk comet Aug 11, 2024, 5:11 PM

#

fallen slate

First, the format_spec can be arbitrarily nested:
mytag'{x:{a{b{c}}}}'
im not sure what this is supposed to mean
f'{x:{a{b{c}}}}' is invalid syntax currently

#

No Implicit String Concatenation
Implicit tag string concatenation isn’t supported, which is unlike other string literals.

The expectation is that triple quoting is sufficient. If implicit string concatenation is supported, results from tag evaluations would need to support the + operator with __add__ and __radd__.
this doesnt really make sense
'a' "b" does not perform any addition, it is just a way to write 'ab'
so i dont see a reason for tag'a' tag'b' to not be equivalent to tag'ab'

grave jolt Aug 11, 2024, 5:29 PM

#

dusk comet > No Implicit String Concatenation > Implicit tag string concatenation isn’t sup...

With other modifiers, e.g. r and f: r"hm\n{foo}" f"\nbar{baz}" means r"hm\n{foo}" + f"\nbar{baz}", not rf"hm\n{foo}\nbar{baz}"
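A quick check of this with today's prefixes (the values of `foo` and `baz` are made up for illustration):

```python
foo, baz = "FOO", "BAZ"  # hypothetical values, only here so the f-string has something to interpolate

# Adjacent literals are joined piece by piece; each piece keeps its own prefix.
implicit = r"hm\n{foo}" f"\nbar{baz}"
explicit = r"hm\n{foo}" + f"\nbar{baz}"

assert implicit == explicit
# The raw piece keeps its backslash and braces; only the f-string piece is interpolated.
assert implicit == "hm\\n{foo}\nbarBAZ"
```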

#

So if you wanted to make implicit string concatenation work, it would be (tag"a" + tag"b"). But a tagged string doesn't have to return a string, which makes implicit concatenation on them kinda nonsensical

#

Like, if nd"0 1 2, {x} 4 5" makes a numpy array, should nd"0 1 2" nd"{x} 4 5" produce np.array([0, 1, 2]) + np.array([x, 4, 5])?

#

it would be extra awkward because path strings won't be concatenable

dusk comet Aug 11, 2024, 5:32 PM

#

i see
there is no problem in treating tag'a' tag'b' as tag'ab', but tag1'a' tag2'b' cannot be implicitly concatenated because tags are different

grave jolt Aug 11, 2024, 5:32 PM

#

And what if the tags are different?

grave jolt Aug 11, 2024, 5:34 PM

#

dusk comet i see there is no problem in treating `tag'a' tag'b'` as `tag'ab'`, but `tag1'a'...

Hm, maybe it could make sense to do this. But we'll need to see what people will actually use these for

dusk comet Aug 11, 2024, 5:34 PM

#

is there a real usecase for implicit string concatenation?

glass mulch Aug 11, 2024, 5:35 PM

#

int"1" + int"2" works

grave jolt Aug 11, 2024, 5:35 PM

#

dusk comet is there a real usecase for implicit string concatenation?

When the string literal is too long to fit in one line, and you don't want to add a bunch of +s or introduce whitespace

#

I don't really like this feature, because it's an easy footgun
```py
things = [
    "foo",
    "bar",
    "baz,"
    "fizz",
    "buzz",
    "final item"
    "wait, another one"
]
```
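Counting the items makes the problem visible. A small sketch, using the same list: the misplaced and missing commas silently merge neighbouring literals, so the list is shorter than it looks.

```python
things = [
    "foo",
    "bar",
    "baz,"        # the comma is inside the quotes, so this joins the next literal
    "fizz",
    "buzz",
    "final item"  # no trailing comma, so this joins the next literal too
    "wait, another one"
]

# Seven lines of strings, but only five list items.
print(len(things))
print(things)
```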

dusk comet Aug 11, 2024, 5:36 PM

#

grave jolt Hm, maybe it could make sense to do this. But we'll need to see what people will...

i realized that tag'a' tag'b' -> tag'ab' can produce results that make little sense
consider np'1 2' == [1, 2] and np'3 4' == [3, 4]
it would be weird to have np'1 2' np'3 4' == np'1 23 4' == [1, 23, 4]

grave jolt Aug 11, 2024, 5:37 PM

#

yep

#

Perhaps the tag could decide what to do with implicit string concatenation. It might make sense for some tags (like HTML) but not for others (like numpy)

dusk comet Aug 11, 2024, 5:37 PM

#

grave jolt When the string literal is too long to fit in one line, and you don't want to ad...

use multiline strings then

grave jolt Aug 11, 2024, 5:37 PM

#

dusk comet use multiline strings then

or introduce whitespace

dusk comet Aug 11, 2024, 5:38 PM

#

i hate python sometimes

glass mulch Aug 11, 2024, 5:39 PM

#

I worry about it being too powerful actually. Decimal literals? Fixed integer sizes? Random syntax? Calling functions with quotes? Everything becomes possible. Very fun to write, but will it be easy to read and understand?

dusk comet Aug 11, 2024, 5:41 PM

#

dusk comet i realized that `tag'a' tag'b'` -> `tag'ab'` can produce results that make littl...

there is a workaround for the case with same tag:
```py
np'a{x}b' -> np('a', lambda: x, 'b')
np'c{y}d' -> np('c', lambda: y, 'd')

np'a{x}b' np'c{y}d' -> np('a', lambda: x, 'bc', lambda: y, 'd')
       ^     ^                            ^^^^ bad

np'a{x}b' np'c{y}d' -> np('a', lambda: x, 'b', 'c', lambda: y, 'd')
       ^     ^                            ^^^^^^^^ ok
```
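A rough sketch of the difference, using ordinary call syntax since tag syntax isn't available. `np_tag` is a made-up stand-in that only parses integers, and the interpolations are left out to keep it short:

```python
def np_tag(*pieces: str) -> list[int]:
    """Hypothetical tag: parse whitespace-separated integers from each string piece."""
    numbers: list[int] = []
    for piece in pieces:
        numbers.extend(int(token) for token in piece.split())
    return numbers


# Merging the literals before the tag sees them glues "2" and "3" together.
merged = np_tag("1 2" "3 4")

# Passing the pieces separately keeps the boundary visible to the tag.
separate = np_tag("1 2", "3 4")

print(merged)
print(separate)
```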

#

!pypi custom-literals

fallen slateBOT Aug 11, 2024, 5:41 PM

#

custom-literals v0.1.3

A module implementing custom literal suffixes using pure Python

Released on April 1, 2022.

swift imp Aug 11, 2024, 5:45 PM

#

uneven raptor think of them like user-defined f strings. instead of adding more string prefixe...

Ok that makes more sense

raven ridge Aug 11, 2024, 5:52 PM

#

calling PEP 750 "Tag Strings For Writing Domain-Specific Languages" just seems very strange to me. To the extent that this allows creating DSLs, it allows creating rigid, strange DSLs that bear little resemblance to other languages and that do a poor job of allowing users to express themselves

heady solar Aug 11, 2024, 5:55 PM

#

From skimming it I got the impression that it's supposed to make it easier to work with DSLs, not create them
Like they give examples of SQL and templates in Jinja (not sure if the second one is a DSL?)
So the use of "writing" here is strange

raven ridge Aug 11, 2024, 5:57 PM

#

PEP 501 seems much more reasonable to me, at a quick skim

uneven raptor Aug 11, 2024, 5:59 PM

#

unrelated, but is it possible that PEP 556 is revived with the introduction of nogil?

dusk comet Aug 11, 2024, 6:01 PM

#

!pep 501

fallen slateBOT Aug 11, 2024, 6:01 PM

#

**PEP 501 - General purpose string interpolation**

Status

Deferred

Python-Version

3.6

Created

08-Aug-2015

Type

Standards Track

raven ridge Aug 11, 2024, 6:03 PM

#

heady solar From skimming it I got the impression that it's supposed to make it easier to wo...

I'm not convinced that is what they mean, because PEP 501 i-strings meet that need. They say that "The authors of [PEP 750] consider tag strings as a generalization of the updated work in PEP 501", and the only innovation I see in PEP 750 is that it lets you do foo"{x}" instead of foo(i"{x}") - surely that is the DSL they're talking about

little robin Aug 11, 2024, 6:06 PM

#

Hello my am

raven ridge Aug 11, 2024, 6:07 PM

#

and the cost of that is that it'll be impossible to add new string prefixes in the future. They note in https://peps.python.org/pep-0750/#valid-tag-names that any existing string prefix must be an invalid tag name, but they don't acknowledge that this implies that introducing any new string prefixes in the future would be backwards-incompatible, as they might conflict with user-defined tag names

little robin Aug 11, 2024, 6:07 PM

#

I have a question

#

Which game making program is best suited for Python?

raven ridge Aug 11, 2024, 6:09 PM

#

try asking in #python-discussion, @little robin

winged sphinx Aug 11, 2024, 6:09 PM

#

raven ridge and the cost of that is that it'll be impossible to add new string prefixes in t...

Oh that's an interesting problem, as well as combinations of existing tags (rf, etc).

dusk comet Aug 11, 2024, 6:10 PM

#

why are existing string prefixes case insensitive? why do URFB'' prefixes exist at all?
(i vaguely remember that some of them had a little different behaviour somewhere)

raven ridge Aug 11, 2024, 6:14 PM

#

winged sphinx Oh that's interesting problem, as well as combinations of existing tags (rf, etc...

and given that we've introduced new string prefixes repeatedly (r, then u, then b, then f) it seems like a really bad bet to say that we'll never need any new one again once we have tag strings

uneven raptor Aug 11, 2024, 6:48 PM

#

raven ridge and the cost of that is that it'll be impossible to add new string prefixes in t...

i was under the impression that the point of it was so that they wouldn't have to add new ones in the future

#

theoretically, couldn't they add new string prefixes in a backwards compatible manner by just using the new one if it's not in the namespace?

winged sphinx Aug 11, 2024, 6:53 PM

#

uneven raptor theoretically, couldn't they add new string prefixes in a backwards compatible m...

In the current way, you can combine them (r, f, rf, etc). How would that be compatible in the new way?

uneven raptor Aug 11, 2024, 6:54 PM

#

oh, i didn't see that

raven ridge Aug 11, 2024, 6:59 PM

#

even setting that aside, this absolutely doesn't imply that we'll never need a new one in the future. The existing string prefixes change the way that the string is parsed. If we didn't already have raw strings, you wouldn't be able to define an r that behaves like r"a\b" does today using only the tools that PEP 750 tag strings would give you

radiant garden Aug 11, 2024, 7:08 PM

#

doesn't it provide raw string contents?

raven ridge Aug 11, 2024, 7:09 PM

#

yeah, I'm wrong - I now see that the proposed Decoded does give you access to the raw string, which would be enough to let you do it. Not well - you'd wind up parsing it at runtime instead of compile time - but at least it's possible

#

ah, no - I was right the first time, based on

mytag'{expr=}' is parsed to being the same as mytag'expr={expr}'

#

that means that you wouldn't be able to implement an r tag because you wouldn't be able to distinguish r"{x=}" from r"x={x}"

#

actually, even deeper than that - r"{" is valid today, but if I'm reading the PEP right, sometag"{" would be syntactically invalid, so that's another reason why it wouldn't be possible to define r as a tag function
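Both raw-string facts can be checked in current Python. The f-string check at the end stands in for tag strings, on the assumption (from my reading of the PEP, not verified against the prototype) that they follow the same brace rules:

```python
# Raw strings never interpret braces, so these are two different literals.
assert r"{x=}" != r"x={x}"
assert len(r"{x=}") == 4

# A lone brace is fine in a raw string ...
assert r"{" == "{"

# ... but a lone brace in an f-string is rejected at compile time.
try:
    compile('f"{"', "<example>", "eval")
except SyntaxError:
    lone_brace_ok_in_fstring = False
else:
    lone_brace_ok_in_fstring = True

print(lone_brace_ok_in_fstring)
```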

winged sphinx Aug 11, 2024, 7:28 PM

#

Is this solvable by requiring the new string functions have some prefix, like xgreet"blah"? Not pretty, just thinking out loud

raven ridge Aug 11, 2024, 7:40 PM

#

yes, or even mandating a minimum length

#

if all existing prefixes are one or two characters, we're probably safe reserving 1 or 2 character prefixes for the language and letting user-defined identifiers be 3+ characters

grave jolt Aug 11, 2024, 7:44 PM

#

so p is out of the question?

raven ridge Aug 11, 2024, 7:47 PM

#

I think it should be. And, if we had p, note you'd have trouble representing a filename containing { - you'd need to escape the { as {{ or use a regular string literal and call Path explicitly
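For comparison, this is how the brace escaping looks with f-strings today, next to the plain `Path(...)` call (the file names are made up):

```python
from pathlib import Path

name = "report"  # hypothetical value

# In f-string syntax a literal brace has to be doubled.
assert f"{{{name}}}.txt" == "{report}.txt"

# A regular string literal passed to Path needs no escaping at all.
assert Path("dir/{draft}.txt").name == "{draft}.txt"
```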

uneven raptor Aug 11, 2024, 8:26 PM

#

from the decoded strings section, this snippet makes no sense:

decoded = raw.encode("utf-8").decode("unicode-escape")
if decoded == raw:
    decoded = raw

#

what's the point of this if?

raven ridge Aug 11, 2024, 8:30 PM

#

I think the idea is that otherwise it would take 2x the memory

#

I think that's replacing two distinct but equal strings with 2 references to the same string
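A small sketch of that reading. This is my interpretation of the snippet, not something the PEP states, and the sample strings are invented:

```python
# A raw piece that contains an escape: decoding produces a different string.
raw = r"tab:\t end"  # backslash followed by the letter t, not a tab character
decoded = raw.encode("utf-8").decode("unicode-escape")
assert decoded == "tab:\t end"
assert decoded != raw

# A raw piece without escapes: decoding produces an equal string, so the
# `if` swaps it for the original object instead of keeping an equal copy.
plain = "no escapes here"
decoded_plain = plain.encode("utf-8").decode("unicode-escape")
if decoded_plain == plain:
    decoded_plain = plain
assert decoded_plain is plain
```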

uneven raptor Aug 11, 2024, 8:31 PM

#

not something i would normally think about in python code, but interesting

#

i'm guessing it was translated to python from c code

raven ridge Aug 11, 2024, 8:32 PM

#

yeah. if I'm right that it's just an optimization, I'm surprised that they bothered to illustrate it, rather than dropping that from their translation...

uneven raptor Aug 11, 2024, 8:36 PM

#

i'm curious about what error they'll pick for the disallowed tag names

#

SyntaxError?

#

or, how will they actually implement it? both of these are valid pieces of code, it would be a breaking change to make it error in the future

def f(*args):
    ...

print(f"whatever")

raven ridge Aug 11, 2024, 8:38 PM

#

uneven raptor `SyntaxError`?

I'd expect so

uneven raptor Aug 11, 2024, 8:39 PM

#

uneven raptor or, how will they actually implement it? both of these are valid pieces of code,...

i'd personally be much more comfortable with a SyntaxWarning, for this reason

raven ridge Aug 11, 2024, 8:40 PM

#

I'm not sure what you're saying is (or would be) a breaking change

uneven raptor Aug 11, 2024, 8:41 PM

#

raven ridge I'm not sure what you're saying is (or would be) a breaking change

how are they going to specifically disallow use of those names as tags?

feral island Aug 11, 2024, 8:42 PM

#

something in the tokenizer or grammar, doesn't seem too difficult

radiant garden Aug 11, 2024, 8:43 PM

#

this is in my opinion a stretch of a generalization

uneven raptor Aug 11, 2024, 8:43 PM

#

feral island something in the tokenizer or grammar, doesn't seem too difficult

i'm just curious about how they'll differentiate f"hello world" as a user trying to use an fstring, or if they're trying to use a tag called f

raven ridge Aug 11, 2024, 8:44 PM

#

the PEP defines that - it's an f string

feral island Aug 11, 2024, 8:44 PM

#

uneven raptor i'm just curious about how they'll differentiate `f"hello world"` as a user tryi...

there's no differentiating, it's just always an f-string

uneven raptor Aug 11, 2024, 8:44 PM

#

that seems like it could cause some odd problems for beginners wondering why their code is ignoring their tag

raven ridge Aug 11, 2024, 8:45 PM

#

I would expect beginners to never define their own tags

uneven raptor Aug 11, 2024, 8:45 PM

#

fair enough

uneven raptor Aug 11, 2024, 8:46 PM

#

raven ridge the PEP defines that - it's an f string

this section mentions raising an error

raven ridge Aug 11, 2024, 8:46 PM

#

that's talking about hard keywords

#

you can't do return"foo" or is"foo"

radiant garden Aug 11, 2024, 8:47 PM

#

technically a breaking change

uneven raptor Aug 11, 2024, 8:48 PM

#

raven ridge you can't do `return"foo"` or `is"foo"`

ah, i got confused from the prefixes listed above

raven ridge Aug 11, 2024, 8:49 PM

#

honestly, I don't think they even need to specify this

#

none of those can be used as names for the callable, so of course none of them can be used for the tag name

uneven raptor Aug 11, 2024, 8:50 PM

#

yeah, it's just confusing

raven ridge Aug 11, 2024, 8:50 PM

#

(they may be specifying this since it's relevant at the level of the grammar, but it's not relevant at the level of a Python programmer using the feature)

uneven raptor Aug 11, 2024, 8:50 PM

#

will there be any nicer errors for trying to use functions that are not tags? e.g. print"hi"

feral island Aug 11, 2024, 8:50 PM

#

I saw someone show an example where they did builtins.__dict__["raise"] = some_callable and then raise"foo" worked in the prototype

uneven raptor Aug 11, 2024, 8:51 PM

#

(i've only skimmed through the PEP, FWIW)

feral island Aug 11, 2024, 8:51 PM

#

feral island I saw someone show an example where they did `builtins.__dict__["raise"] = some_...

it probably should not work

uneven raptor Aug 11, 2024, 8:52 PM

#

feral island I saw someone show an example where they did `builtins.__dict__["raise"] = some_...

how did that get past the parser? raise"foo" is invalid syntax

radiant garden Aug 11, 2024, 8:52 PM

#

it's perfectly valid, it raises a string

#

but also in the proto it's a tag

uneven raptor Aug 11, 2024, 8:52 PM

#

!e raise"a"

fallen slateBOT Aug 11, 2024, 8:52 PM

#

uneven raptor !e raise"a"

:x: Your 3.12 eval job has completed with return code 1.

Traceback (most recent call last):
  File "/home/main.py", line 1, in <module>
    raise"a"
TypeError: exceptions must derive from BaseException

winged sphinx Aug 11, 2024, 8:53 PM

#

I do like that with this, we have a nicer solution for lazy eval log messages

uneven raptor Aug 11, 2024, 8:53 PM

#

TIL

feral island Aug 11, 2024, 8:53 PM

#

uneven raptor how did that get past the parser? `raise"foo"` is invalid syntax

I think in the prototype it doesn't tokenize to a keyword, it tokenizes to a tag

uneven raptor Aug 11, 2024, 8:53 PM

#

i was under the impression that a space was needed between raise and the exception

radiant garden Aug 11, 2024, 8:53 PM

#

same deal with return, await, yield, yield from

#

most anything actually

uneven raptor Aug 11, 2024, 8:55 PM

#

oh yeah, because it's a literal, righttt

raven ridge Aug 11, 2024, 8:55 PM

#

uneven raptor will there be any nicer errors for trying to use functions that are not tags? e....

who says print is not a tag?

feral island Aug 11, 2024, 8:55 PM

#

radiant garden same deal with return, await, yield, yield from

That's a good observation, probably worth bringing up on Discourse. The PEP's backwards compatibility section is rather thin https://peps.python.org/pep-0750/#backwards-compatibility

#

it's true that return"x" is currently valid syntax and the PEP (at least in the current version) will change what it means

uneven raptor Aug 11, 2024, 8:57 PM

#

raven ridge who says `print` is not a tag?

ok, bad example 😛. what about something more strict, like max"hello"

raven ridge Aug 11, 2024, 8:57 PM

#

raven ridge who says `print` is not a tag?

less glibly: given that we define what a tag function is entirely by the interface it conforms to, given that print() does conform to that interface, it seems to me that it is a tag

#

max conforms to the interface, too, as long as no placeholders are given - right?

uneven raptor Aug 11, 2024, 8:58 PM

#

we're slowly reverting back to python 2's print statement

dusk comet Aug 11, 2024, 8:58 PM

#

!e print(max('foo'))

fallen slateBOT Aug 11, 2024, 8:58 PM

#

dusk comet !e `print(max('foo'))`

:white_check_mark: Your 3.12 eval job has completed with return code 0.

feral island Aug 11, 2024, 8:59 PM

#

uneven raptor Aug 11, 2024, 8:59 PM

#

struggling to pick a builtin that doesn't support str

feral island Aug 11, 2024, 8:59 PM

#

(from https://pauleveritt.github.io/tagstr-site/playground/lab/index.html)

dusk comet Aug 11, 2024, 8:59 PM

#

raven ridge `max` conforms to the interface, too, as long as no placeholders are given - rig...

no placeholders = 0 or 1 string piece
max will fail with 0 args
upd: max will fail with 1 arg too, because it would be a DecodedConcrete

so you can't make max work as a tag

grave jolt Aug 11, 2024, 8:59 PM

#

uneven raptor struggling to pick a builtin that doesn't support `str`

abs"olutely"

feral island Aug 11, 2024, 8:59 PM

#

uneven raptor Aug 11, 2024, 9:00 PM

#

so it just throws a TypeError, interesting

raven ridge Aug 11, 2024, 9:00 PM

#

honestly, I think this is a very good argument for why structural subtyping isn't good enough for this, and an ABC would be desirable

uneven raptor Aug 11, 2024, 9:00 PM

#

i guess that makes sense

raven ridge Aug 11, 2024, 9:00 PM

#

users will get bad error messages if we infer tag-ness instead of declaring it

grave jolt Aug 11, 2024, 9:01 PM

#

Maybe a decorator, like @tagtools.tag

dusk comet Aug 11, 2024, 9:01 PM

#

!pypi tags

fallen slateBOT Aug 11, 2024, 9:01 PM

#

tags v0.0.3

A toolkit to create HTML code with python

Released on November 3, 2009.

dusk comet Aug 11, 2024, 9:01 PM

#

!pypi tagtools

fallen slateBOT Aug 11, 2024, 9:01 PM

#

tagtools v0.8d

Python helpers to work with tags.

Released on August 22, 2010.

dusk comet Aug 11, 2024, 9:01 PM

#

grave jolt Maybe a decorator, like `@tagtools.tag`

you are killing pypi libs 💀

grave jolt Aug 11, 2024, 9:01 PM

#

Yes, the users of this package from 2010 will be in utter shambles

raven ridge Aug 11, 2024, 9:01 PM

#

@types.tag or something would be fine

uneven raptor Aug 11, 2024, 9:02 PM

#

i think types would be a weird place to put it

grave jolt Aug 11, 2024, 9:02 PM

#

is that like stdlib.h in C? If we don't know where to put it, put it in types 🙂

feral island Aug 11, 2024, 9:02 PM

#

grave jolt Yes, the users of this package from 2010 will be in utter shambles

it never had any release apparently

dusk comet Aug 11, 2024, 9:02 PM

#

@ast.tag

grave jolt Aug 11, 2024, 9:02 PM

#

feral island it never had any release apparently

wdym https://pypi.org/project/tagtools/#history

uneven raptor Aug 11, 2024, 9:02 PM

#

grave jolt is that like `stdlib.h` in C? If we don't know where to put it, put it in `types...

from my understanding, types is used for getting native types (e.g. coroutine) at runtime

feral island Aug 11, 2024, 9:02 PM

#

hm maybe pypi is broken for me

#

direct link works but then if I click on one of the tabs it goes blank

raven ridge Aug 11, 2024, 9:03 PM

#

grave jolt is that like `stdlib.h` in C? If we don't know where to put it, put it in `types...

seems to fit well enough to me - it is a type. Honestly, I think it'd be fine to even make it an actual type and force library authors to inherit their tag callables from it

#

could go in string as well, come to think of it

#

class sql(string.tag):

dusk comet Aug 11, 2024, 9:04 PM

#

!d string

fallen slateBOT Aug 11, 2024, 9:04 PM

#

string

Source code: Lib/string.py

grave jolt Aug 11, 2024, 9:04 PM

#

this is how we might acquire the second and potentially third users of the string module 😛

feral island Aug 11, 2024, 9:04 PM

#

one of the less-loved stdlib modules

uneven raptor Aug 11, 2024, 9:05 PM

#

at that point, just add it as a method of str like @str.tag

grave jolt Aug 11, 2024, 9:05 PM

#

wait, it has stuff like string.ascii_lowercase. nevermind

raven ridge Aug 11, 2024, 9:05 PM

#

string.Template is genuinely useful and should be used more

uneven raptor Aug 11, 2024, 9:05 PM

#

i've used string.ascii_letters for random strings many times

#

!d string.Template

fallen slateBOT Aug 11, 2024, 9:06 PM

#

string.Template


class string.Template(template)
The constructor takes a single argument which is the template string.

uneven raptor Aug 11, 2024, 9:08 PM

#

why does it use the $ syntax instead of {}?

raven ridge Aug 11, 2024, 9:08 PM

#

I think it predates any use of {} for placeholders in Python

#

and $ has been used for placeholders in lots of other languages
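For anyone who hasn't used it, a short example of the `$` syntax next to `str.format` (the template text is made up):

```python
from string import Template

# $name marks a placeholder; substitute() fills it from keyword arguments.
greeting = Template("Hello, $name!")
assert greeting.substitute(name="world") == "Hello, world!"

# The same thing with {} placeholders and str.format.
assert "Hello, {name}!".format(name="world") == "Hello, world!"

# safe_substitute() leaves unknown placeholders in place instead of raising KeyError.
partial = Template("$a and $b").safe_substitute(a="x")
assert partial == "x and $b"
```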

uneven raptor Aug 11, 2024, 9:10 PM

#

raven ridge I think it predates any use of `{}` for placeholders in Python

haven't we had that since like, the early 2000s?

#

!pep 3101

fallen slateBOT Aug 11, 2024, 9:10 PM

#

**PEP 3101 - Advanced String Formatting**

Status

Final

Python-Version

3.0

Created

16-Apr-2006

Type

Standards Track

feral island Aug 11, 2024, 9:10 PM

#

!pep 292

fallen slateBOT Aug 11, 2024, 9:11 PM

#

**PEP 292 - Simpler String Substitutions**

Status

Final

Python-Version

2.4

Created

18-Jun-2002

Type

Standards Track

feral island Aug 11, 2024, 9:11 PM

#

^ this one introduced string.Template

raven ridge Aug 11, 2024, 9:12 PM

#

actually - requiring that the tag callables derive from some base class would address a lot of my concerns with PEP 750. It addresses the concern about random functions accidentally working as tags, and of bad error messages when trying to use a function that doesn't support the interface as a tag, and it allows an extension point for tags to opt into different behavior in the future. Imagine in the future we discover a need to suppress the {x=} -> x={x} expansion for some tags - the class could just have a def __tag_debug_string_expansion__(self): return False, which would allow a backwards-compatible way of changing how tagged strings are parsed in the future
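A rough sketch of what I mean. Every name here (`Tag`, `Shout`, `apply_tag`) is hypothetical, and `apply_tag` only imitates the check the interpreter might perform; none of this is in PEP 750 or the stdlib:

```python
from abc import ABC, abstractmethod


class Tag(ABC):
    """Hypothetical base class that tag callables would have to derive from."""

    def __tag_debug_string_expansion__(self) -> bool:
        # Hypothetical opt-out hook: a subclass returns False to suppress
        # the {x=} -> x={x} expansion.
        return True

    @abstractmethod
    def __call__(self, *args):
        ...


class Shout(Tag):
    """Example tag: joins its pieces and upper-cases them."""

    def __call__(self, *args):
        return "".join(str(arg) for arg in args).upper()


def apply_tag(tag, *args):
    """Stand-in for what the interpreter might do when it sees tag"..."."""
    if not isinstance(tag, Tag):
        raise TypeError(f"{tag!r} is not declared as a tag")
    return tag(*args)


print(apply_tag(Shout(), "hello ", "world"))
try:
    apply_tag(max, "hello")  # an ordinary function is rejected up front
except TypeError as exc:
    print(exc)
```

With a declared base class, the error for `max"hello"` comes from the tag check itself instead of from whatever `max` happens to do with its arguments.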

uneven raptor Aug 11, 2024, 9:12 PM

#

feral island ^ this one introduced string.Template

oh, i didn't realize string was that old

uneven raptor Aug 11, 2024, 9:13 PM

#

raven ridge actually - requiring that the tag callables derive from some base class would ad...

be the discourse message you want to see in the world 😄

raven ridge Aug 11, 2024, 9:32 PM

#

ok, done

uneven raptor Aug 11, 2024, 9:36 PM

#

some bikeshedding: __tag_uses_debug_expansion__ is too long

#

what about like, __tagexpand__

#

or really, since this is in an ABC, just __expands__ or something like that

raven ridge Aug 11, 2024, 9:38 PM

#

it could be any name at all, totally not worth bikeshedding on at this stage

#

it could even be a __tagflags__

uneven raptor Aug 11, 2024, 9:40 PM

#

raven ridge it could be any name at all, totally not worth bikeshedding on at this stage

i thought the whole point of the name "bikeshedding" was that you do it instead of important things 😉

raven ridge Aug 11, 2024, 9:40 PM

#

sure. I don't plan to engage, though 😉

uneven raptor Aug 11, 2024, 9:41 PM

#

raven ridge it could even be a `__tagflags__`

flags as in like, TAG_EXPANDS | TAG_SOMETHING_ELSE?

raven ridge Aug 11, 2024, 9:41 PM

#

yeah.
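Something along these lines, purely as an illustration. `TagFlags`, its members, and the `__tagflags__` attribute are all hypothetical names:

```python
import enum


class TagFlags(enum.IntFlag):
    """Hypothetical flags a tag could declare about itself."""

    EXPANDS = enum.auto()
    SOMETHING_ELSE = enum.auto()


class sql:
    """Hypothetical tag class that opts into both behaviours."""

    __tagflags__ = TagFlags.EXPANDS | TagFlags.SOMETHING_ELSE

    def __call__(self, *args):
        return args


class plain:
    """Hypothetical tag class that opts into nothing."""

    __tagflags__ = TagFlags(0)

    def __call__(self, *args):
        return args


# The interpreter (or any other code) can test individual bits.
assert TagFlags.EXPANDS in sql.__tagflags__
assert TagFlags.EXPANDS not in plain.__tagflags__
```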

grave jolt Aug 11, 2024, 9:41 PM

#

"tag flags" sounds cool

uneven raptor Aug 11, 2024, 9:41 PM

#

not too big a fan of that in python, is there a precedent for that in the stdlib?

raven ridge Aug 11, 2024, 9:41 PM

#

the important part of this idea is that it gives a way for a tag callable to declare some attributes about itself that the interpreter could inspect. The specific mechanism for how it would do that isn't something we'd need to settle just now

uneven raptor Aug 11, 2024, 9:43 PM

#

i'm more of an advocate for a decorator rather than an ABC, but yeah, i like the general idea

raven ridge Aug 11, 2024, 9:43 PM

#

even the specific attributes it might declare about itself aren't something we'd need to settle right now

#

maybe both of the examples I gave are bad ideas and we don't want to implement them - but I can virtually guarantee that there will eventually be something where we need some strings to be parsed differently than others.

#

r strings and u strings and f strings and b strings are all parsed differently, and it seems unreasonable to bet that tag strings will be the last new type of parsing we'll ever need for the stuff in the quotes

dusk comet Aug 11, 2024, 9:44 PM

#

uneven raptor not too big a fan of that in python, is there a precedent for that in the stdlib...

all classes have cls.__flags__ but it is only used by C code

#

also, code objects have .co_flags

#

and compile supports flags kwarg

#

!d compile

fallen slateBOT Aug 11, 2024, 9:45 PM

#

compile


compile(source, filename, mode, flags=0, dont_inherit=False, optimize=-1)
Compile the *source* into a code or AST object. Code objects can be executed by [`exec()`](https://docs.python.org/3/library/functions.html#exec) or [`eval()`](https://docs.python.org/3/library/functions.html#eval). *source* can either be a normal string, a byte string, or an AST object. Refer to the [`ast`](https://docs.python.org/3/library/ast.html#module-ast) module documentation for information on how to work with AST objects.

The *filename* argument should give the file from which the code was read; pass some recognizable value if it wasn’t read from a file (`'<string>'` is commonly used).

dusk comet Aug 11, 2024, 9:45 PM

#

some flags are probably involved in buffer protocol

uneven raptor Aug 11, 2024, 9:46 PM

#

rephrasing the question: is there any precedent for flags in python code

raven ridge Aug 11, 2024, 9:47 PM

#

the re module, for instance
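For example, `re` flags are combined with `|` and passed as one argument (the sample text is made up):

```python
import re

# IGNORECASE makes the match case-insensitive; MULTILINE lets ^ match at every line start.
pattern = re.compile(r"^hello", re.IGNORECASE | re.MULTILINE)
matches = pattern.findall("Hello there\nhello again\nbye")

# Individual flags can be tested on the compiled pattern with &.
has_ignorecase = bool(pattern.flags & re.IGNORECASE)
has_dotall = bool(pattern.flags & re.DOTALL)

print(matches, has_ignorecase, has_dotall)
```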

dusk comet Aug 11, 2024, 9:47 PM

#

I don't remember any 🙂

raven ridge Aug 11, 2024, 9:47 PM

#

sys.setdlopenflags()

uneven raptor Aug 11, 2024, 9:48 PM

#

raven ridge the `re` module, for instance

damn it

dusk comet Aug 11, 2024, 9:49 PM

#

some filesystem/socket/mmap/... stuff uses flags

raven ridge Aug 11, 2024, 9:49 PM

#

anyway, the important part of the idea is that it provides an extension point. We don't have to define how we'd use that extension point yet, it's useful to have even if we don't yet have any need for it, since we might have a future need for it

uneven raptor Aug 11, 2024, 9:50 PM

#

dusk comet some filesystem/socket/mmap/... stuff uses flags

yeah but that's all thin wrappers over C code. are there pure-python APIs that use flags as their choice of configuration (FWIW, i think re counts, as godly mentioned)

uneven raptor Aug 11, 2024, 9:52 PM

#

raven ridge anyway, the important part of the idea is that it provides an extension point. W...

i do hope the PEP authors take your idea into consideration, it fixes most of the issues i'm seeing in the thread

dusk comet Aug 11, 2024, 9:52 PM

#

is it reasonable to create new namespace for tags only?

there are currently 3 global namespaces: globals themselves, __builtins__ and __annotations__

I suggest making __tags__ namespace so that tag"foo" looks into __tags__['tag']

uneven raptor Aug 11, 2024, 9:53 PM

#

how will something be added to said namespace?

dusk comet Aug 11, 2024, 9:54 PM

#

uneven raptor yeah but that's all thin wrappers over C code. are there pure-python APIs that u...

you can search for any use of enum.IntFlag

uneven raptor Aug 11, 2024, 9:54 PM

#

!d enum.IntFlag is that really a thing

fallen slateBOT Aug 11, 2024, 9:54 PM

#

enum.IntFlag


class enum.IntFlag
*IntFlag* is the same as *Flag*, but its members are also integers and can be used anywhere that an integer can be used...

uneven raptor Aug 11, 2024, 9:54 PM

#

how dare they

raven ridge Aug 11, 2024, 9:54 PM

#

flags are, like, super duper common, my friend 😄

uneven raptor Aug 11, 2024, 9:54 PM

#

keep them in C where they belong

dusk comet Aug 11, 2024, 9:55 PM

#

uneven raptor how will something be added to said namespace?

from myhtml import html_tag, html
from ... import add_tag

add_tag(html=html_tag)
add_tag(html_tag, name='html') # or this

html() # that is a class and this line creates an instance
html'fubar' # that is a use of tag

grave jolt Aug 11, 2024, 9:57 PM

#

Why not follow the normal lookup rules?

uneven raptor Aug 11, 2024, 9:57 PM

#

raven ridge flags are, like, super duper common, my friend 😄

i find a dataclass or namedtuple works much better for configuration (or just plain old kwargs, depending on the case) in python. what libraries (that aren't just C wrappers) use flags?

dusk comet Aug 11, 2024, 9:58 PM

#

grave jolt Why not follow the normal lookup rules?

normal lookup rules are not normal anymore
so adding 4th namespace wouldn't change it much

raven ridge Aug 11, 2024, 9:58 PM

#

uneven raptor i find a dataclass or namedtuple works much better for configuration (or just pl...

https://docs.python.org/3/library/doctest.html#option-flags

dusk comet Aug 11, 2024, 9:58 PM

#

and there are 2 more namespaces that I didn't mention : locals and nonlocals
and they all behave differently

grave jolt Aug 11, 2024, 9:58 PM

#

dusk comet normal lookup rules are not normal anymore so adding 4th namespace wouldn't chan...

With normal lookup rules, you can compose tags or decorate them:
```py
def some_function():
    # ...
    debug_html = debug_tag(html, "some_function")
    return debug_html"<div>{foo}</div>"
```

raven ridge Aug 11, 2024, 9:59 PM

#

uneven raptor i find a dataclass or namedtuple works much better for configuration (or just pl...

the "mode" argument for open()

grave jolt Aug 11, 2024, 10:00 PM

#

Or like
```py
def some_function():
    # ...
    html_ = html.override(ascii_only=True)
    return html_"<div>{foo}</div>"
```

uneven raptor Aug 11, 2024, 10:00 PM

#

raven ridge the "mode" argument for `open()`

that's a string, i'm not sure that counts

raven ridge Aug 11, 2024, 10:00 PM

#

why wouldn't it count? it's multiple discrete pieces of information packed into a single field

uneven raptor Aug 11, 2024, 10:01 PM

#

well, that argument is based on C code anyway, which didn't fit my criteria

raven ridge Aug 11, 2024, 10:01 PM

#

it's not based on C code, open is implemented in Python

#

but even if it wasn't, there's bz2.open and gzip.open and shelve.open etc

dusk comet Aug 11, 2024, 10:02 PM

#

open in C also uses string of flags, iirc

uneven raptor Aug 11, 2024, 10:02 PM

#

i thought it was equivalent to the second parameter of fopen in C

grave jolt Aug 11, 2024, 10:02 PM

#

Everything is based on C code eventually

raven ridge Aug 11, 2024, 10:02 PM

#

uneven raptor i thought it was equivalent to the second parameter of `fopen` in C

it's not, there's Python-specific flags

halcyon trail Aug 11, 2024, 10:03 PM

#

idk about everything. but in any case, that doesn't mean you couldn't have a nicer API - python's subprocess.run for example is vastly nicer than any C API it's eventually delegating to

feral island Aug 11, 2024, 10:03 PM

#

raven ridge it's not based on C code, `open` is implemented in Python

open() is implemented in C (not sure it matters for this argument though)

halcyon trail Aug 11, 2024, 10:03 PM

#

I would like to think that open in python is how it is simply because it's quite old - if open's API were being designed today then I'd like to hope it would take an enum (but maybe I'm delusional)

feral island Aug 11, 2024, 10:04 PM

#

halcyon trail I would like to think that `open` in python is how it is simply because it's qui...

Agree, I don't think the open() interface is a great example to follow

halcyon trail Aug 11, 2024, 10:04 PM

#

or maybe even a dataclass with multiple bools, idk

uneven raptor Aug 11, 2024, 10:04 PM

#

raven ridge it's not, there's Python-specific flags

i'm still not an advocate for designing an API to use flags instead of a dataclass or whatever these days, but yes you win, python has flags in the stdlib

halcyon trail Aug 11, 2024, 10:05 PM

#

fwiw I agree with you that flags suck

#

it's a very C thing. Even in C++, if you wanted to achieve the same underlying efficiency, you'd do it in a more type safe way.

grave jolt Aug 11, 2024, 10:06 PM

#

uneven raptor yeah but that's all thin wrappers over C code. are there pure-python APIs that u...

They definitely are used
https://grep.app/search?q=enum\.(Int)%3FFlag&regexp=true&filter[lang][0]=Python
https://grep.app/search?q=from enum import[^F]%2BFlag&regexp=true&case=true&filter[lang][0]=Python

uneven raptor Aug 11, 2024, 10:06 PM

#

grave jolt They definitely are used <https://grep.app/search?q=enum%5C.%28Int%29%3FFlag&reg...

from what i'm seeing, many of the results are in libraries that are FFIs

grave jolt Aug 11, 2024, 10:07 PM

#

Yes, especially for IntFlag

halcyon trail Aug 11, 2024, 10:07 PM

#

I believe they are used, but I'm not sure I see a good reason to use an enum.IntFlag in pure python (i.e. no wrapping of C or eventual system calls)

#

maybe binary serialization into another language (but again that's not really "pure python" anymore, exactly)

raven ridge Aug 11, 2024, 10:08 PM

#

this whole discussion is weird and unnecessary. The relevant idea is that there should be some way for the tag callable to tell the interpreter how it wants to be called. There are infinitely many contracts that would allow for that; it's weird to get stuck on one like this

uneven raptor Aug 11, 2024, 10:09 PM

#

bikeshedding :D

grave jolt Aug 11, 2024, 10:10 PM

#

halcyon trail I believe they are used, but I'm not sure I see a *good* reason to use an enum.I...

I see it as just a handy way to express a set of literal options. Something like
```py
class Flags(enum.Flag):
    MULTILINE = "multiline"
    DOTALL = "dotall"
    VERBOSE = "verbose"
# <=>
frozenset[Literal["multiline", "dotall", "verbose"]]
```

#

But yeah, if you need to add an option that's not a bool, you need to bolt it to the side

halcyon trail Aug 11, 2024, 10:11 PM

#

what's the actual benefit of this, I don't really understand

#

you can just write Flags.MULTILINE | Flags.DOTALL

#

?

grave jolt Aug 11, 2024, 10:11 PM

#

the benefit of what?

halcyon trail Aug 11, 2024, 10:12 PM

#

I just don't really see why this is better than a normal enum and {Flags.MULTILINE, Flags.DOTALL}, that's all

grave jolt Aug 11, 2024, 10:12 PM

#

ah

#

good question 🙂

halcyon trail Aug 11, 2024, 10:12 PM

#

in C, creating an actual hashset for something like this is an insane amount of work

#

So obviously you're going to use a bitset

#

it's also far faster and you often care about speed, avoiding heap allocations, etc

#

So even in C++, stuff like this does get used, though often an attempt is made to wrap it up more nicely.

#

there's no real reason outside of that to use it that I know of, so basically almost no reasoning applicable to python

uneven raptor Aug 11, 2024, 10:16 PM

#

i'm not too sure about PEP 750's choice to allow any return type, it's odd that you could do things like foo"hello" == 42

halcyon trail Aug 11, 2024, 10:17 PM

#

@grave jolt another nasty thing from a type perspective is that Flags.MULTILINE, and x = Flags.MULTILINE | Flags.DOTALL, have the same type

#

so now Flags.MULTILINE in x feels awfully weird because you're checking if a T is in a T

grave jolt Aug 11, 2024, 10:19 PM

#

halcyon trail so now `Flags.MULTILINE in x` feels awfully weird because you're checking if a T...

str 👀

#

actually... maybe that is also weird

halcyon trail Aug 11, 2024, 10:20 PM

#

it just ends up being weird no matter what, with flags.
In C++, ideally, if I wanted to do "flags", I would try to have a separate type for the enum, and for the enum set

#

but really it's only something I'd do for performance

#

{Flags.MULTILINE, Flags.DOTALL} is a set[T] and so later you're doing a membership check of T in a Set[T] - life is simple and makes sense
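To make the comparison concrete, here is a minimal sketch of both styles side by side. The class names `Flags` and `Option` and their members are illustrative assumptions, not code from any library discussed above.

```python
import enum


class Flags(enum.Flag):
    """Bitset style: members combine with | into another Flags value."""
    MULTILINE = enum.auto()
    DOTALL = enum.auto()
    VERBOSE = enum.auto()


class Option(enum.Enum):
    """Plain enum style: combinations live in an ordinary set."""
    MULTILINE = "multiline"
    DOTALL = "dotall"
    VERBOSE = "verbose"


combined = Flags.MULTILINE | Flags.DOTALL
as_set = {Option.MULTILINE, Option.DOTALL}

# With Flag, a single member and a combination share one type ("T in T").
same_type = type(Flags.MULTILINE) is type(combined)

# With a set, the container and the element have different types ("T in set[T]").
set_is_distinct_type = type(as_set) is not type(Option.MULTILINE)

# Membership checks read the same either way.
flag_has_dotall = Flags.DOTALL in combined
flag_has_verbose = Flags.VERBOSE in combined
set_has_dotall = Option.DOTALL in as_set
```

Both spellings answer the membership question; the difference is only in whether the type system can tell a single option from a collection of options.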

raven ridge Aug 11, 2024, 10:26 PM

#

halcyon trail I just don't really see why this is better than a normal enum and {Flags.MULTILI...

enum.IntFlag is extensible to flags that are not known in advance but might still be supported (by virtue of being threaded through to a different layer that knows what to do with them)
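A small sketch of what "extensible to flags that are not known in advance" looks like in practice. The `Perm` and `Strict` classes are hypothetical; the point is only that `IntFlag` accepts undeclared bits while a plain `Flag` rejects them.

```python
import enum


class Perm(enum.IntFlag):
    READ = 1
    WRITE = 2


class Strict(enum.Flag):
    READ = 1
    WRITE = 2


# Bit 8 is not declared on Perm, but IntFlag keeps it and passes it along.
mixed = Perm.READ | 8

still_a_perm = isinstance(mixed, Perm)
knows_read = Perm.READ in mixed
raw_value = int(mixed)

# A plain Flag refuses a value made only of undeclared bits.
try:
    Strict(8)
    strict_rejects = False
except ValueError:
    strict_rejects = True
```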

uneven raptor Aug 11, 2024, 10:27 PM

#

this is more about python flags than i ever needed to know 😄

halcyon trail Aug 11, 2024, 10:28 PM

#

usually the whole concept with enums is to restrict things intentionally. But in any case, even if you wanted to do that, nothing is actually stopping you from putting different values into a set in python

#

I don't see what IntFlag's advantage in that regard is

raven ridge Aug 11, 2024, 10:29 PM

#

🤷‍♂️ all of the things that are nicer about enums than just global variables, I suppose

halcyon trail Aug 11, 2024, 10:29 PM

#

err what

#

I don't see any connection there at all

glass mulch Aug 11, 2024, 10:29 PM

#

uneven raptor i'm not too sure about PEP 750's choice to allow any return type, it's odd that ...

What, you don't like callable strings?

def lam(*args):
    return lambda: args[0]

print(lam"Hi"())

uneven raptor Aug 11, 2024, 10:30 PM

#

yeah... not great

#

however, it opens the door to lots of black magic shenanigans on pypi

glass mulch Aug 11, 2024, 10:30 PM

#

python-ideas will have a field day with tags

raven ridge Aug 11, 2024, 10:31 PM

#

halcyon trail I don't see any connection there at all

I'm not sure why not? Your argument that instead of using Flags.FOO | Flags.BAR you could use {Flags.FOO, Flags.BAR}, which is true. But by extension to that same argument, we don't need enum at all, you can just do {module.FOO, module.BAR}. The things that you get from enum are a way to check that all the things in the set are valid (for some definition of valid), a nice repr, type safety, etc

halcyon trail Aug 11, 2024, 10:31 PM

#

raven ridge I'm not sure why not? Your argument that instead of using `Flags.FOO | Flags.BAR...

When you start putting things from outside the IntFlag into your set though, you already lost that ability 🤷‍♂️

#

maybe it's easier if you actually show the IntFlag code that you envision - then we could see if there's a way to write it without IntFlag that's equally nice

raven ridge Aug 11, 2024, 10:32 PM

#

halcyon trail When you start putting things from outside the IntFlag into your set though, you...

nah, you still have some type safety (at least it's an int) and you still have a nice repr (it just uses a constant for the value that wasn't known up front)

halcyon trail Aug 11, 2024, 10:32 PM

#

then you can just use ints 🙂

#

You can have one enum with int values, pass around sets of ints that could potentially have ints from outside that enum - that another "layer" in the codebase knows about

#

you can even put a union type in your set

raven ridge Aug 11, 2024, 10:34 PM

#

halcyon trail then you can just use ints 🙂

of course you can, at the cost of the nice repr, and the ability to do membership checks by name

halcyon trail Aug 11, 2024, 10:34 PM

#

Set[NormalEnum | int]

#

Seems exactly the same to me?

#

the difference is that here, we're explicit about the fact that some values will be from inside the "known" enum, and some values will not be.

grave jolt Aug 11, 2024, 10:34 PM

#

glass mulch What, you don't like callable strings? ```py def lam(*args): return lambda: ...

You can insert a comment or other supplementary material in the middle of a call now
```diff
 @motivational # turns the function into a tag returning a callable
 def pick_polling_strategy(con_pool, expected_size, expected_count):
     ...

 def handle_something():
-    poll = pick_polling_strategy(config.pool, expected_size=2**20, expected_count=50)
+    poll = pick_polling_strategy "It is the rule in war, if ten times the enemy's strength, surround them; if five times, attack them; if double, be able to divide them; if equal, engage them; if fewer, defend against them; if weaker, be able to avoid them." (config.pool, expected_size=2**20, expected_count=50)
```

raven ridge Aug 11, 2024, 10:34 PM

#

halcyon trail the difference is that here, we're explicit about the fact that some values will ...

ok. That's implicit with an IntEnum

halcyon trail Aug 11, 2024, 10:34 PM

#

With IntFlag, it's just always going to be implicit whether or not that's the case

#

yes, exactly

dusk comet Aug 11, 2024, 10:35 PM

#

grave jolt You can insert a comment or other supplementary material in the middle of a call...

finally, inline comments 🧑‍🔬

halcyon trail Aug 11, 2024, 10:35 PM

#

so IntFlag is just throwing away type safety which is basically always going to be useful somewhere 🤷‍♂️

dusk comet Aug 11, 2024, 10:36 PM

#

foo/*comment*/(args)
foo"comment"(args)

halcyon trail Aug 11, 2024, 10:36 PM

#

MyIntFlag is a single type that is overloaded to mean NormalEnum, Set[NormalEnum], and Set[NormalEnum | int] - hard to consider that a win

raven ridge Aug 11, 2024, 10:36 PM

#

that's exactly what it's for, though

dusk comet Aug 11, 2024, 10:37 PM

#

I don't like using stuff I don't understand
and I don't understand fully how enum magic works, so I don't like using enum module

raven ridge Aug 11, 2024, 10:37 PM

#

it's an improvement upon bitsets. It's useful for doing bitsetty things in a more readable and safer way

halcyon trail Aug 11, 2024, 10:37 PM

#

I don't know how it matters what its intention is - I don't think it serves any purpose usefully - that was the whole discussion 🤷‍♂️ .
I don't see the point repeating "but that's the intention!"

#

Yes - and if you need an underlying data representation that is a bitset, then it's fine.

grave jolt Aug 11, 2024, 10:38 PM

#

Speaking of PEP 750. Will any expression be allowed inside of {}? Like, can you do hmm"{some.thing + 420} is what i want to print"?

halcyon trail Aug 11, 2024, 10:38 PM

#

I said that from the get go - but that very rarely comes up outside of wrapping C code and things similar to that

uneven raptor Aug 11, 2024, 10:38 PM

#

grave jolt Speaking of PEP 750. Will any expression be allowed inside of `{}`? Like, can yo...

i think so, it's an extension of f strings

grave jolt Aug 11, 2024, 10:39 PM

#

uneven raptor i think so, it's an extension of f strings

Will await be banned inside of {} then?

glass mulch Aug 11, 2024, 10:39 PM

#

for x in rg"0..10":
    print(x)

grave jolt Aug 11, 2024, 10:39 PM

#

grave jolt Will `await` be banned inside of `{}` then?

(that does sound like a sensible option)

uneven raptor Aug 11, 2024, 10:39 PM

#

grave jolt Will `await` be banned inside of `{}` then?

do fstrings not support await?

#

or actually, it probably can't, because of lazy evaluation

grave jolt Aug 11, 2024, 10:40 PM

#

uneven raptor do fstrings not support `await`?

They do, because they're transformed at compile time. But with the proposed interface, it would not be possible to handle hmm"aaa {await something()} bbb"
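For reference, this is the f-string behaviour being described: `await` works inside the braces because the whole expression is compiled inline within the coroutine. `fetch_value` is a made-up coroutine for illustration.

```python
import asyncio


async def fetch_value() -> int:
    # Stand-in for real asynchronous work.
    await asyncio.sleep(0)
    return 42


async def main() -> str:
    # The await runs as part of the enclosing coroutine, not inside a callback.
    return f"aaa {await fetch_value()} bbb"


result = asyncio.run(main())
```

A tag that receives deferred interpolations would have no equivalent place to suspend, which is the concern raised above.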

uneven raptor Aug 11, 2024, 10:40 PM

#

grave jolt They do, because they're transformed at compile time. But with the proposed inte...

worth bringing up on discourse, i don't think they mention that

grave jolt Aug 11, 2024, 10:41 PM

#

Similarly, (yield) would be disallowed

dusk comet Aug 11, 2024, 10:41 PM

#

glass mulch ```py for x in rg"0..10": print(x) ```

for x in rg"1,2,...,{n}":
    print(x)

uneven raptor Aug 11, 2024, 10:41 PM

#

this is exactly why tags should be required to return strings

dusk comet Aug 11, 2024, 10:41 PM

#

dusk comet ```py for x in rg"1,2,...,{n}": print(x) ```

I like it soo much 🥰

#

imagine doing this
```py
for x in rg"({a},{b}]": print(x)
```

#

meh, too many brackets of all sorts...

uneven raptor Aug 11, 2024, 10:44 PM

#

now that i think of it... maybe PEP 750 makes golfers too strong

glass mulch Aug 11, 2024, 10:45 PM

#

dusk comet imagine doing this ```py for x in rg"({a},{b}]": print(x) ```

The beauty is that you can parse whatever syntax you want, including comments or multiline ranges, just yield from range(start, end, step).
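Since the tag syntax is only a proposal, the idea can be sketched with an ordinary function instead. The name `rg` and the `start..end` format come from the joke above; the parsing rules here are an assumption made for the example.

```python
from typing import Iterator


def rg(spec: str) -> Iterator[int]:
    """Parse 'start..end' and yield the integers in that half-open range."""
    start_text, sep, end_text = spec.partition("..")
    if not sep:
        raise ValueError(f"expected 'start..end', got {spec!r}")
    # Delegate the actual iteration to range, as suggested above.
    yield from range(int(start_text), int(end_text))


values = list(rg("0..10"))
```

Called as `rg("0..10")` this is unremarkable; the debate is about whether writing it as a string prefix should be allowed to return something that is not a string.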

dusk comet Aug 11, 2024, 10:45 PM

#

uneven raptor now that i think of it... maybe PEP 750 makes golfers too strong

not really
only if there will be a stdlib module with many tags

#

making your own tag is probably too many chars

uneven raptor Aug 11, 2024, 10:46 PM

#

depends on what it is, i guess

grave jolt Aug 11, 2024, 10:50 PM

#

Actually, I kinda agree with concerns from Eric Traut. The template can evaluate the arguments in any order (or at a later point in time), which can be surprising

#

I do like the explicit lazy marker. But not lambda: 💀

raven ridge Aug 11, 2024, 10:59 PM

#

uneven raptor this is exactly why tags should be required to return strings

That would make them far less useful. In fact, you'd lose practically all of their advantages

#

taking the sql"" example, for instance, you'd be losing the ability to use that SQL statement with APIs that take the parameters as objects, losing out on the ability to use prepared statements or execution plan caching, etc
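A rough sketch of why a non-string result matters for the SQL case. This uses a plain function rather than the proposed syntax, and `Param` is a hypothetical stand-in for whatever object the PEP would hand to the tag for each interpolation.

```python
from dataclasses import dataclass
from typing import Any


@dataclass(frozen=True)
class Param:
    """Hypothetical wrapper marking a value that was interpolated."""
    value: Any


def sql(*parts: Any) -> tuple[str, tuple[Any, ...]]:
    """Build a parameterized statement instead of a formatted string."""
    query: list[str] = []
    params: list[Any] = []
    for part in parts:
        if isinstance(part, Param):
            query.append("?")          # placeholder, value travels separately
            params.append(part.value)
        else:
            query.append(part)         # literal SQL text
    return "".join(query), tuple(params)


statement, params = sql(
    "SELECT * FROM users WHERE name = ", Param("O'Brien"),
    " AND age > ", Param(30),
)
```

Because the values never get spliced into the text, the statement can be prepared or cached by the database driver, which would not be possible if the tag were forced to return a finished string.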

glass mulch Aug 11, 2024, 11:14 PM

#

raven ridge That would make them far less useful. In fact, you'd lose practically all of the...

I worry about the idea of these being called "tagged strings" when they return anything. You cannot safely iterate over a tagged string because it might have returned itertools.count. They are actually a special function call protocol that may be used to construct anything, with one such use being advanced string operations. I'd rather call it the "tag protocol" or something.

grave jolt Aug 11, 2024, 11:18 PM

#

b"foo" doesn't produce a str

dusk comet Aug 11, 2024, 11:20 PM

#

re'foobar' == re.compile(r'foobar')

grave jolt Aug 11, 2024, 11:20 PM

#

dusk comet `re'foobar' == re.compile(r'foobar')`

as opposed to e'foobar', which doesn't preserve backslashes 👍

feral island Aug 11, 2024, 11:22 PM

#

grave jolt as opposed to `e'foobar'`, which doesn't preserve backslashes 👍

clearly that calls eval()

grave jolt Aug 11, 2024, 11:22 PM

#

Finally. Backtick strings

glass mulch Aug 11, 2024, 11:24 PM

#

grave jolt `b"foo"` doesn't produce a `str`

But it returns something based on whatever is between quotes. You can put names and arbitrary objects in the global namespace with a tag. Seems to be a significant difference to me.

raven ridge Aug 11, 2024, 11:26 PM

#

that's true of an f-string, too

feral island Aug 11, 2024, 11:27 PM

#

!e f"{globals().__setitem__('x', 'y')}"; print(x)

fallen slateBOT Aug 11, 2024, 11:27 PM

#

feral island !e ```f"{globals().__setitem__('x', 'y')}"; print(x)```

:white_check_mark: Your 3.12 eval job has completed with return code 0.

dusk comet Aug 11, 2024, 11:28 PM

#

f'{(x:=1)}'; print(x)

#

apparently f'{x:=1}' is a valid syntax

#

>>> f'{2:=1}'
'2'

feral island Aug 11, 2024, 11:28 PM

#

that formats with =1 as the format string, right?

dusk comet Aug 11, 2024, 11:29 PM

#

yes
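A quick illustration of the two readings. Without parentheses, everything after the colon is a format spec, and `=` is the alignment option that pads between the sign and the digits; with parentheses it is an assignment expression.

```python
# "=1" is parsed as a format spec: "=" alignment with a minimum width of 1.
plain = f"{2:=1}"
same_as_format = format(2, "=1")

# A wider field makes the "=" alignment visible.
padded = f"{-2:=5}"

# Parentheses turn it into an assignment expression that binds x.
walrus = f"{(x := 1)}"
```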

grave jolt Aug 11, 2024, 11:29 PM

#

Maybe allowing arbitrary expressions inside f-strings was a mistake. I've seen some terrible stuff

#

(a comprehension inside of {} is "terrible stuff" in my book)

feral island Aug 11, 2024, 11:30 PM

#

format specifiers were enough to stump Pablo, lol. https://github.com/python/cpython/issues/121130#issuecomment-2197120529

GitHub

Self-documenting f-string in conversion specifier throws ValueError...

Bug report Bug description: Since Python 3.12, the compiler throws a ValueError when compiling a string like f"{x:{y=}}": $ ./python.exe Python 3.14.0a0 (heads/main:81a654a342, Jun 28 202...

glass mulch Aug 11, 2024, 11:30 PM

#

raven ridge that's true of an f-string, too

Thanks! Maybe I'm not aware of just how powerful strings already are? 🤔

dusk comet Aug 11, 2024, 11:30 PM

#

is f'foo{",".join(...)}bar' considered "terrible stuff" ? i wrote it several times

#

f'A {f"{B} {C}":10} D'

grave jolt Aug 11, 2024, 11:31 PM

#

I mean, ultimately it's subjective

#

for me it makes it harder to separate the template text from what's being inserted into it

#

my brain has very little RAM

grave jolt Aug 12, 2024, 12:14 AM

#

The current PEP750 reference implementation does something interesting with yield
```pycon
>>> def inspekt(*args):
...     rv = []
...     for arg in args:
...         if isinstance(arg, Decoded):
...             print("decoded", arg)
...         else:
...             v = arg.getvalue()
...             rv.append(v)
...             print("interp", repr(v))
...     return rv
...
>>> x, y, z = inspekt"foo{42}bar{yield 5}baz{yield 6}"
decoded foo
interp 42
decoded bar
interp <generator object <interpolation> at 0x7d211cbe8670>
decoded baz
interp <generator object <interpolation> at 0x7d211cbe8880>
>>> next(y)
5
>>> next(y)
Traceback (most recent call last):
  File "<python-input-31>", line 1, in <module>
    next(y)
    ~~~~^^^
StopIteration
>>> next(z)
6
>>> next(z)
Traceback (most recent call last):
  File "<python-input-33>", line 1, in <module>
    next(z)
    ~~~~^^^
StopIteration
```

feral island Aug 12, 2024, 12:32 AM

#

grave jolt The current PEP750 reference implementation does something interesting with `yie...

I think at some point it literally translated it into lambda: yield 5, not sure if it still does that, but that seems to be what you're seeing
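The same effect can be reproduced in current Python, which supports that explanation: a lambda whose body contains `yield` is a generator function, so calling it returns a generator instead of a value.

```python
import inspect

# A lambda containing yield is a generator function.
make_gen = lambda: (yield 5)

is_generator_function = inspect.isgeneratorfunction(make_gen)

gen = make_gen()
first = next(gen)

# The generator has nothing left after its single yield.
try:
    next(gen)
    exhausted = False
except StopIteration:
    exhausted = True
```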

uneven raptor Aug 12, 2024, 11:07 AM

#

raven ridge That would make them far less useful. In fact, you'd lose practically all of the...

yeah, fair. i'm just worried that adding tags will encourage some shenanigans like rg from above

stable drum Aug 12, 2024, 1:02 PM

#

Can someone pls help on how to go about this using python

rn_image_picker_lib_temp_3e89ebdb-bc4c-4be7-a7f2-a424e9bd8d5c.jpg

eternal geyser Aug 13, 2024, 12:31 PM

#

stable drum Can someone pls help on how to go about this using python

First thing that came to my mind was to create a table containing dictionaries that describe that table. These dictionaries will correspond to a simple hash (in this case i just concatenated the X Y coordinates into a string)
example:

coord_table = {
    "873":{
        "x":87,
        "y":3,
        "c":"□" # special character 
    }
}
x, y = 87, 3
hash = str(x) + str(y)
print(coord_table[hash])

#

!e

coord_table = {
    "873":{
        "x":87,
        "y":3,
        "c":"□" # special character 
    }
}
x, y = 87, 3
hash = str(x) + str(y)
print(coord_table[hash])

fallen slateBOT Aug 13, 2024, 12:34 PM

#

eternal geyser !e ``` coord_table = { "873":{ "x":87, "y":3, "c":"□...

:white_check_mark: Your 3.12 eval job has completed with return code 0.

{'x': 87, 'y': 3, 'c': '□'}

radiant garden Aug 14, 2024, 8:48 AM

#

me when the 1, 11 coordinate overwrites the 11, 1 coordinate
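The collision is easy to demonstrate, and one way around it (a sketch, not the only option) is to use the coordinate tuple itself as the dictionary key, since tuples of ints are hashable.

```python
# Concatenated strings cannot tell (1, 11) from (11, 1).
key_a = str(1) + str(11)
key_b = str(11) + str(1)
keys_collide = key_a == key_b

# Tuple keys keep the two coordinates distinct.
coord_table = {
    (1, 11): {"x": 1, "y": 11, "c": "□"},
    (11, 1): {"x": 11, "y": 1, "c": "■"},
}

x, y = 11, 1
cell = coord_table[(x, y)]
```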

warm otter Aug 15, 2024, 5:09 PM

#

How do i repeat ?

lunar badge Aug 16, 2024, 2:19 PM

#

yo what are some mobile alternatives for pycharm

dim turtle Aug 16, 2024, 2:19 PM

#

Hello there everyone

lunar badge Aug 16, 2024, 2:19 PM

#

hi

#

!e

fallen slateBOT Aug 16, 2024, 2:20 PM

#

Missing required argument

code

dim turtle Aug 16, 2024, 2:20 PM

#

Hmm what's this

#

Alr

#

I just joined the server

#

How are you

lunar badge Aug 16, 2024, 2:39 PM

#

print("hello")

uneven raptor Aug 16, 2024, 4:27 PM

#

how does typeshed indicate a soft deprecation? @typing.deprecated?

feral island Aug 16, 2024, 4:28 PM

#

uneven raptor how does typeshed indicate a soft deprecation? `@typing.deprecated`?

we haven't explicitly discussed it

uneven raptor Aug 16, 2024, 4:30 PM

#

good to know. it might be difficult to get people to migrate without their IDE telling them so

gray galleon Aug 16, 2024, 4:32 PM

#

how does python handle duplicate imports
like
```py
# main.py
import math
import dist

assert dist.dist(2, 1, 5, 5) == math.sqrt(3 * 3 + 4 * 4)

# dist.py
import math

def dist(x1, y1, x2, y2):
    return math.sqrt((x2 - x1)**2 + (y2 - y1)**2)
```

feral island Aug 16, 2024, 4:34 PM

#

gray galleon how does python handle duplicate imports like```py # main.py import math import ...

it gets cached the first time, so the second import does little work

#

import is roughly implemented as
```
try:
    return sys.modules[module_name]
except KeyError:
    mod = actually_import(module_name)
    sys.modules[module_name] = mod
    return mod
```

gray galleon Aug 16, 2024, 4:39 PM

#

does it cache in this case
```py
# main.py
from math import factorial
from dist import dist

assert dist(2, 1, 5, 5) ** 2 == factorial(4) + 1

# dist.py
from math import sqrt

def dist(x1, y1, x2, y2):
    return sqrt((x2 - x1)**2 + (y2 - y1)**2)
```

feral island Aug 16, 2024, 4:40 PM

#

yes
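The cache can be observed directly from Python. This sketch uses `math` only because it is always available; any module behaves the same way.

```python
import importlib
import sys

import math
from math import sqrt

# After the first import the module object lives in sys.modules.
cached = sys.modules["math"]

# Importing again, by any spelling, hands back that same object.
again = importlib.import_module("math")

same_module = cached is again is math

# "from math import sqrt" just binds an attribute of the cached module.
same_function = sqrt is math.sqrt
```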

raven ridge Aug 16, 2024, 4:46 PM

#

from module_name import Y, Z is roughly implemented as
```py
try:
    mod = sys.modules[module_name]
except KeyError:
    mod = sys.modules[module_name] = actually_import(module_name)

Y = mod.Y
Z = mod.Z
del mod
```

#

(instead of being bound and then unbound, the name mod is just never bound, but this is roughly the idea)

feral island Aug 16, 2024, 4:54 PM

#

Note it also looks for sys.modules[f"{module_name}.{Y}"] though
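The submodule case can be seen with a standard-library package. `xml` is used here only as a convenient example of a package whose `__init__` does not import its submodules itself.

```python
import sys

# xml/__init__.py does not import xml.dom, so this triggers a submodule import.
from xml import dom

# The submodule is registered under its dotted name.
registered = sys.modules.get("xml.dom")
is_same_object = registered is dom

# It is also set as an attribute on the parent package.
parent_has_attr = sys.modules["xml"].dom is dom
```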

raven ridge Aug 16, 2024, 7:19 PM

#

oh, true 🙂

uneven raptor Aug 16, 2024, 7:26 PM

#

does caching apply to the C APIs for it? e.g. PyImport_Import

feral island Aug 16, 2024, 7:28 PM

#

uneven raptor does caching apply to the C APIs for it? e.g. `PyImport_Import`

I believe so but read the docs and/or the code

uneven raptor Aug 16, 2024, 7:30 PM

#

it doesn't seem like PyImport_Import does any caching...

#

https://github.com/python/cpython/blob/c13e7d98fb8581014a225b900b1b88ccbfc28097/Python/import.c#L3881

fallen slateBOT Aug 16, 2024, 7:30 PM

#

Python/import.c line 3881

PyImport_Import(PyObject *module_name)

uneven raptor Aug 16, 2024, 7:30 PM

#

or, if it does, it doesn't skip the list creation and whatnot

feral island Aug 16, 2024, 7:31 PM

#

lol it literally calls builtins.__import__

#

the caching is inside of that

#

but there's enough other stuff going on here that adding your own caching on top of PyImport_Import is likely worthwhile

uneven raptor Aug 16, 2024, 7:34 PM

#

feral island the caching is inside of that

i figured

#

looking closer, it doesn't look like any of the C APIs for import do caching

misty oxide Aug 17, 2024, 3:48 AM

#

Where is the definitive source of truth in the standard and/or stdlib on how to escape strings? Both fstrings and normal strings.

rose schooner Aug 17, 2024, 4:10 AM

#

"escape strings"?

uneven raptor Aug 17, 2024, 12:12 PM

#

while looking at the import implementation, i came across PyImport_ImportModuleNoBlock, which is deprecated and scheduled for removal in 3.15 -- it's part of the stable ABI, wouldn't that break forward compatibility?

#

i don't think anyone has used it since 3.3, i'm just curious 😄

misty swan Aug 18, 2024, 10:36 PM

#

hey guys, I'm running into an issue where my tests for a cpython PR are failing on some specific environments, wondering if anyone can have a look. I've already posted on #1035199133436354600 so I'm linking the whole thing here: #1274855244643172424 message

raven ridge Aug 19, 2024, 1:04 AM

#

misty swan hey guys, I'm running into an issue where my tests for a cpython PR is failing o...

you're calling the zipapp main with:

        args = [str(source), '--include-pattern', r'.*\.py', '--exclude-pattern', r'.*z.*']

where source is a subdirectory of self.tmpdir - what happens if self.tmpdir contains a z in its name?

#

I'm not positive that this is the bug, but at a glance, it seems like if the temp directory has a z in its name (or one of its parent directory's names), the test would fail with exactly the symptoms that you're seeing - every file would be excluded
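A sketch of the suspected failure mode. How the PR actually applies the pattern is not shown here, so this assumes it is matched against the full path; the temp directory name is made up and deliberately contains a `z`.

```python
import re
from pathlib import PurePosixPath

exclude = re.compile(r".*z.*")

# Hypothetical temp directory whose random name happens to contain "z".
tmpdir = PurePosixPath("/tmp/xyz123")
source = tmpdir / "source"
file_path = source / "main.py"

# Matching the full path excludes every file under that temp directory.
excluded_by_full_path = bool(exclude.match(str(file_path)))

# Matching the path relative to the source directory does not.
relative = file_path.relative_to(source)
excluded_by_relative_path = bool(exclude.match(str(relative)))
```

If that is the cause, the test would only fail on runs where the random directory name contains the letter, which fits the intermittent symptoms.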

unkempt rock Aug 19, 2024, 1:39 AM

#

!python

paper vault Aug 19, 2024, 10:52 AM

#

Anyone an ESRGAN expert here?

winged sphinx Aug 19, 2024, 12:09 PM

#

paper vault Anyone an ESRGAN expert here?

That's more a question for #data-science-and-ml , but also helps if you just ask the question (there)

misty swan Aug 20, 2024, 1:28 AM

#

raven ridge you're calling the zipapp main with: ``` args = [str(source), '--include...

OMG, that must be it, no wonder it seems to fail randomly, thank you @raven ridge

raven ridge Aug 20, 2024, 1:28 AM

#

I commented on the PR, too, in case you missed it here 🙂

unkempt rock Aug 20, 2024, 6:31 AM

#

raven ridge you're calling the zipapp main with: ``` args = [str(source), '--include...

what about this

frozen burrow Aug 20, 2024, 7:52 PM

#

Urgent Help
Who can help me?

grave jolt Aug 20, 2024, 8:10 PM

#

frozen burrow Urgent Help Who can help me?

If you have a Python question, you should see #❓｜how-to-get-help and ask in #1035199133436354600. Make sure to post all the details you have

frozen burrow Aug 20, 2024, 8:14 PM

#

grave jolt If you have a Python question, you should see <#704250143020417084> and ask in <...

thanks for your support

uneven raptor Aug 23, 2024, 12:32 AM

#

a conversation in pydis piqued my interest, is it possible to remove the recursion limit without modifying the core?

raven ridge Aug 23, 2024, 12:44 AM

#

I think it depends a lot on exactly what you mean by "remove the recursion limit"

dusk comet Aug 23, 2024, 12:44 AM

#

you will eventually overflow C stack, and it will lead to a crash

raven ridge Aug 23, 2024, 12:45 AM

#

not necessarily - in modern Python versions, it's possible to call Python functions forever without overflowing the C stack

dusk comet Aug 23, 2024, 12:46 AM

#

def f():
 try: f()
 except: f()
 finally: f()
f()

uneven raptor Aug 23, 2024, 12:47 AM

#

raven ridge I think it depends a lot on exactly what you mean by "remove the recursion limit...

recursively calling infinitely will never raise a RecursionError

#

would something like sys.setrecursionlimit(-1) work?

raven ridge Aug 23, 2024, 12:48 AM

#

You can try:
```py
import sys

sys.setrecursionlimit(2**31-1)

def a():
    a()

a()
```

uneven raptor Aug 23, 2024, 12:49 AM

#

well, that’s a lot, but it’s not totally infinite

raven ridge Aug 23, 2024, 12:50 AM

#

it doesn't run out of stack space, though, it runs out of heap space

uneven raptor Aug 23, 2024, 12:51 AM

#

what if you had a beefy computer that could hold all the frames in memory at once. it would still raise a RecursionError, right?

raven ridge Aug 23, 2024, 12:51 AM

#

yes

#

there's no technical reason why that needs to be the case, though

uneven raptor Aug 23, 2024, 12:52 AM

#

so there’s no sure-fire way to remove the limit on all systems?

raven ridge Aug 23, 2024, 12:53 AM

#

sure there is - delete the code that imposes a limit

uneven raptor Aug 23, 2024, 12:54 AM

#

delete the code in the core?

raven ridge Aug 23, 2024, 12:55 AM

#

yeah

uneven raptor Aug 23, 2024, 12:56 AM

#

that’s not exactly portable now is it 😛

raven ridge Aug 23, 2024, 12:56 AM

#

I don't know what you mean by "portable" here

#

in practice, if you never call C functions and only call Python functions, you can recurse 2 billion calls deep in current CPython versions. You will run out of heap memory before you successfully push 2 billion call frames

uneven raptor Aug 23, 2024, 12:57 AM

#

portable as in, you can run it on any python interpreter

raven ridge Aug 23, 2024, 12:58 AM

#

if "any Python interpreter" includes Python 3.10 and earlier, you will overflow the C stack and crash the process, even if you never call any C functions

uneven raptor Aug 23, 2024, 12:58 AM

#

raven ridge in practice, if you never call C functions and only call Python functions, you c...

i know, i avoid recursion anyway. i was just wondering if it was something that cpython had a way to do

raven ridge Aug 23, 2024, 12:59 AM

#

it'd take hundreds of gigs of memory to be able to hold 2 billion call frames, though

#

the limit you'll hit, in practice, isn't the recursion limit, it's the amount of memory on the system

uneven raptor Aug 23, 2024, 1:01 AM

#

with that being said, could one manually deallocate frames that you know you’ll never see again?

raven ridge Aug 23, 2024, 1:02 AM

#

no... you need to be able to return to those frames

uneven raptor Aug 23, 2024, 1:03 AM

#

oh well

raven ridge Aug 23, 2024, 1:05 AM

#

back of the napkin math, 2 billion call frames would take at least 176 GB of heap memory just for the _PyInterpreterFrame structs
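The arithmetic behind that estimate can be reproduced as follows. The 88 bytes per frame is an assumption worked backwards from the 176 GB figure; the real size of the struct depends on the CPython version and on how many locals and stack slots each frame needs.

```python
# Maximum value accepted in the example above.
frames = 2**31 - 1

# Assumed fixed cost per frame, inferred from the quoted total (not measured).
frame_bytes = 88

total_bytes = frames * frame_bytes
total_gib = total_bytes / 2**30
```

Any per-frame locals or evaluation stack space would come on top of this, so the figure is a lower bound.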

uneven raptor Aug 23, 2024, 1:07 AM

#

don’t some psychos have like 256 gb ram these days

feral island Aug 23, 2024, 1:07 AM

#

sure you can get a machine with tons of memory and get a little further

#

doesn't make any real difference to the answer

uneven raptor Aug 23, 2024, 1:08 AM

#

yeah, fair. it's a fun exercise though!

raven ridge Aug 23, 2024, 1:08 AM

#

the interesting fact here is that there's no longer any reason why there must be a recursion limit because Python frames no longer take up space on the C stack

#

but, when a Python frame calls into a callable that's implemented in C, that adds an extra frame to the C stack.

uneven raptor Aug 23, 2024, 1:09 AM

#

i think it's helpful for debugging, a RecursionError is nicer than seeing a segmentation fault

raven ridge Aug 23, 2024, 1:09 AM

#

you won't get a segmentation fault from the code I shared above - try it

dusk comet Aug 23, 2024, 1:10 AM

#

raven ridge the interesting fact here is that there's no longer any reason why there must be...

it prevents errors in a friendly way
I would prefer to get recursion error after 1000 calls, instead of python consuming 50gb out of 16gb physical memory

uneven raptor Aug 23, 2024, 1:10 AM

#

oh, it just runs infinitely, and then linux kills the process eventually. i would prefer a segfault!

raven ridge Aug 23, 2024, 1:11 AM

#

linux kills the process because your machine runs out of memory

uneven raptor Aug 23, 2024, 1:11 AM

#

right, that's not exactly nice for debugging

dusk comet Aug 23, 2024, 1:11 AM

#

windows just slows down a lot, because CPU is busy compressing/decompressing memory to/from swap

raven ridge Aug 23, 2024, 1:12 AM

#

it'll eventually die even on Windows. You don't have unlimited swap

feral island Aug 23, 2024, 1:12 AM

#

yes, I think that's why we didn't just remove the recursion limit in 3.11

#

a quick(ish) RecursionError is much better for users than eating all your memory
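
A minimal sketch of that friendlier failure mode: unbounded recursion stops with a catchable `RecursionError` once the limit is reached. The helper name `max_depth` and the limit of 2000 are made up for illustration, and the exact depth reached depends on how many frames are already on the stack.

```py
import sys


def max_depth(limit):
    """Recurse until RecursionError and return how many calls succeeded."""
    previous_limit = sys.getrecursionlimit()
    sys.setrecursionlimit(limit)
    depth = 0

    def recurse():
        nonlocal depth
        depth += 1
        recurse()

    try:
        recurse()
    except RecursionError:
        pass  # the interpreter stopped us before memory ran out
    finally:
        sys.setrecursionlimit(previous_limit)
    return depth


print(max_depth(2000))  # somewhat below 2000
```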

raven ridge Aug 23, 2024, 1:12 AM

#

that, and the fact that you can still overflow the C stack when calling stuff implemented in C

#

if the recursion limit were removed entirely, whether or not you get stack overflows would depend on whether or not the stuff you're calling is implemented in Python. You'd need to know implementation details of a lot of stuff in order to reason about your program's correctness

uneven raptor Aug 23, 2024, 1:14 AM

#

raven ridge You can try: ```py import sys sys.setrecursionlimit(2**31-1) def a(): a() ...

it gives me a pretty segfault on 3.9, FWIW

raven ridge Aug 23, 2024, 1:15 AM

#

yep - any version before 3.11 will overflow the stack and segfault, any version from 3.11 on won't

dusk comet Aug 23, 2024, 1:15 AM

#

raven ridge that, and the fact that you can still overflow the C stack when calling stuff im...

def foo(x):
  sorted([...], key=foo)
foo(...)

(foo calls sorted, sorted calls foo, and so on...)
will the C stack grow with each recursion level?

am I understanding correctly that this will not grow the C stack: def foo(): foo() ?

uneven raptor Aug 23, 2024, 1:16 AM

#

why isn't faulthandler enabled by default? it doesn't affect runtime performance

raven ridge Aug 23, 2024, 1:16 AM

#

dusk comet ```py def foo(x): sorted([...], key=foo) foo(...) ``` (foo calls sorted, sorte...

yes, and yes

#

technically, this also depends on whether a custom frame evaluation function has been installed. If so, even calls from a Python function into a Python function take up extra call frames on the C stack.

#

I'm not aware of anything interesting that actually makes use of custom frame eval functions, but the API for them exists, and if something were to use them, it would disable the optimization that allows Python functions to call into Python functions without consuming stack space (yet another good reason for keeping the recursion limit)

uneven raptor Aug 23, 2024, 1:31 AM

#

i didn’t even know that existed

grave jolt Aug 23, 2024, 1:35 AM

#

uneven raptor don’t some psychos have like 256 gb ram these days

If you want to test stuff... AWS offers instances with up to 32TB of RAM

uneven raptor Aug 23, 2024, 1:36 AM

#

that sounds cheap!

grave jolt Aug 23, 2024, 1:40 AM

#

uneven raptor that sounds cheap!

$407.68 per hour apparently... though it does include almost 900 CPU cores, they don't offer 1 CPU with 32 TB of RAM

#

the 3TB one is more reasonable at $20/h

raven ridge Aug 23, 2024, 1:42 AM

#

if someone wants to spend $20, I'd be curious whether 2 billion Python frames takes more or less than 3 TB of memory 😄

grave jolt Aug 23, 2024, 1:42 AM

#

If you can figure it out in 15 minutes, it's only $5 🙂

uneven raptor Aug 23, 2024, 2:00 AM

#

would be an interesting stress test for cpython

uneven raptor Aug 23, 2024, 2:02 AM

#

grave jolt $407.68 per hour apparently... though it does include almost 900 CPU cores, they...

1 cpu with 32 TB would be somewhat comical

jade raven Aug 23, 2024, 3:19 AM

#

grave jolt $407.68 per hour apparently... though it does include almost 900 CPU cores, they...

Spot instances are a lot less, but it’s a lot harder to get access to them (the more powerful ones anyways)

quick snow Aug 23, 2024, 10:22 AM

#

raven ridge I'm not aware of anything interesting that actually makes use of custom frame ev...

Will it not also be disabled when a trace/profile function is installed?

dusk comet Aug 23, 2024, 3:28 PM

#

uneven raptor 1 cpu with 32 TB would be somewhat comical

quick Google search suggests that typical ram bandwidth is not much more than 50 GB/s
32 TB / 50 GB/s = 640 s = 10.7 minutes just to write these frames to memory
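
The same estimate as a quick sketch, assuming decimal units (1 TB = 1000 GB) and a sustained 50 GB/s; both are simplifications, and the function name is only illustrative.

```py
def write_time_seconds(total_tb, bandwidth_gb_per_s):
    """Time to write total_tb terabytes at the given bandwidth."""
    return total_tb * 1000 / bandwidth_gb_per_s


seconds = write_time_seconds(32, 50)
print(seconds, round(seconds / 60, 1))  # 640.0 10.7
```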

pearl river Aug 23, 2024, 5:43 PM

#

that's if the 32TB were all in one stick, which is impossible - i think in theory you need to also divide by the number of channels. but of course they can't even be on one motherboard, so who knows how many channels are there.

raven ridge Aug 23, 2024, 5:54 PM

#

I don't think it matters whether they're all on one stick or not... The writes would all have to be serial rather than parallel, regardless, just by nature of this being a stack

pearl river Aug 23, 2024, 5:56 PM

#

ah, that's true

reef night Aug 24, 2024, 3:38 PM

#

Guys im making a database but i forgot what did the cursor do

winged sphinx Aug 24, 2024, 3:39 PM

#

reef night Guys im making a database but i forgot what did the cursor do

Ask in #databases

reef night Aug 24, 2024, 3:40 PM

#

Oo thankuu

crude anvil Aug 25, 2024, 5:28 PM

#

Question: AFAIK you can't change methods of builtin classes such as str, and I want to do just that (specifically, I want to print every declared string when running an arbitrary program). I'm thinking of rebuilding the Python library with that change and using that executable instead. How can I go about doing that? Or is there a better solution that will allow me to change built-in methods in Python?

halcyon trail Aug 25, 2024, 6:36 PM

#

Is that really your only option? Did you consider creating a type that inherits from the str-like type designed for user inheritance, and changing the functionality there?

pearl river Aug 25, 2024, 7:19 PM

#

You might be able to do it with fishhook or a similar library, but first consider how much of a nightmare it would be to print something every time a string is created anywhere in the interpreter. For one, you will likely hit the problem that printing something itself requires making a string.

spark magnet Aug 25, 2024, 7:30 PM

#

crude anvil Question: AFAIK you can't change methods of builtin classes such as str, and I wa...

can you say more about "print every declared string"? Tell us the larger problem, and how printing the strings will help.

pliant tusk Aug 25, 2024, 8:35 PM

#

crude anvil Question: AFAIK you can't change methods of builtin classes such as str, and I wa...

!e ```py
from fishhook.asm import get_interned_strings_dict
from fishhook import hook, orig
import sys

interned = get_interned_strings_dict()

oldnames = interned.copy()
def audithook(*args):
    for key in interned:
        if key not in oldnames:
            print('[audithook] new string:', key)
            oldnames[key] = key

for method in ['__add__', '__mul__', '__getitem__']:
    @hook(str, name=method)
    def strhook(self, *args, method=method):
        ret = orig(self, *args)
        if type(ret) is str and ret not in oldnames:
            oldnames[ret] = ret
            print(f'[str.{method}] new string', ret)
        return ret

sys.addaudithook(audithook)

eval("'newname'")
```

fallen slateBOT Aug 25, 2024, 8:35 PM

#

pliant tusk !e ```py from fishhook.asm import get_interned_strings_dict from fishhook import...

:white_check_mark: Your 3.12 eval job has completed with return code 0.

[audithook] new string: newname

crude anvil Aug 25, 2024, 10:20 PM

#

pliant tusk !e ```py from fishhook.asm import get_interned_strings_dict from fishhook import...

This doesn't print declared strings normally (not in eval), so it might not work for my case. Of course, I don't quite understand the hooking technique/library you used, so I might be wrong.

pliant tusk Aug 25, 2024, 10:21 PM

#

if strings are declared in the same script then they are generated and stored before the script actually runs

#

if you want it to work for declared strings then you need to import your code after the hooks have been added
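
A small illustration of why the import order matters: string literals are materialized when the source is compiled, before any of it executes. This sketch only uses the builtin `compile()` and `exec()`; the variable name and the literal are made up.

```py
source = "greeting = 'declared literal'"
code = compile(source, "<example>", "exec")

# The literal already exists as a constant on the code object,
# even though the assignment has not run yet.
print(code.co_consts)

namespace = {}
exec(code, namespace)

# Running the code just binds the name to that pre-built object.
print(any(const is namespace["greeting"] for const in code.co_consts))
```

So hooks installed by the same script come too late for that script's own literals; they only see literals from code compiled afterwards.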

crude anvil Aug 25, 2024, 10:22 PM

#

spark magnet can you say more about "print every declared string"? Tell us the larger proble...

This idea came to me while doing some reverse engineering; for, let's say, obfuscated Python code, this would be helpful in debugging it.

pliant tusk Aug 25, 2024, 10:26 PM

#

crude anvil This doesn't print declared strings normally (not in eval), so it might not work ...

audithooks are builtin to python now, they get called by certain C functions as it evaluates code.

crude anvil Aug 25, 2024, 10:26 PM

#

pliant tusk if you want it to work for declared strings then you need to import your code af...

Got u!

pliant tusk Aug 25, 2024, 10:26 PM

#

!d sys.addaudithook

fallen slateBOT Aug 25, 2024, 10:26 PM

#

sys.addaudithook


```py
sys.addaudithook(hook)
```
Append the callable *hook* to the list of active auditing hooks for the current (sub)interpreter.

When an auditing event is raised through the [`sys.audit()`](https://docs.python.org/3/library/sys.html#sys.audit) function, each hook will be called in the order it was added with the event name and the tuple of arguments. Native hooks added by [`PySys_AddAuditHook()`](https://docs.python.org/3/c-api/sys.html#c.PySys_AddAuditHook) are called first, followed by hooks added in the current (sub)interpreter. Hooks can then log the event, raise an exception to abort the operation, or terminate the process entirely.
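
A minimal sketch of the hook signature described above, using only `sys.addaudithook()` and `sys.audit()` (Python 3.8+). The event name `example.custom_event` and the recording list are made up for illustration; real events such as `exec` or `import` are raised by the interpreter itself. Note that audit hooks cannot be removed once added.

```py
import sys

seen = []


def record_example_events(event, args):
    # Hooks receive every audit event, so filter on the one we care about.
    if event == "example.custom_event":
        seen.append((event, args))


sys.addaudithook(record_example_events)

# Raise a custom event; the hook gets the name and a tuple of arguments.
sys.audit("example.custom_event", "payload", 42)
print(seen)  # [('example.custom_event', ('payload', 42))]
```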

pliant tusk Aug 25, 2024, 10:27 PM

#

and fishhook is a library i wrote that allows for hooking C class methods (eg: str.__getitem__) and hooking raw C functions on supported platforms

crude anvil Aug 25, 2024, 10:33 PM

#

pliant tusk if strings are declared in the same script then they are generated and stored be...

can these be printed some other way too (like the idea I suggested before)?

pliant tusk Aug 25, 2024, 10:40 PM

#

You can print all declared strings that made it into the interned strings dictionary by making oldnames initialized to an empty dict

#

You can also work with the interned strings dict directly using the function exposed by fishhook.asm

crude anvil Aug 25, 2024, 10:42 PM

#

pliant tusk You can print all declared strings that made it into the interned strings dictio...

can you elaborate? I'm not that familiar with Python internals.

pliant tusk Aug 25, 2024, 10:43 PM

#

Python holds references to strings that are considered interned, and will reuse those references

#

Those references are stored in a dictionary that is normally not accessible

#

Fishhook.asm has an example that grabs the interned strings
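
The reuse of interned references can be seen without any third-party library through `sys.intern()`. This is only a sketch of the interning behaviour; it does not touch the internal dictionary itself, and the string used is arbitrary.

```py
import sys

# Build two equal strings at runtime.
first = "".join(["new", "name"])
second = "".join(["new", "name"])

# Interning maps equal strings to one shared object.
interned_first = sys.intern(first)
interned_second = sys.intern(second)

print(first == second)                    # True
print(interned_first is interned_second)  # True
```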

crude anvil Aug 25, 2024, 10:46 PM

#

pliant tusk Fishhook.asm has an example that grabs the interned strings

so are the strings that are declared afterwards in the same script included in that dictionary too?

from fishhook.asm import get_interned_strings_dict
from fishhook import hook, orig
import sys

interned = get_interned_strings_dict()

oldnames = interned.copy()
def audithook(*args):
    for key in interned:
        if key not in oldnames:
            print('[audithook] new string:', key)
            oldnames[key] = key

for method in ['__add__', '__mul__', '__getitem__']:
    @hook(str, name=method)
    def strhook(self, *args, method=method):
        ret = orig(self, *args)
        if type(ret) is str and ret not in oldnames:
            oldnames[ret] = ret
            print(f'[str.{method}] new string', ret)
        return ret

sys.addaudithook(audithook)

"newname" # <---- this one ?

pliant tusk Aug 25, 2024, 10:50 PM

#

Yes in most cases

spark magnet Aug 25, 2024, 10:51 PM

#

@crude anvil can you say more about what you mean by "declared" strings? String literals? Or the computed values of strings at runtime?

#

in particular, Python doesn't have variable declarations as such.

crude anvil Aug 25, 2024, 10:53 PM

#

spark magnet @crude anvil can you say more about what you mean by "declared" strings...

string literals and computed values of strings at runtime.

crude anvil Aug 25, 2024, 10:53 PM

#

pliant tusk Python holds references to strings that are considered interned, and will reuse ...

I think @pliant tusk's answer is comprehensive for now

spark magnet Aug 25, 2024, 10:54 PM

#

crude anvil string literals **and** computed values of strings at runtime.

another possibility if you need it is to observe Python execution with trace functions or sys.monitoring.
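
A rough sketch of the trace-function route, using `sys.settrace()` to record every string returned from a Python function. The function `build_greeting` and the `captured` list are hypothetical; this only sees return values of Python-level calls, not every string the interpreter creates.

```py
import sys

captured = []


def tracer(frame, event, arg):
    # On "return" events, arg is the value being returned.
    if event == "return" and isinstance(arg, str):
        captured.append((frame.f_code.co_name, arg))
    return tracer  # keep tracing inside this frame


def build_greeting(name):
    return "hello, " + name


sys.settrace(tracer)
try:
    build_greeting("world")
finally:
    sys.settrace(None)  # always switch tracing back off

print(captured)  # [('build_greeting', 'hello, world')]
```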

pliant tusk Aug 25, 2024, 10:54 PM

#

I feel like for debugging/reversing at that level you would be better suited with a c level debugger

#

If it's really so obfuscated

pliant tusk Aug 25, 2024, 10:55 PM

#

spark magnet another possibility if you need it is to observe Python execution with trace fun...

Oh sys.monitoring is a good idea, haven't done much with it yet since it was added

crude anvil Aug 25, 2024, 10:57 PM

#

spark magnet another possibility if you need it is to observe Python execution with trace fun...

cool, I'll check that out!

spark magnet Aug 25, 2024, 11:01 PM

#

pliant tusk Oh sys.monitoring is a good idea, haven't done much with it yet since it was add...

it's changing as we speak also: https://github.com/python/cpython/pull/122564

#

(because after 18 months of telling him, mark finally listened)

pliant tusk Aug 25, 2024, 11:04 PM

#

Oh that co_branches method for code objects will simplify my bytecode decompiler/recompiler

#

Since right now I walk the bytecode to find branches

spark magnet Aug 25, 2024, 11:04 PM

#

pliant tusk Since right now I walk the bytecode to find branches

i walk the ast

pliant tusk Aug 25, 2024, 11:04 PM

#

Makes sense, better than walking the bytecode

spark magnet Aug 25, 2024, 11:05 PM

#

i started by looking at bytecode, which is how I realized that bytecode can be very complicated

pliant tusk Aug 25, 2024, 11:06 PM

#

Yea it's tricky, but my code is intended to run after the functions have been compiled so I don't have ast at that point

#

So it'll be nice to take advantage of a method that gets the info I need before the source is discarded

spark magnet Aug 25, 2024, 11:06 PM

#

thanks for mentioning co_branches, i need to see how to make use of that. maybe I can scrap all the ast code.

rose schooner Aug 25, 2024, 11:07 PM

#

pliant tusk Oh that co_branches method for code objects will simplify my bytecode decompiler...

woa

#

that's nice

spark magnet Aug 25, 2024, 11:08 PM

#

overall, it's been a lot of work to adapt coverage.py to sys.monitoring, and it's not done yet.

#

i'm right now running tests on a +621 -1000 branch

pliant tusk Aug 25, 2024, 11:10 PM

#

Seems like it'll simply it once sys.monitoring is fully implemented

spark magnet Aug 25, 2024, 11:11 PM

#

"simplify": yes

pliant tusk Aug 25, 2024, 11:11 PM

#

Definitely a lot to change tho

spark magnet Aug 25, 2024, 11:11 PM

#

but it will be a few years before 3.14 is the minimum python version

uneven raptor Aug 25, 2024, 11:16 PM

#

the five year EOL is a killer for libraries

spark magnet Aug 25, 2024, 11:17 PM

#

uneven raptor the five year EOL is a killer for libraries

if it was really a problem, i could just bump the minimum version. People on 3.8, 3.9 would continue to use the coverage.py version that shipped on them.

uneven raptor Aug 25, 2024, 11:18 PM

#

yup. it's just a little unfortunate that libraries have to wait so long to get new features

spark magnet Aug 25, 2024, 11:19 PM

#

in this case, the benefit is to the user of the library (low-overhead coverage measurement)

winged sphinx Aug 26, 2024, 11:36 AM

#

Wrong channel. Use #career-advice

inland verge Aug 26, 2024, 11:37 AM

#

winged sphinx Wrong channel. Use #career-advice

sorry

thick hemlock Aug 26, 2024, 10:55 PM

#

spark magnet in this case, the benefit is to the user of the library (low-overhead coverage m...

got me thinking about running coverage.py in production😆

raven ridge Aug 27, 2024, 4:24 AM

#

please delete this. We don't allow soliciting paid work here.

#

!rule paid

fallen slateBOT Aug 27, 2024, 4:24 AM

#

Rules

9. Do not offer or ask for paid work of any kind.

faint river Aug 27, 2024, 4:24 AM

#

raven ridge please delete this. We don't allow soliciting paid work here.

they spammed it in other channels too

little robin Aug 27, 2024, 6:02 AM

#

How to juyprer notebook

hollow spear Aug 27, 2024, 6:04 AM

#

little robin How to juyprer notebook

https://www.youtube.com/watch?v=HW29067qVWk

YouTube

Corey Schafer

Jupyter Notebook Tutorial: Introduction, Setup, and Walkthrough

In this Python Tutorial, we will be learning how to install, setup, and use Jupyter Notebooks. Jupyter Notebooks have become very popular in the last few years, and for good reason. They allow you to create and share documents that contain live code, equations, visualizations and markdown text. This can all be run from directly in the browser. I...


little robin Aug 27, 2024, 6:05 AM

#

hollow spear https://www.youtube.com/watch?v=HW29067qVWk

Thank you

floral pivot Aug 27, 2024, 5:44 PM

#

PEP 318 quotes Guido as such:

> […] – with no new syntax, the magicness of a function like this is extremely high:
>
> Using functions with “action-at-a-distance” through sys.settraceback may be okay for an obscure feature […] The widely held view here is that decorators need to be added as a syntactic feature to avoid the problems with the postfix notation used in 2.2 and 2.3. […]

What is “the postfix notation used in 2.2 and 2.3”?

sour thistle Aug 27, 2024, 5:55 PM

#

I'm guessing that they mean func = decorator(func)?

also side note: idk what you're reading a PEP from the early 2000s for, but just keep in mind that old PEPs are effectively historical documents and not always representative of how the language works nowadays

grave jolt Aug 27, 2024, 5:56 PM

#

floral pivot [PEP 318](<https://peps.python.org/pep-0318/>) quotes Guido as such: > […] – wit...

According to https://mail.python.org/pipermail/python-dev/2004-September/048518.html

> I personally feel that prefix decorators are a huge improvement over the "f = staticmethod(f)" style of decorating.

I think etrotta might be correct
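
Assuming the quote does refer to rebinding the name after the function body, the two styles compare like this. The class and method names are only for illustration.

```py
class Postfix:
    # Pre-decorator style: define first, then rebind the name afterwards.
    def double(x):
        return x * 2
    double = staticmethod(double)


class Prefix:
    # Decorator syntax from PEP 318: the wrapper is named before the body.
    @staticmethod
    def double(x):
        return x * 2


print(Postfix.double(2), Prefix.double(2))  # 4 4
```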

#

In other news, it's kinda shocking how much stuff was added in each Python release back then.
Python 2.2 added

- new-style classes
- multiple inheritance (the new diamond rule for new-style classes)
- descriptors
- iterators
- generators (as an experimental feature)

then Python 2.3 added

- generators (stabilized)
- sets (the sets module)
- logging, csv
- bool (yeah)
- import hooks

feral island Aug 27, 2024, 6:04 PM

#

grave jolt In other news, it's kinda shocking how much stuff was added in each Python relea...

wasn't bool (infamously) added in 2.1.1?

#

but yeah, new stuff was being added at a much higher rate then

grave jolt Aug 27, 2024, 6:05 PM

#

feral island wasn't bool (infamously) added in 2.1.1?

https://docs.python.org/3/whatsnew/2.3.html#pep-285-a-boolean-type

feral island Aug 27, 2024, 6:05 PM

#

oh I see, they added True/False globals in 2.2.1 but not the bool type

uneven raptor Aug 27, 2024, 6:46 PM

#

what’s the difference internally between PyObject_Malloc and PyMem_Malloc? the docs don’t specify

raven ridge Aug 27, 2024, 6:49 PM

#

uneven raptor what’s the difference internally between `PyObject_Malloc` and `PyMem_Malloc`? t...

remember last week when I argued that the C API docs aren't great and you asked why I thought so? 😛

uneven raptor Aug 27, 2024, 6:49 PM

#

yes, you win 🙄

feral island Aug 27, 2024, 6:50 PM

#

uneven raptor what’s the difference internally between `PyObject_Malloc` and `PyMem_Malloc`? t...

I think this is about the three "domains" in https://docs.python.org/3/c-api/memory.html#allocator-domains ?

Python documentation

Memory Management

Overview: Memory management in Python involves a private heap containing all Python objects and data structures. The management of this private heap is ensured internally by the Python memory manag...

uneven raptor Aug 27, 2024, 6:51 PM

#

i saw that, but both object and mem say they “take from the python private heap.” are there two of them?

raven ridge Aug 27, 2024, 6:51 PM

#

no, both of them allocate using pymalloc

uneven raptor Aug 27, 2024, 6:52 PM

#

so they’re just “optimized differently”?

feral island Aug 27, 2024, 6:55 PM

#

if I'm reading the code right they point to the same underlying allocator function by default

raven ridge Aug 27, 2024, 6:56 PM

#

I think the answer is that the only reason both exist is to provide 2 different customization points, in case someone wants to use a different allocator for Python objects than for other stuff

feral island Aug 27, 2024, 6:56 PM

#

yes, I think you can use https://docs.python.org/3/c-api/memory.html#c.PyMem_SetAllocator to make one or the other behave differently


raven ridge Aug 27, 2024, 6:57 PM

#

yeah

#

in principle, the reason you might want to use a different allocator for objects than for not-objects is that objects tend to be small and very consistently sized, while other stuff can be much larger and have much less predictable sizes

feral island Aug 27, 2024, 7:00 PM

#

I wonder how significant the overhead is from allocations through a function pointer; Python tends to allocate small objects at a pretty high rate

#

I guess that's part of why we have freelists for some types

raven ridge Aug 27, 2024, 7:00 PM

#

raven ridge in principle, the reason you might want to use a different allocator for objects...

but pymalloc handles that reasonably well all on its own: it checks the size of the allocation up front, and then delegates to malloc (or, I guess to PyMem_RawMalloc) for large allocations

rare shale Aug 27, 2024, 7:01 PM

#

hello

raven ridge Aug 27, 2024, 7:01 PM

#

feral island I wonder how significant the overhead is from allocations through a function poi...

you mean the cost of the indirection itself? I'd guess that's far, far lower than the cost of the allocator - not free, but...

feral island Aug 27, 2024, 7:02 PM

#

yes, it's a few extra CPU instructions before you get to the actual allocator

raven ridge Aug 27, 2024, 7:02 PM

#

my understanding is that the freelists are mostly useful because they avoid calling the allocator, and they avoid needing to run part of the tp_new for some types

#

it can be cheaper to re-initialize an object retrieved from a freelist than to initialize a chunk of uninitialized memory

uneven raptor Aug 27, 2024, 7:06 PM

#

raven ridge I _think_ the answer is that the only reason both exist is to provide 2 differen...

i think the docs should clarify that 😄

raven ridge Aug 27, 2024, 7:07 PM

#

I agree 🙂

final geode Aug 27, 2024, 7:16 PM

#

raven ridge I _think_ the answer is that the only reason both exist is to provide 2 differen...

There’s actually a stronger (and super significant) difference on the new free-threaded builds: the “object” domain must be used for all objects… and only for objects: https://docs.python.org/3.13/howto/free-threading-extensions.html#memory-allocation-apis

Python documentation

C API Extension Support for Free Threading

Starting with the 3.13 release, CPython has experimental support for running with the global interpreter lock(GIL) disabled in a configuration called free threading. This document describes how to ...
