spark magnet Aug 27, 2023, 5:01 PM

#

It might be that it needs new syntax to handle def f(**kwargs): properly, so we can use def f(***all_args):

dusk comet Aug 27, 2023, 5:02 PM

#

```py
args = {'a': 1, 0: 98, 1: 99, 'b': 2}
def f(*args, **kwargs): return args, kwargs
```
if you pass `**args` to this function, positional args will be pulled out into `*args`, they will not be in `**kwargs`

spark magnet Aug 27, 2023, 5:03 PM

#

right, so we might need triple-star

raven ridge Aug 27, 2023, 5:03 PM

#

hm, yeah - my initial reaction is that it ought to raise TypeError when unpacking a dict containing positional args for a call to a function that doesn't accept positional args, but you're right that this would mean that wrapper functions would need to keep accepting *args.

#

maybe triple star is a good workaround for that... And reasonably intuitive, I guess

#

though if we had a triple star, I think we ought to have it both at the call site and the parameter list

spark magnet Aug 27, 2023, 5:04 PM

#

raven ridge though if we had a triple star, I think we ought to have it both at the call sit...

definitely

faint river Aug 27, 2023, 5:06 PM

#

***bargs = both args and kwargs lemon_smile

#

~~oh god imagine saying star star star bargs outloud in a class~~

raven ridge Aug 27, 2023, 5:09 PM

#

this seems like a reasonably good idea to me. I've wished for something like this occasionally. You can usually just pass all parameters by keyword, which isn't so awkward - but that doesn't work if the function has positional-only arguments, and then you are stuck building up both a sequence and a mapping for the positional vs keyword arguments

#

actually, though - if that's the only major use case for this - using one mapping to provide all the arguments to a function that uses positional-only arguments - maybe we don't need new syntax at all. Maybe we just need something new in functools

faint river Aug 27, 2023, 5:11 PM

#

raven ridge though if we had a triple star, I think we ought to have it both at the call sit...

def f(***bargs):
  print(bargs)
f("a", 123, flag=True, other_flag=False)

{0: "a", 1: 123, "flag": True, "other_flag": False}

?

raven ridge Aug 27, 2023, 5:11 PM

#

yeah, that's what I'd expect
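The behaviour sketched above can be approximated in today's Python with a decorator that packs positional and keyword arguments into one dict. This is only an illustration: `bargs` is a hypothetical name taken from the joke above, not an existing API, and `***` is not real syntax.

```python
import functools


def bargs(func):
    """Hypothetical decorator emulating a `***bargs` parameter (not real syntax)."""
    @functools.wraps(func)
    def wrapper(*args, **kwargs):
        # Positional arguments get integer keys 0, 1, 2, ...
        merged = dict(enumerate(args))
        # Keyword arguments keep their string names.
        merged.update(kwargs)
        return func(merged)
    return wrapper


@bargs
def f(all_args):
    return all_args


result = f("a", 123, flag=True, other_flag=False)
print(result)
```

Because positional keys are integers and keyword keys are strings, the two groups cannot collide inside the merged dict.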

faint river Aug 27, 2023, 5:13 PM

#

can't wait to see this in python 3.17 lemon_fingerguns

spark magnet Aug 27, 2023, 5:14 PM

#

faint river can't wait to see this in python 3.17 lemon_fingerguns

let me know when the pull request is ready to review 😄

faint river Aug 27, 2023, 5:19 PM

#

def f(a): ...
f(***{"a": "abc", 0: "def"})

what happens here?

raven ridge Aug 27, 2023, 5:20 PM

#

an error, just like if you did f(*["def"], **{"a": "abc"})

spark magnet Aug 27, 2023, 5:21 PM

#

i like that @raven ridge is writing the PEP for me!

faint river Aug 27, 2023, 5:21 PM

#

def f(a, **kwargs): ...
f(***{0: "def", "a": "abc"})

how about this?

#

would you get "a" in kwargs?

spark magnet Aug 27, 2023, 5:22 PM

#

faint river ```py def f(a, **kwargs): ... f(***{0: "def", "a": "abc"}) ``` how about this?

a = "def", kwargs = {"a": "abc"}
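For comparison, the closest spelling available today, `f(*["def"], **{"a": "abc"})`, only produces that result when `a` is positional-only. With an ordinary parameter the call fails because `a` is supplied twice. A small sketch (the names `f` and `g` are only for illustration):

```python
def f(a, **kwargs):
    return a, kwargs


def g(a, /, **kwargs):
    # `a` is positional-only, so a keyword named "a" lands in kwargs instead.
    return a, kwargs


try:
    f(*["def"], **{"a": "abc"})
    error = None
except TypeError as exc:
    error = str(exc)

result = g(*["def"], **{"a": "abc"})
print(error)
print(result)
```

So the proposed triple star would either need this positional-only behaviour for every parameter it fills by index, or it would raise like the first call does.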

faint river Aug 27, 2023, 5:22 PM

#

would think as such

#

now for the kicker

#

def f(a, **kwargs): ...
f(***{"a": "abc", 0: "def"})

#

(order is swapped)

spark magnet Aug 27, 2023, 5:23 PM

#

faint river ```py def f(a, **kwargs): ... f(***{"a": "abc", 0: "def"}) ```

that's the same result

faint river Aug 27, 2023, 5:23 PM

#

so first you must filter out all int keys and sort them I guess

spark magnet Aug 27, 2023, 5:24 PM

#

yes

#

and decide what this means: f(***{99: "a"})

#

(a typeerror)

raven ridge Aug 27, 2023, 5:24 PM

#

Maybe something in functools like
```py
def apply_mixed_args(func, mapping):
    pos = {}
    kwargs = {}
    for key, val in mapping.items():
        if isinstance(key, int):
            pos[key] = val
        else:
            kwargs[key] = val

    args = []
    try:
        for i in range(len(pos)):
            args.append(pos[i])
    except KeyError:
        raise TypeError("Arguments to pass positionally must be contiguous integers beginning with 0")

    return func(*args, **kwargs)
```

#

I've wanted this rarely enough that solving it with a new stdlib function seems better to me than solving it with new syntax
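A usage sketch of that helper against a function with positional-only parameters, which is the motivating case mentioned earlier. The helper is restated compactly so the example runs on its own; `divide` is a made-up function, and nothing like `apply_mixed_args` exists in `functools` today.

```python
def apply_mixed_args(func, mapping):
    """Call func using int keys as positional and other keys as keyword arguments."""
    pos = {k: v for k, v in mapping.items() if isinstance(k, int)}
    kwargs = {k: v for k, v in mapping.items() if not isinstance(k, int)}
    try:
        # Integer keys must be exactly 0..len(pos)-1; insertion order is irrelevant.
        args = [pos[i] for i in range(len(pos))]
    except KeyError:
        raise TypeError(
            "Arguments to pass positionally must be contiguous integers beginning with 0"
        ) from None
    return func(*args, **kwargs)


def divide(numerator, denominator, /, *, ndigits=None):
    # Hypothetical function: two positional-only parameters, one keyword-only.
    value = numerator / denominator
    return value if ndigits is None else round(value, ndigits)


quotient = apply_mixed_args(divide, {"ndigits": 2, 1: 4, 0: 1})

try:
    apply_mixed_args(divide, {99: "a"})
    gap_error = None
except TypeError as exc:
    gap_error = str(exc)

print(quotient, gap_error)
```

The mapping's key order does not matter, and a gap such as `{99: "a"}` is rejected up front, which matches the answers given in the conversation above.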

faint river Aug 27, 2023, 5:26 PM

#

it does seem like a very niche problem for new syntax yeah. ~~matmul operator is cowering~~

raven ridge Aug 27, 2023, 5:27 PM

#

naming that function might be the toughest part, heh

faint river Aug 27, 2023, 5:28 PM

#

apply_bargs lemon_fingerguns_shades

spark magnet Aug 27, 2023, 5:28 PM

#

barge_into_function

raven ridge Aug 27, 2023, 5:29 PM

#

heh

faint river Aug 27, 2023, 5:29 PM

#

apply_neds_bats

feral cedar Aug 27, 2023, 5:29 PM

#

kool_aid

raven ridge Aug 27, 2023, 5:33 PM

#

the idea of a new stdlib function sidesteps a lot of the issues and ambiguities we mentioned above, too. There's no question of what happens if you do
```py
def foo(**kwargs):
    pass

apply_mixed_args(foo, {0: 42})
```
It's an error, because you pass positional args to a function that doesn't take them. There's no question of what happens if you do foo(*args, **kwargs, ***bargs) in one function call, because - well, you can't.

#

and I suspect that selling people on triple star would be considerably harder than selling them on an enhanced double star, but you're right - the enhanced double star couldn't be transparently proxied, so callables that currently accept *args and **kwargs would need to keep doing so in the future even if f(**{0: 42}) could be used at the call site.

#

there's another nasty case that happens if you allow two-star f(**{0: 42}) actually: the implementation would need to detect f(**{0: 42, 1: 43}, **{0: 10}) and raise a TypeError for that as well
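For reference, this is how current Python already treats the two related situations: a keyword repeated across two `**` unpackings, and a non-string key in a `**` unpacking. Both raise `TypeError`. A quick sketch (the exact message wording may vary between versions):

```python
def f(*args, **kwargs):
    return args, kwargs


errors = []
calls = (
    lambda: f(**{"a": 1}, **{"a": 2}),  # same keyword supplied twice
    lambda: f(**{0: 42}),               # non-string key in ** unpacking
)
for call in calls:
    try:
        call()
    except TypeError as exc:
        errors.append(str(exc))

for message in errors:
    print(message)
```

An enhanced double star would have to extend the first check to integer keys as well.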

#

I guess my opinion is that triple star is a bad idea (too magical for too niche a feature, especially for something that's possible today and just inconvenient). I think a new stdlib function would be enough, but if we did any new syntax, I don't think it should be more than just allowing integer keys in mappings unpacked with ** in a function call, without changing the behavior for **kwargs parameters in a function at all - they'd still only receive the keyword arguments, and you'd still need to use *args to receive positional arguments.

dusk comet Aug 27, 2023, 5:50 PM

#

spark magnet right, so we might need triple-star

yeah, i was literally thinking about that 🙂

dusk comet Aug 27, 2023, 6:08 PM

#

raven ridge an error, just like if you did `f(*["def"], **{"a": "abc"})`

even simpler example:
```py
def f(a): ...
...

f(0, a=0)
╭─── Traceback (most recent call last) ────╮
│ in <module> │
│ ╭─ locals ─╮ │
│ │ f = f │ │
│ ╰──────────╯ │
╰──────────────────────────────────────────╯
TypeError: f() got multiple values for argument 'a'
```

#

def _(a, b=0, /, c=1, *args, *, d=2, **e, ***f): ...

syntax hell

charred wagon Aug 27, 2023, 6:16 PM

#

Where is os.truncate/os.ftruncate implemented for linux? I traced it to https://github.com/python/cpython/blob/042aa88bcc6541cb8b312f1119452f7a58a5b4df/Modules/clinic/posixmodule.c.h#L8067 but now I'm lost

fallen slateBOT Aug 27, 2023, 6:16 PM

#

Modules/clinic/posixmodule.c.h line 8067

```c
os_ftruncate_impl(PyObject *module, int fd, Py_off_t length);
```

raven ridge Aug 27, 2023, 6:19 PM

#

charred wagon Where is `os.truncate`/`os.ftruncate` implemented for linux? I traced it to http...

https://github.com/python/cpython/blob/042aa88bcc6541cb8b312f1119452f7a58a5b4df/Modules/posixmodule.c#L11784

fallen slateBOT Aug 27, 2023, 6:19 PM

#

Modules/posixmodule.c line 11784

```c
os_ftruncate_impl(PyObject *module, int fd, Py_off_t length)
```

charred wagon Aug 27, 2023, 6:19 PM

#

Thanks. GitHub search did not pick that up for some reason

#

I wonder why its docs say

Truncate the file corresponding to file descriptor fd, so that it is at most length bytes in size

When the linux manual says

The truncate() and ftruncate() functions cause the regular file
named by path or referenced by fd to be truncated to a size of
precisely length bytes.

My guess is the Python docs are trying to be more general, and for some platforms they cannot make the stronger guarantee of "precisely length bytes". Python doesn't seem to do anything special with the length when it calls ftruncate.

#

Or maybe I am misinterpreting the way it's worded

raven ridge Aug 27, 2023, 6:30 PM

#

POSIX says:

If fildes refers to a regular file, the ftruncate() function shall cause the size of the file to be truncated to length. If the size of the file previously exceeded length, the extra data shall no longer be available to reads on the file. If the file previously was smaller than this size, ftruncate() shall increase the size of the file. If the file size is increased, the extended area shall appear as if it were zero-filled. The value of the seek pointer shall not be modified by a call to ftruncate().

#

Old versions instead said:

If the file previously was smaller than this size, ftruncate() shall either increase the size of the file or fail. [XSI] [Option Start] XSI-conformant systems shall increase the size of the file. [Option End]
but even that doesn't allow for ftruncate to succeed without setting the size of the file to exactly the given length. So... 🤷‍♂️

charred wagon Aug 27, 2023, 6:36 PM

#

Thanks. Yeah, that is confusing. I will just trust the linux manual on this.

#

It's the behaviour I observed in practical tests anyway
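A small test along those lines, assuming a platform where `os.ftruncate` is available. It checks that the file ends up at exactly the requested length both when shrinking and when growing, and that the extended area reads back as zeros.

```python
import os
import tempfile

with tempfile.TemporaryFile() as fh:
    fh.write(b"x" * 10)
    fh.flush()
    fd = fh.fileno()

    os.ftruncate(fd, 4)            # shrink below the current size
    shrunk = os.fstat(fd).st_size

    os.ftruncate(fd, 32)           # grow beyond the current size
    grown = os.fstat(fd).st_size

    fh.seek(0)
    contents = fh.read()

print(shrunk, grown, contents)
```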

grave jolt Aug 27, 2023, 9:32 PM

#

the opposite of cringe

spark magnet Aug 27, 2023, 9:44 PM

#

grave jolt the opposite of cringe

so it's a good thing?

grave jolt Aug 27, 2023, 9:44 PM

#

yes!

spark magnet Aug 27, 2023, 9:49 PM

#

it always strikes me as the opposite. I'll have to try to remember 🙂

sand goblet Aug 28, 2023, 12:54 AM

#

https://bugs.python.org/issue21644
here people are talking about using calloc in bytearray.__init__

#

They mention it makes initialization faster, but it looks like it makes it MUCH faster, at least on my Windows 10.

#

static int
bytearray___init___impl(PyByteArrayObject *self, PyObject *arg,
                        const char *encoding, const char *errors)
/*[clinic end generated code: output=4ce1304649c2f8b3 input=1141a7122eefd7b9]*/
{
    void *sval;
    Py_ssize_t count;
    PyObject *it;
    PyObject *(*iternext)(PyObject *);
    
    //
    // existing code
    //
    
    /* Is it an int? */
    if (_PyIndex_Check(arg)) {
        count = PyNumber_AsSsize_t(arg, PyExc_OverflowError);
        if (count == -1 && PyErr_Occurred()) {
            if (!PyErr_ExceptionMatches(PyExc_TypeError))
                return -1;
            PyErr_Clear();  /* fall through */
        }
        else {
            if (count < 0) {
                PyErr_SetString(PyExc_ValueError, "negative count");
                return -1;
            }
            if (count > 0) {
                if (self->ob_alloc == 0) { // new bytearray
                    if (!_canresize(self))
                        return -1;
                    // remember to avoid overflow by using size_t. see issue #22335.
                    sval = PyObject_Calloc((size_t)count + 1, 1); // + 1 for null terminator
                    if (sval == NULL) {
                        PyErr_NoMemory();
                        return -1;
                    }
                    self->ob_bytes = self->ob_start = sval;
                    Py_SET_SIZE(self, count);
                    self->ob_alloc = (size_t)count + 1;
                    return 0;
                }
                if (PyByteArray_Resize((PyObject *)self, count))
                    return -1;
                memset(PyByteArray_AS_STRING(self), 0, count);
            }
            return 0;
        }
    }

#

here's how I timed this change:

#

from timeit import timeit

setup = """
def f(n):
    b = bytearray(n)
    return b
"""

for n in range(12):
    print(timeit(stmt=f"f({n**10})", setup=setup, number=1000))

#

and here are the timing results:

times using calloc:
0.00034580007195472717
0.00039679999463260174
0.0008060999680310488
0.0025179999647662044
0.011856100056320429
0.011817399994470179
0.013045100029557943
0.016800999990664423
0.03710949991364032
0.04992749996017665
0.1916151000186801
0.5652574999257922

times without using calloc:
0.00024830002803355455
0.00025889999233186245
0.000635799951851368
0.0014845000114291906
0.2839431999018416
2.6696265999926254
15.101725699962117
74.59119629999623
I gave up here

#

someone should update the __init__ method so it uses calloc.

#

They mention a bug with the other person's change not detecting an existing memoryview, but that's solved easily by just checking for it using _canresize

rose schooner Aug 28, 2023, 1:37 AM

#

sand goblet someone should update the `__init__` method so it uses calloc.

i think i had this done once in a PR

#

maybe just a local change

rose schooner Aug 28, 2023, 1:42 AM

#

rose schooner maybe just a local change

yeah just a local change

#

although uh

sand goblet Aug 28, 2023, 1:44 AM

#

Maybe it's because it's a little faster with smaller sizes

rose schooner Aug 28, 2023, 1:44 AM

#

sand goblet and here are the timing results: ``` times using calloc: 0.00034580007195472717 ...

this seems to be only worth it when the size allocated exceeds 1 MB

sand goblet Aug 28, 2023, 1:44 AM

#

Yeah

rose schooner Aug 28, 2023, 1:44 AM

#

and there's not many cases where someone needs to allocate 1 MB with bytearray()... right?

sand goblet Aug 28, 2023, 1:45 AM

#

Are there a lot of cases where they need to allocate a large number of small bytearrays?

#

Either way, allocating a large one is still a one-time thing, so I guess it shouldn't matter either way

dusk comet Aug 28, 2023, 12:20 PM

#

static hinge Aug 28, 2023, 4:58 PM

#

do/while will always run at least once because the condition is after the block.

#

it's kind of like chutes and ladders

misty oxide Aug 28, 2023, 5:29 PM

#

I'm writing a Bytecode -> Bytecode transform, and trying to support nonlocal variables.

My idea was to replace all freevar LOAD_DEREF instructions with a chain of instructions that does fn.__closure__[instr.arg - len(code.co_cellvars)].cell_contents, where fn is the python object of the currently executing function, and code is the code object being translated. Then, I turn all cellvars into local variables.

The replacement itself is sound. It's what CPython does under the hood, so I'm just replacing one instruction with many.

What I'm worried about is what happens if I decide to empty the list of cellvars and freevars. Am I allowed to do that? Will something in python function creation or execution go wrong if I don't leave a proper trail of freevars and cellvars?

feral island Aug 28, 2023, 5:36 PM

#

you probably need to make sure that the relevant fields on the code object match reality

misty oxide Aug 28, 2023, 5:37 PM

#

Probably

#

But, even if there are no STORE_DEREF/LOAD_DEREF instructions in the bytecode?
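To make the "fields must match reality" point concrete, here is a sketch of how the closure tuple lines up with `co_freevars`, and of one check CPython performs at function-creation time. The names `outer` and `inner` are illustrative, and this does not cover every place the interpreter consults these fields.

```python
import types


def outer():
    x = 10
    y = 20

    def inner():
        return x + y

    return inner


fn = outer()
code = fn.__code__

# Each entry in co_freevars has a matching cell at the same index in __closure__.
free_values = {
    name: cell.cell_contents
    for name, cell in zip(code.co_freevars, fn.__closure__)
}

# Building a function from code that declares free variables, without supplying
# a matching closure, is rejected when the function object is created.
try:
    types.FunctionType(code, globals())
    rejected = False
except (TypeError, ValueError):
    rejected = True

print(free_values, outer.__code__.co_cellvars, rejected)
```

So at minimum, `co_freevars` and the closure passed at function creation have to agree, regardless of which instructions remain in the bytecode.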

misty oxide Aug 28, 2023, 6:18 PM

#

A related follow-up question that I just thought of. In python 3.11+, is it okay if there's a nonlocal variable and a local variable with the same name?

feral island Aug 28, 2023, 6:20 PM

#

it isn't possible. if you generate your own bytecode and code objects you can probably make it work, but it will be fragile

misty oxide Aug 28, 2023, 6:22 PM

#

👍

tacit hawk Aug 28, 2023, 9:14 PM

#

Is there some C memcpy equivalent for Python's bytearrays?

buffer = bytearray(16)
data = b'1234'
buffer[:8] = data # len is now 16 - 4 = 12, it should be still 16

pliant tusk Aug 28, 2023, 9:20 PM

#

tacit hawk Is there some C memcpy equivalent for Python's bytearrays? ```py buffer = bytea...

that code is replacing the first 8 bytes with data, which is only 4 bytes, so the buffer shrinks. you need to adjust your slice

#

!e ```py
data = b'1234'

# what you did:
buffer = bytearray(16)
buffer[:8] = data
print('replace first 8 with data:', buffer)

# replace first 4
buffer = bytearray(16)
buffer[:4] = data
print('replace first 4 with data:', buffer)

# replace last 4 of first half
buffer = bytearray(16)
buffer[4:8] = data
print('replace last 4 of first half with data:', buffer)
```

fallen slateBOT Aug 28, 2023, 9:25 PM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | replace first 8 with data: bytearray(b'1234\x00\x00\x00\x00\x00\x00\x00\x00')
002 | replace first 4 with data: bytearray(b'1234\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00')
003 | replace last 4 of first half with data: bytearray(b'\x00\x00\x00\x001234\x00\x00\x00\x00\x00\x00\x00\x00')

swift imp Aug 28, 2023, 9:55 PM

#

PEP 727 proposal looks delicious

#

Opens up a lot of code de-duplication

steel solstice Aug 28, 2023, 9:57 PM

#

how so?

swift imp Aug 28, 2023, 9:57 PM

#

Like pandas has its doc decorators that transform and append docstrings across various methods and functions. You could instead define types that are annotated with a doc and bake it into the type annotation

#

Like these two https://github.com/pandas-dev/pandas/blob/ad0eb141fce3a77a08aead0bfffa888a13bbaedc/pandas/util/_decorators.py#L408

fallen slateBOT Aug 28, 2023, 9:58 PM

#

pandas/util/_decorators.py line 408

```py
class Substitution:
```

swift imp Aug 28, 2023, 9:59 PM

#

https://github.com/pandas-dev/pandas/blob/ad0eb141fce3a77a08aead0bfffa888a13bbaedc/pandas/util/_decorators.py#L455

fallen slateBOT Aug 28, 2023, 9:59 PM

#

pandas/util/_decorators.py line 455

```py
class Appender:
```

dusk comet Aug 28, 2023, 10:47 PM

#

Editor developers (VS Code and PyCharm) have shown some interest, while showing concerns about the verbosity of the proposal, although not about the implementation (which is what would affect them the most). And they have shown they would consider adding support for this if it were to become an official standard. In that case, they would only need to add support for rendering, as support for editing, which is normally non-existing for other standards, is already there, as they already support editing standard Python syntax.

What does it mean "support for rendering"? Editors already can render code

tacit hawk Aug 28, 2023, 10:48 PM

#

pliant tusk that code is replacing the first 8 bytes with `data`, which is only 4 bytes, ...

ah yes, so I need to align the length

buffer[:8] = data + b'\x00' * 4

I want to keep the buffer at the same length, but here this length alignment seems to be a waste

pliant tusk Aug 28, 2023, 10:54 PM

#

you can just write only the first 4 instead of the first 8 with the slice

pliant tusk Aug 28, 2023, 10:54 PM

#

tacit hawk ah yes, so I need to align the length ```py buffer[:8] = data + b'\x00' * 4 ``` ...

buffer[:4] = data

#

then you don't need to align it
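If the goal is a copy that can never change the buffer's length, closer to what `memcpy` does, one option is to assign through a `memoryview`. Slice assignment on a memoryview requires the source and destination to be the same size and raises `ValueError` otherwise. A minimal sketch:

```python
buffer = bytearray(16)
data = b"1234"

view = memoryview(buffer)
view[: len(data)] = data          # in-place copy, buffer length untouched

try:
    view[:8] = data               # 8-byte destination, 4-byte source
    mismatch = None
except ValueError as exc:
    mismatch = str(exc)
finally:
    view.release()                # release the export so the bytearray can be resized again

print(len(buffer), buffer, mismatch)
```

The mismatch error turns the silent shrink from the original snippet into an explicit failure.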

feral island Aug 28, 2023, 10:55 PM

#

dusk comet > Editor developers (VS Code and PyCharm) have shown some interest, while showin...

rendering parameter documentation when showing a tooltip for a function
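A rough sketch of what a tool would do to find that documentation at runtime. The `Doc` class here is a hypothetical stand-in defined locally for illustration, not the object proposed by the PEP, and `param_docs` is a made-up helper name; only `Annotated` and `get_type_hints` are real `typing` APIs.

```python
from dataclasses import dataclass
from typing import Annotated, get_type_hints


@dataclass(frozen=True)
class Doc:
    """Hypothetical stand-in for the documentation marker discussed above."""
    documentation: str


# Defined once, reusable in any signature that takes a display name.
Name = Annotated[str, Doc("The user's display name.")]


def greet(name: Name) -> str:
    return f"Hello, {name}"


def param_docs(func):
    """Collect Doc metadata from a function's annotations."""
    docs = {}
    for param, hint in get_type_hints(func, include_extras=True).items():
        for meta in getattr(hint, "__metadata__", ()):
            if isinstance(meta, Doc):
                docs[param] = meta.documentation
    return docs


print(param_docs(greet))
```

Reusing the `Name` alias across functions is the de-duplication idea mentioned earlier: the text lives with the type instead of being stitched into docstrings by decorators.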

dusk comet Aug 28, 2023, 10:56 PM

#

Ah, ok

peak spoke Aug 28, 2023, 10:56 PM

#

I thought of the annotated docstring a while back and the editor support has been the main thing that came to mind, the code would look a bit too busy imo with something longer on the docstring and if the editor couldn't collapse it

#

though the doc call looks a bit weird when everything else you'd see in annotations uses brackets instead of parentheses

tacit hawk Aug 28, 2023, 10:59 PM

#

pliant tusk then you don't need to align it

ah yes you are right haha, thanks

ripe tinsel Aug 29, 2023, 7:24 AM

#

I have a feature proposal to improve python as a language: an asynchronous/multithreaded for loop.

It occurred to me that the majority of the "for item in list" loops in the code I have optimised are processes that can run independently. This appears to be the general case with for loops, but not with while loops. Therefore, it would be convenient for a user to have a built-in keyword like mfor (multi-threaded for) or afor (async for). Alternative syntax could be something like "for item in list.async()". Await() and join() functions will be necessary for these loops.

This will be convenient for developers and beginners alike, and should allow users to speed up loops in many instances with minimal code.

#

Also, if the compiler notices that only mathematical operations are happening inside a loop, you can have the compiler send it to the GPU if CUDA is available

flat gazelle Aug 29, 2023, 7:30 AM

#

Unfortunately, due to the GIL, this is mostly useless. Unless the for loop is doing IO, splitting it across threads will do just about nothing, even if each iteration is independent.

#

This is something best left to the .map methods on threadpools etc.

urban sandal Aug 29, 2023, 8:49 AM

#

ripe tinsel Also, if the compiler notices that only mathematical operations are happening in...

This particular case is better served by existing libraries like Numba and CuPy with explicit opt-in in hot spots (where initial costs can be paid at import rather than at observation by the interpreter.) initial warmup for code like this can cause unexpected performance losses if it were to happen in a short lived application.

feral cedar Aug 29, 2023, 11:16 AM

#

there's also Executor in concurrent.futures
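A minimal sketch of the existing spelling with `concurrent.futures`: the loop body becomes a function and the loop becomes `Executor.map`. The `fetch` function and the sleep are placeholders for real I/O-bound work.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fetch(item):
    time.sleep(0.01)  # placeholder for an I/O-bound operation
    return item * 2


items = list(range(8))

# Equivalent to `results = [fetch(item) for item in items]`, but the calls
# are spread over worker threads. map() yields results in input order.
with ThreadPoolExecutor(max_workers=4) as pool:
    results = list(pool.map(fetch, items))

print(results)
```

Leaving the `with` block waits for all submitted work, which covers the "join" step the proposal asks for.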

dark umbra Aug 29, 2023, 11:32 AM

#

Does anyone know the book Beginning Python? How is it?

willow torrent Aug 29, 2023, 1:55 PM

#

can I count on you?

feral island Aug 29, 2023, 2:11 PM

#

flat gazelle Unfortunately, due to the GIL, this is mostly useless. Unless the for loop is do...

there will not be a GIL in the near future

static hinge Aug 29, 2023, 2:27 PM

#

Guido has come for your GIL

feral island Aug 29, 2023, 2:31 PM

#

Sam Gross rather

static hinge Aug 29, 2023, 2:32 PM

#

I don't think we have a sticker of him

umbral plume Aug 29, 2023, 2:33 PM

#

a mfor keyword for multi-threaded for loops still sounds like syntactical sugar for sending stuff to be run by a threadpool, which in turn means there'd have to be some trickery where the following indented block of code is secretly a function - it all sounds a little messy

static hinge Aug 29, 2023, 2:34 PM

#

it introduces a new scope

#

Didn't we just eliminate a scope for list comprehension?

#

one step forward, 2 steps back

feral island Aug 29, 2023, 2:35 PM

#

we also added one for type parameters 😄

#

and listcomps still have their own scope, it's mostly just an implementation change

static hinge Aug 29, 2023, 2:36 PM

#

I would suggest just giving all for loops their own scope, but that would absolutely break things

umbral plume Aug 29, 2023, 2:40 PM

#

if it had its own scope, with the same rules as classes or functions, that'd unfortunately break things as simple as
```py
count = 0
for i in range(10):
    if i % 2 == 0:
        count += 1
```
since count is now no longer a local variable within the loop

feral island Aug 29, 2023, 2:41 PM

#

umbral plume if it had its own scope, with the same rules as classes or functions, that'd unf...

nah it could be a nonlocal

#

the bigger problem is the data race
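Both points can be seen by writing the loop body as a nested function by hand, which is roughly what a hypothetical `mfor` would have to do behind the scenes. Without `nonlocal` the augmented assignment fails; with it the counter works when run sequentially, but the shared `count` is exactly what would race if the bodies ran on several threads.

```python
def count_evens_broken():
    count = 0

    def body(i):
        if i % 2 == 0:
            count += 1  # assignment makes `count` local to body, so the read fails

    for i in range(10):
        body(i)
    return count


def count_evens():
    count = 0

    def body(i):
        nonlocal count  # rebinding the enclosing variable must be declared
        if i % 2 == 0:
            count += 1

    for i in range(10):
        body(i)
    return count


try:
    count_evens_broken()
    broken_raised = False
except UnboundLocalError:
    broken_raised = True

print(broken_raised, count_evens())
```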

dusk comet Aug 29, 2023, 2:43 PM

#

maybe we should introduce "weak scopes":

if variable you are assigning to appears in surrounding scopes - use it
if not - it is a local variable

count = 0
for i in range(10):
    is_even = i % 2 == 0 # local
    if is_even:
        count += 1 # nonlocal
count # some value
is_even # error

umbral plume Aug 29, 2023, 2:50 PM

#

i can't quite find the words, but such scoping rules sound a little.. arbitrary, i dunno, since it kinda lulls you into a false sense of security of loops having their own scope, until you accidentally shadow a variable name from an outer scope

dusk comet Aug 29, 2023, 2:51 PM

#

agree

umbral plume Aug 29, 2023, 2:52 PM

#

i like the idea, but at that point we're approaching just bringing in a let or var keyword into python (though such an idea does sound super interesting!)

dusk comet Aug 29, 2023, 2:53 PM

#

there is already a thing that does the same thing as let/var: it is an annotation: x: T, it forces x to be a local variable

umbral plume Aug 29, 2023, 2:57 PM

#

```py
>>> stuff = [5,6,7,8]
>>> def foo():
...     stuff: list
...     stuff.append(9)
...
>>> foo()
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "<stdin>", line 3, in foo
UnboundLocalError: cannot access local variable 'stuff' where it is not associated with a value
```
TIL

dark umbra Aug 29, 2023, 3:50 PM

#

Hello, does anyone know the book Beginning Python?

urban sandal Aug 29, 2023, 4:46 PM

#

umbral plume i can't quite find the words, but such scoping rules sound a little.. arbitrary,...

and the walrus escapes, so you aren't even safe from edge cases by linting for shadowing of variables.

>>> [x for x in range(10)]
[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
>>> x
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
NameError: name 'x' is not defined
>>> [(x:=i) for i in range(10)]
[0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
>>> x
9

jade raven Aug 29, 2023, 6:45 PM

#

static hinge I don't think we have a sticker of him

I should email him and ask him if he’d be willing to pose for a sticker

static hinge Aug 29, 2023, 6:47 PM

#

umbral plume i like the idea, but at that point we're approaching just bringing in a `let` or...

local count similar to global count

sand goblet Aug 29, 2023, 7:25 PM

#

I added an API function for bytearray which resizes it to an exact size without overallocating:
https://pastebin.com/dy31smce (lines 78 and 265)
https://pastebin.com/6ZjU96Mx
https://pastebin.com/2risAhc8
Is this good enough to submit as a pull request?

#

There's no way to do this in the current API

random thistle Aug 30, 2023, 12:41 AM

#

Anybody else looking forward to the “nogil” project? It’s going to be quite complex to keep reference counting in a threadsafe way. I know there were some advocating moving to a pure garbage-collected model, like Java. But then even a simple script can end up swallowing all of memory, if it is left running long enough. And we would like to avoid that.

Also I know some people want to avoid the name “Python 4.0”, after all the pain of the 2.x→3.x transition. But I think the threadsafeness changes will have sufficient implications for backward (in)compatibility that calling the new version “4.0” would be a good idea. What do you think?

spark magnet Aug 30, 2023, 1:20 AM

#

random thistle Anybody else looking forward to the “nogil” project? It’s going to be quite comp...

the core devs are deadset on not needing to call it 4.0

raven ridge Aug 30, 2023, 1:24 AM

#

and to limit backwards incompatibility to things using the C API, as I understand. The intention is for nothing in the Python API to need to change.

feral island Aug 30, 2023, 1:27 AM

#

I really don't think it will be anything like Python 2/3. Pure Python code will not need to change to support nogil; by contrast, pretty much every Python program had to be changed to support Python 3. C extension code will more often require changes and that migration will take a lot of ecosystem effort, but most people aren't writing C extensions

raven ridge Aug 30, 2023, 1:28 AM

#

yeah. Extension module devs are a very important subset of the Python developer ecosystem, but they're a tiny portion - I'd wager that fewer than 1 in 1000 Python users ever interacts with the C API

random thistle Aug 30, 2023, 5:58 AM

#

feral island I really don't think it will be anything like Python 2/3. Pure Python code will ...

Some of it will. Because there will be some Python code that assumes that pure-Python code will never execute concurrently on multiple threads; that assumption will be broken by removal of the GIL.

worthy sandal Aug 30, 2023, 7:26 AM

#

Hi guys, i saw a programming question named knight’s sequence. What does it mean by a knight sequence. What is 10-key sequence of knight?

spark magnet Aug 30, 2023, 12:02 PM

#

random thistle Some of it will. Because there will be some Python code that assumes that pure-P...

I think that will be declared incorrect code that should have been using locks. Changes to dict iteration also broke some incorrect code.

quick snow Aug 30, 2023, 1:00 PM

#

I think the more interesting compatibility break is in the other direction: New code that is developed on nogil Python, and runs (but with terrible performance) on older versions. We have these breaks whenever a new Python version comes out, but usually it's easy to tell (because the code just won't work on older versions).

urban sandal Aug 30, 2023, 1:30 PM

#

feral island I really don't think it will be anything like Python 2/3. Pure Python code will ...

Some C extensions may get this handled for them automatically by using HPy.

raven ridge Aug 30, 2023, 5:40 PM

#

Things using Cython should get it for free, too.

#

Though there will still be separate gil/nogil ABI tags for a while, and that will need handling from each project

feral island Aug 30, 2023, 5:41 PM

#

random thistle Some of it will. Because there will be some Python code that assumes that pure-P...

That assumption is already wrong though. You can already use threading and threads can switch at any point.

#

It's going to be pretty hard in Python code to distinguish two threads running truly concurrently (with nogil) or switching at arbitrary points (current situation)

sand goblet Aug 30, 2023, 5:52 PM

#

So threading will make python faster like how it does in other languages?

feral island Aug 30, 2023, 5:52 PM

#

sand goblet So threading will make python faster like how it does in other languages?

Only if you add threads to your code.

#

And it will also make your code more prone to hard-to-debug race conditions

sand goblet Aug 30, 2023, 5:53 PM

#

Compared to multiprocessing?

raven ridge Aug 30, 2023, 5:54 PM

#

Yes, actually. Threads share more state than processes, so there's more ways to have data dependency bugs

urban sandal Aug 30, 2023, 6:18 PM

#

people shouldn't have anything but the minimal necessary shared state (which is often "nothing") when doing things concurrently, and should use the appropriate strategy for guarding that shared state based on the concurrency patterns in use. But what people "should" do is very far away from a lot of real world code.

feral island Aug 30, 2023, 6:20 PM

#

Yes, I guess the practical effect of nogil is that it will become a lot more important to make pure-Python code threadsafe, even if technically what you need to do to achieve thread-safety isn't terribly different from the current world
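As a concrete case of guarding shared state, here is the usual lock-protected counter. `counter += 1` is a read-modify-write that spans several bytecode instructions, so it needs the lock with or without a GIL. The names are illustrative.

```python
import threading

counter = 0
lock = threading.Lock()


def add(n):
    global counter
    for _ in range(n):
        # The lock makes the read, the increment and the write one unit.
        with lock:
            counter += 1


threads = [threading.Thread(target=add, args=(10_000,)) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(counter)
```

Code written this way should behave the same on a build without the GIL, which is the sense in which the required technique does not change.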

dusk comet Aug 30, 2023, 6:32 PM

#

feral island That assumption is already wrong though. You can already use threading and threa...

This is not true. Threads can switch only if the GIL is dropped, which can happen (usually) after executing any bytecode instruction => threads can't switch in the middle of an instruction

For example, consider the operation x[::] = y. Let's assume that x and y are huge lists. This operation happens in C with the GIL acquired, so no other thread can do anything => this operation is atomic in some sense.

But if there is no GIL, other threads can do different things to x and y while data is copied, which can result in broken data.

I can imagine a lot of code with these implicit assumptions about the atomicity of some operations. And some atomic operations (that happen in C code) will no longer be atomic under noGIL, which will break code

feral island Aug 30, 2023, 6:35 PM

#

dusk comet This is not true. Threads can switch only if GIL is dropped, which can happen (u...

You are generally correct that currently switching can happen in fewer points, but I'm not sure in practice that makes much of a difference. Under nogil the list would be locked during the execution of the slicing operation, so other code can't interfere with it.

raven ridge Aug 30, 2023, 7:04 PM

#

also switching only happening between bytecode instructions or when the GIL is explicitly released doesn't help as much as you might assume, because all sorts of operations that look atomic can cause a Python __del__ method to fire, which runs new bytecode and gives new places where a context switch can happen.

#

!e ```py
class C:
    def __del__(self):
        print(x)

x = {1: C(), 2: C()}
x.update({1: 1, 2: 2})
```

fallen slateBOT Aug 30, 2023, 7:04 PM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | {1: 1, 2: <__main__.C object at 0x7f75834eb2d0>}
002 | {1: 1, 2: 2}

raven ridge Aug 30, 2023, 7:04 PM

#

the __del__ there sees an intermediate state where x is half-updated.

raven ridge Aug 30, 2023, 7:22 PM

#

this is a persistent annoyance when writing C extensions: any Py_DECREF of an object of user-controlled type can cause a Python __del__ to run, and that __del__ could examine the state of your extension module or call into it (or more likely yield control to another thread that does), so the extension module dev needs to guarantee that their module and all of their objects are in a sane state that's OK to be exposed to users whenever they call Py_DECREF

random thistle Aug 30, 2023, 9:19 PM

#

feral island It's going to be pretty hard in Python code to distinguish two threads running t...

Currently the switch only happens when you give up the GIL. In the future, there will be no GIL.

Can you say “Heisenbug”?

feral island Aug 30, 2023, 9:20 PM

#

Python code doesn't explicitly give up the GIL though

urban sandal Aug 30, 2023, 10:34 PM

#

^. Any Python code that breaks because of this change would have to have been relying on extremely subtle internal behaviors that aren't even guaranteed to be consistent across implementations, or on native code that was being called holding the GIL for them.

sand goblet Aug 31, 2023, 12:23 AM

#

What does getting rid of it accomplish?

raven ridge Aug 31, 2023, 12:25 AM

#

by "it" you mean the GIL? Greater parallelism for multi-threaded code

sand goblet Aug 31, 2023, 12:32 AM

#

Will that do anything besides make multithreaded pure python faster? Because it seems like that wouldn't matter that much since it would still be slower than using Cython / C-extension.

raven ridge Aug 31, 2023, 12:33 AM

#

it will make multithreaded C extensions faster as well

#

C extensions currently need to acquire the GIL whenever they want to create a Python object, store a reference to a Python object, call a Python callable, allocate memory with the Python allocators, inspect an object using the Python C API, etc. There's many things that extension modules can do today without holding the GIL, but there's also many that they can't, and so the GIL forces some operations to be done in serial rather than in parallel

dusk comet Aug 31, 2023, 12:36 AM

#

Why is there no noGIL single-threaded Python build already?
Isn't that super simple? Just remove the GIL and any kinds of locks, forbid threading, and that's it. It is a way to get free performance in single-threaded apps

#

I heard that dropping and reacquiring GIL constantly is pretty slow

feral island Aug 31, 2023, 12:45 AM

#

dusk comet Why there is no noGIL single-threaded python build already? Isnt that super simp...

if "just remove gil" was simple we'd live in a different world

sand goblet Aug 31, 2023, 12:46 AM

#

You have to stop holding it manually, right? So in a single threaded program, you can just hold it the whole time?

feral island Aug 31, 2023, 12:46 AM

#

the GIL is very nice for single-threaded performance, because it means interpreter-internal data structures don't have to worry much about threading

#

that's why past attempts to remove the GIL tended to lead to huge performance regressions

feral island Aug 31, 2023, 12:46 AM

#

sand goblet You have to stop holding it manually, right? So in a single threaded program, yo...

in C code you release the GIL manually. In Python code you never interact with it directly

dusk comet Aug 31, 2023, 12:47 AM

#

feral island if "just remove gil" was simple we'd live in a different world

Just delete some lines from ceval.c, redefine some macros to expand to nothing...
I think this will work perfectly fine in single-threaded apps

sand goblet Aug 31, 2023, 12:47 AM

#

Shouldn't it not affect single threaded python programs?

#

Just never release it. Then there's no overhead for releasing/acquiring it.

dusk comet Aug 31, 2023, 12:49 AM

#

feral island in C code you release the GIL manually. In Python code you never interact with i...

Actually, you can drop GIL using ctypes, and then horrible things will happen

raven ridge Aug 31, 2023, 12:56 AM

#

dusk comet Why there is no noGIL single-threaded python build already? Isnt that super simp...

it actually used to work that way, up until (I think) Python 3.7 - the GIL used to be created on demand, the first time a thread was created (or maybe when something tried to acquire the GIL from a different thread)

#

https://docs.python.org/3/c-api/init.html#c.PyEval_InitThreads refers to this

raven ridge Aug 31, 2023, 12:58 AM

#

sand goblet Just never release it. Then there's no overhead for releasing/acquiring it.

releasing and reacquiring it isn't very expensive, honestly. Mutexes are pretty close to free when there's no contention.

sand goblet Aug 31, 2023, 1:09 AM

#

So basically, it starts making a difference when running multithreaded C code which has a lot of interactions with Python

#

Or when running multithreaded pure python.

raven ridge Aug 31, 2023, 1:29 AM

#

right, just in general: it allows multithreaded code to have greater parallelism

radiant garden Aug 31, 2023, 8:20 AM

#

dusk comet Just delete some lines from ceval.c, redefine some macros to expand to nothing.....

Best not make assumptions lest you rediscover the implications

rotund furnace Aug 31, 2023, 8:27 AM

#

hi

cyan raven Aug 31, 2023, 5:59 PM

#

What is the best way of proposing a PEP? I mean, I have to attach some code snippet that adds the stuff to the source code, or at least try to explain it.
Just create a patch? Or add the stuff on my local fork and link that directly?

steel solstice Aug 31, 2023, 6:03 PM

#

What are you proposing?

cyan raven Aug 31, 2023, 6:04 PM

#

steel solstice What are you proposing?

sorry, it's more like an extra function to asyncio, which was discussed before, and guido said he would sponsor it.

steel solstice Aug 31, 2023, 6:05 PM

#

Oh the cancel thing

cyan raven Aug 31, 2023, 6:06 PM

#

steel solstice Oh the cancel thing

yes

steel solstice Aug 31, 2023, 6:07 PM

#

Yeah so you probably need to double check with guido that he'll sponsor it, his email is guido@python.org I'm not entirely convinced it'd need a pep but I'd trust him

cyan raven Aug 31, 2023, 6:07 PM

#

steel solstice Yeah so you probably need to double check with guido that he'll sponsor it, his ...

oh okay thank you

celest garden Aug 31, 2023, 8:41 PM

#

Hi. I want to contribute to python, is this a good place to get started with that? Or is my best bet joining the mailing list and looking for a mentor? I'm a student and want to learn more about how python works "under the hood." I've read some of the documentation about contributing.

cyan raven Aug 31, 2023, 8:52 PM

#

celest garden Hi. I want to contribute to python, is this a good place to get started with tha...

Pick an issue on GitHub and check out the projects board, first, you can just send a pr or fix small issues

timber fossil Sep 1, 2023, 9:34 AM

#

(C++) How do i have multiple python interpreters in one process?

#

i am using 3.12

#

I want to have a plugin system, where you can add files to a folder and they all will independently execute

#

Can be in sequence or threaded

dusk comet Sep 1, 2023, 9:37 AM

#

Subinterpreters?

timber fossil Sep 1, 2023, 9:37 AM

#

that sounds like it

#

how do i use them?

dusk comet Sep 1, 2023, 9:38 AM

#

Read C-API about them

timber fossil Sep 1, 2023, 9:39 AM

#

https://docs.python.org/3.12/c-api/init.html#c.Py_NewInterpreterFromConfig

Python documentation

Initialization, Finalization, and Threads

See also Python Initialization Configuration. Before Python Initialization: In an application embedding Python, the Py_Initialize() function must be called before using any other Python/C API funct...

#

this ?

dusk comet Sep 1, 2023, 9:40 AM

#

Yes

timber fossil Sep 1, 2023, 9:40 AM

#

How do i append modules to it?

#

or are they global

dusk comet Sep 1, 2023, 9:40 AM

#

Modules have to be in sys.path in order to be imported

timber fossil Sep 1, 2023, 9:41 AM

#

well i am providing my own modules

#

and disallowing for any external c modules and such

#

no io no os ...

dusk comet Sep 1, 2023, 9:41 AM

#

I dont think that is possible

#

Consider not using python as scripting language

timber fossil Sep 1, 2023, 9:42 AM

#

I already modified the python source code

dusk comet Sep 1, 2023, 9:42 AM

#

timber fossil I already modified the python source code

Then remove all modules you dont want to be imported
Keep in mind that some modules cannot be removed

timber fossil Sep 1, 2023, 9:43 AM

#

is PyImport_AppendInittab global? as in: every interpreter has the same modules from it?

dusk comet Sep 1, 2023, 9:44 AM

#

IIRC, all subinterpreters have their own object, objects cannot be a part of two subinterpreter worlds

timber fossil Sep 1, 2023, 9:45 AM

#

what about c functions

#

same thing?

dusk comet Sep 1, 2023, 9:45 AM

#

There is a paragraph about this in the docs

timber fossil Sep 1, 2023, 9:46 AM

#

so can the subinterpreters call built-ins which i added?

#

if the threads will each be executing different files do i still have to worry about the GIL?

#

nothing will be shared between threads

vagrant musk Sep 1, 2023, 10:00 AM

#

who here is an expert in creating tools using python?

timber fossil Sep 1, 2023, 10:33 AM

#

do the threads end themselves after the script is done executing?

#

or do i need to manually check whether a thread has stopped execution

cyan raven Sep 1, 2023, 7:18 PM

#

steel solstice Yeah so you probably need to double check with guido that he'll sponsor it, his ...

well, he said that he can't help through the whole process, so I might need some help.

If you're proposing to write the PEP and asking me to mentor you through the process, alas, I don't have time for that. In any case, it seems the OP in that thread is no longer interested -- perhaps you can add your opinion to the thread?

spark magnet Sep 1, 2023, 7:19 PM

#

cyan raven well, he said that he can't help through the whole process, so I might need some...

this is "the cancel thing"? Is there a thread somewhere about it? Maybe it doesn't need a full PEP.

cyan raven Sep 1, 2023, 7:21 PM

#

spark magnet this is "the cancel thing"? Is there a thread somewhere about it? Maybe it does...

yes, you can find one here: https://discuss.python.org/t/asyncio-cancel-a-cancellation-utility-as-a-coroutine-this-time-with-feeling/26304

Discussions on Python.org

Asyncio.cancel() a cancellation utility as a coroutine [This time w...

I propose asyncio.cancel(task, *, msg=None) a novel utility for helping in cancelling tasks/futures. Why? I have seen too much code that calls task.cancel() and assumes immediate cancellation. With no await of the cancelled task or even a callback (yuck) The first thing one needs to do after calling cancel() on the task is to await the task....

#

it's not that hard, but it seems like everyone gave up working on it 😄

#

so it can be implemented in the task.py

spark magnet Sep 1, 2023, 7:24 PM

#

cyan raven yes, you can find one here: https://discuss.python.org/t/asyncio-cancel-a-cancel...

i see, Guido suggested a PEP. Hmm, that's trickier then.

#

and he commented twice in the last hour

cyan raven Sep 1, 2023, 7:25 PM

#

spark magnet i see, Guido suggested a PEP. Hmm, that's trickier then.

oh, I just refreshed the page.

#

I see

tacit hawk Sep 1, 2023, 8:02 PM

#

is the byte ordering of the bytes passed in concurrent calls to socket.sendall() preserved?

tacit hawk Sep 1, 2023, 8:41 PM

#

looks like it is not: https://github.com/python/cpython/blob/6f97eeec222f81bd7ae836c149872a40b079e2a6/Modules/socketmodule.c#L4384-L4449 sendall() is done by consecutive calls to send(), and this loop is not atomic

raven ridge Sep 1, 2023, 11:45 PM

#

tacit hawk is byte ordering of the bytes passed in concurrent calls to `socket.sendall()`pr...

having multiple threads access any mutable object without synchronization is a bug, unless that object is explicitly documented as being thread safe (like queue.Queue is)
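
A minimal sketch of the synchronization being described, applied to the sendall() question above: each thread takes a shared lock before sending, so one whole message is written before the next one starts. The names send_message and send_lock are hypothetical, and socket.socketpair() only stands in for a real connection.

```python
import socket
import threading

def send_message(sock, lock, payload):
    # Hold the lock for the whole sendall() call so that the underlying
    # send() loop of one thread cannot interleave with another thread's.
    with lock:
        sock.sendall(payload)

left, right = socket.socketpair()
send_lock = threading.Lock()  # hypothetical name: one lock per shared socket

# Four messages, each made of a single repeated letter (A, B, C, D).
messages = [bytes([65 + i]) * 50_000 for i in range(4)]

received = bytearray()

def drain():
    # Read until the sending side is closed.
    while chunk := right.recv(65536):
        received.extend(chunk)

reader = threading.Thread(target=drain)
reader.start()

senders = [
    threading.Thread(target=send_message, args=(left, send_lock, m))
    for m in messages
]
for t in senders:
    t.start()
for t in senders:
    t.join()

left.close()
reader.join()
right.close()

# If no message was interleaved with another, the stream consists of
# exactly one contiguous run per message.
contiguous_runs = 1 + sum(1 for a, b in zip(received, received[1:]) if a != b)
print(len(received), contiguous_runs)
```

The lock does not fix the order in which the messages arrive; it only keeps each message contiguous on the wire.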

quick snow Sep 2, 2023, 8:31 AM

#

raven ridge having multiple threads access any mutable object without synchronization is a b...

list.append isn't marked as thread safe (at least here). Are you telling me it isn't guaranteed to be?

unkempt rock Sep 2, 2023, 9:10 AM

#

jade raven Sep 2, 2023, 9:13 AM

#

quick snow list.append isn't marked as thread safe (at least [here](https://docs.python.org...

From past experience, it isn't thread safe

#

!d collections.deque

fallen slateBOT Sep 2, 2023, 9:14 AM

#

collections.deque


```py
class collections.deque([iterable[, maxlen]])
```
Returns a new deque object initialized left-to-right (using [`append()`](https://docs.python.org/3/library/collections.html#collections.deque.append)) with data from *iterable*. If *iterable* is not specified, the new deque is empty.

Deques are a generalization of stacks and queues (the name is pronounced “deck” and is short for “double-ended queue”). Deques support thread-safe, memory efficient appends and pops from either side of the deque with approximately the same O(1) performance in either direction.

Though [`list`](https://docs.python.org/3/library/stdtypes.html#list) objects support similar operations, they are optimized for fast fixed-length operations and incur O(n) memory movement costs for `pop(0)` and `insert(0, v)` operations which change both the size and position of the underlying data representation.

jade raven Sep 2, 2023, 9:14 AM

#

This was my solution

cyan raven Sep 2, 2023, 12:46 PM

#

What is the difference between status accepted and status final?

#

In pep

umbral plume Sep 2, 2023, 12:50 PM

#

IIRC, "accepted" means the PEP's been accepted and is being worked on, and "final" is for once said PEP has been added and, well, finalised

#

https://peps.python.org/pep-0001/#pep-review-resolution

Once a PEP has been accepted, the reference implementation must be completed. When the reference implementation is complete and incorporated into the main source code repository, the status will be changed to “Final”.

cyan raven Sep 2, 2023, 1:04 PM

#

umbral plume <https://peps.python.org/pep-0001/#pep-review-resolution> > Once a PEP has been ...

oh so it's not completed yet, but the idea seems good.

#

is there a pep about descriptors as well?

steel solstice Sep 2, 2023, 1:09 PM

#

https://peps.python.org/pep-0252/

PEP 252 – Making Types Look More Like Classes | peps.python.org

Python Enhancement Proposals (PEPs)

feral island Sep 2, 2023, 2:41 PM

#

umbral plume IIRC, "accepted" means the PEP's been accepted and is being worked on, and "fina...

in practice we're not very good at moving peps from accepted to final

cyan raven Sep 2, 2023, 3:43 PM

#

feral island in practice we're not very good at moving peps from accepted to final

what does that mean? if a pep is accepted it needs to be implemented so this is how it can go into final?

raven ridge Sep 2, 2023, 5:17 PM

#

quick snow list.append isn't marked as thread safe (at least [here](https://docs.python.org...

Yes. As an implementation detail of current versions of CPython it happens to be, but that's not a guarantee that the language, or even the implementation, makes

#

It would be perfectly correct for a future version of CPython to change list.append in a way where if it was called concurrently from different threads, only one of the two items winds up being added in the end, for instance

dusk comet Sep 2, 2023, 5:23 PM

#

that will break a lot of code

raven ridge Sep 2, 2023, 5:35 PM

#

Depending on the version of Python you're running, += for int isn't atomic.

```py
# test.py
from concurrent.futures import ThreadPoolExecutor

x = 0

def increment_x_n_times(n):
    global x
    for i in range(n):
        x += 1

with ThreadPoolExecutor(max_workers=10) as executor:
    for i in range(10):
        executor.submit(increment_x_n_times, 100_000)

print(x)
```

```shell-session
$ python3.9 test.py
360530
$ python3.9 test.py
655863
$ python3.11 test.py
1000000
$ python3.11 test.py
1000000
```

The fact that int += is atomic in some versions and not in others wasn't documented anywhere - if you look at the "what's new in Python 3.10" and "what's new in Python 3.11" pages, you won't find this mentioned anywhere - because it's an implementation detail that's subject to change.

raven ridge Sep 2, 2023, 6:00 PM

#

and because neither behavior was documented, it wouldn't be surprising if this changes back in some future version, or behaves differently in some other Python implementation
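
A sketch of a version of that counter that does not depend on the implementation detail, assuming an explicit threading.Lock is acceptable. LockedCounter and increment_n_times are hypothetical names, and the iteration count is reduced to keep the run short.

```python
import threading
from concurrent.futures import ThreadPoolExecutor

class LockedCounter:
    """Counter whose increments are serialized by an explicit lock."""

    def __init__(self):
        self._lock = threading.Lock()
        self.value = 0

    def increment(self):
        # The read-modify-write happens entirely while the lock is held.
        with self._lock:
            self.value += 1

def increment_n_times(counter, n):
    for _ in range(n):
        counter.increment()

counter = LockedCounter()
with ThreadPoolExecutor(max_workers=10) as executor:
    for _ in range(10):
        executor.submit(increment_n_times, counter, 10_000)

# Leaving the "with" block waits for all submitted work to finish.
print(counter.value)
```

With the lock in place, the total no longer depends on where the interpreter happens to switch threads.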

quick snow Sep 3, 2023, 6:56 AM

#

Interesting. We use list.append with threads in production code, but it's a list of errors that is (hopefully) only rarely appended to (and if an entry went missing, it didn't really matter), so I think we're fine.
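
For a shared list of errors like this, one option that is documented as thread safe is queue.Queue, mentioned earlier in the discussion. A minimal sketch, where work and errors are hypothetical names and the failing condition is made up for illustration:

```python
import queue
from concurrent.futures import ThreadPoolExecutor

errors = queue.Queue()  # documented as safe to use from multiple threads

def work(item):
    try:
        if item % 3 == 0:  # made-up failure condition
            raise ValueError(f"bad item {item}")
    except ValueError as exc:
        errors.put(str(exc))

with ThreadPoolExecutor(max_workers=4) as executor:
    # Consume the iterator so every task has finished before draining.
    list(executor.map(work, range(10)))

collected = []
while True:
    try:
        collected.append(errors.get_nowait())
    except queue.Empty:
        break

print(sorted(collected))
```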

frigid bison Sep 3, 2023, 10:52 AM

#

You use threading in production?

#

Threading has always been an unstable mess for me, what are you using to control the threads?

quick snow Sep 3, 2023, 10:58 AM

#

frigid bison Threading has always been an unstable mess for me, what are you using to control...

A ThreadPoolExecutor. The threads are pretty short-lived.

raven ridge Sep 3, 2023, 4:21 PM

#

"unstable"? that's an interesting take. I think threading is much less prone to subtle breakage than either multiprocessing or coroutine-based event loops like asyncio. multiprocessing is prone to subtle performance issues due to serializing data to send between processes as well as weird edge conditions (like, what happens if a process in the pool gets killed by the OOM killer while holding a multiprocessing lock?). coroutine-based event loops allow you to easily block the event loop and prevent parallelism without realizing you've done so

calm hawk Sep 4, 2023, 9:22 AM

#

/avatar @dull ferry

dull ferry Sep 4, 2023, 9:22 AM

#

w

#

gl finding it eheheh

swift sigil Sep 4, 2023, 6:17 PM

#

yes

next dagger Sep 5, 2023, 5:33 PM

#

dull ferry gl finding it eheheh

image compression creates unreadable strings. and your </> covers data

#

but offtopic so move to #ot0-psvm’s-eternal-disapproval

cyan raven Sep 5, 2023, 5:34 PM

#

is this a good way of testing this code?

    def test_task_cancel_and_await(self):
        # phase 1
        async def coro():
            t = self.new_task(self.loop, asyncio.sleep(1))
            await asyncio.cancel_and_await(t)
            self.assertTrue(t.cancelled())

        self.loop.run_until_complete(coro())

I haven't seen any specific section specialized for asyncio code testing(in cpython source code).
Should I just put it under the async BaseTask class?
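
For context, asyncio does not currently ship a cancel_and_await function; the test above assumes the proposed helper exists. A rough standalone sketch of what such a helper might look like follows. It is not the implementation from the linked thread, and it ignores the hard case where the caller itself is cancelled while waiting.

```python
import asyncio
import contextlib

async def cancel_and_await(task):
    # Hypothetical helper: request cancellation, then wait until the task
    # has actually finished unwinding.
    task.cancel()
    # Caveat: this also swallows a CancelledError aimed at the caller,
    # which a real implementation would have to handle.
    with contextlib.suppress(asyncio.CancelledError):
        await task

async def demo():
    task = asyncio.create_task(asyncio.sleep(3600))
    await asyncio.sleep(0)  # let the task start running
    await cancel_and_await(task)
    return task.cancelled()

result = asyncio.run(demo())
print(result)
```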

cyan raven Sep 5, 2023, 5:35 PM

#

next dagger image compression creates unreadable strings. and your </> covers data

it seems that some people still can't understand that this channel is for CPython internals; what we can do is aggressively tell them not to do it.

neat delta Sep 5, 2023, 5:36 PM

#

strictly speaking, internals of non-cpython python flavors might also fit here - they just don't usually (ever?) come up

raven ridge Sep 5, 2023, 5:38 PM

#

We've talked occasionally about pypy internals here

#

MicroPython and CircuitPython, too, actually

worthy sandal Sep 7, 2023, 1:24 PM

#

Tips and tricks on python programming, could be useful for beginners -- https://book.pythontips.com/en/latest/exceptions.html

alpine rose Sep 8, 2023, 2:16 AM

#

cpython gc question. for reasons, i have gc off.

#

this code https://github.com/redis/redis-py/blob/e3de026a90ef2cc35a5b68934029a0ef2a5b2f53/redis/connection.py#L515 seems to raise (and later handle) an exception. but because the exception is stored in a local, i think it's keeping everything i care about in my code alive.

fallen slateBOT Sep 8, 2023, 2:16 AM

#

redis/connection.py line 515

```py
if isinstance(response, ResponseError):
```

alpine rose Sep 8, 2023, 2:16 AM

#

are my only options: a) gc.collect, or b) in every frame that could be a parent of this delete any local that's costly to keep around?
is there an easy change i can make to redis to prevent the cycle?

dusk comet Sep 8, 2023, 2:41 AM

#

You can delete your exception instance; that will decref all the frames, and your locals will be deallocated
If you really need to store the exception, you can create a copy of it, like this: type(e)(*e.args) (I'm not sure).

#

And probably you can remove frames references from exception manually

alpine rose Sep 8, 2023, 3:12 AM

#

i don't have access to the exception instance. it's handled internally within redis. the only way i know of its existence is by following the gc.get_referrers to see what's keeping my locals alive
https://github.com/redis/redis-py/blob/e3de026a90ef2cc35a5b68934029a0ef2a5b2f53/redis/connection.py#L372

fallen slateBOT Sep 8, 2023, 3:12 AM

#

redis/connection.py line 372

```py
self.read_response()
```

alpine rose Sep 8, 2023, 3:13 AM

#

but there's something i don't understand, since i'm not able to make a minimal repro that exactly corresponds

feral island Sep 8, 2023, 3:13 AM

#

the function you linked is a little weird in that the exception is apparently returned from something it's calling?

alpine rose Sep 8, 2023, 3:15 AM

#

yeah, there's a bunch of exception returning. i believe (but am not 100%) that this is where the exception is created: https://github.com/redis/redis-py/blob/e3de026a90ef2cc35a5b68934029a0ef2a5b2f53/redis/_parsers/base.py#L88

fallen slateBOT Sep 8, 2023, 3:15 AM

#

redis/_parsers/base.py line 88

```py
return ResponseError(response)
```

alpine rose Sep 8, 2023, 3:15 AM

#

then returned again from https://github.com/redis/redis-py/blob/e3de026a90ef2cc35a5b68934029a0ef2a5b2f53/redis/_parsers/resp2.py#L43

fallen slateBOT Sep 8, 2023, 3:15 AM

#

redis/_parsers/resp2.py line 43

```py
return error
```

alpine rose Sep 8, 2023, 3:16 AM

#

then returned in another function, then actually raised at https://github.com/redis/redis-py/blob/e3de026a90ef2cc35a5b68934029a0ef2a5b2f53/redis/connection.py#L516

fallen slateBOT Sep 8, 2023, 3:16 AM

#

redis/connection.py line 516

```py
raise response
```

alpine rose Sep 8, 2023, 3:16 AM

#

then caught at https://github.com/redis/redis-py/blob/e3de026a90ef2cc35a5b68934029a0ef2a5b2f53/redis/connection.py#L373

fallen slateBOT Sep 8, 2023, 3:16 AM

#

redis/connection.py line 373

```py
except ResponseError:
```

feral island Sep 8, 2023, 3:23 AM

#

I tried this and now I'm wondering what this list could possibly be 😄

```
>>> import gc
>>> gc.disable()
>>> def inner():
...     e = Exception()
...     try:
...         raise e
...     except: pass
...
>>> def outer():
...     x = ["special string"]
...     inner()
...
>>> outer()
>>> print([x for x in gc.get_objects() if type(x) is list and "special_string" in x])
[[b'print', 'print', b'(', 'print', b'[', b'x', 'x', b'for', 'x', 'x', 'x', b'x', 'x', b'in', 'x', b'gc', 'gc', b'.', b'get_objects', 'get_objects', b'(', b')', b'if', 'gc', b'type', 'type', b'(', b'x', 'x', b')', 'x', 'x', 'x', b'is', 'type', b'list', 'list', b'and', 'list', b'"special_string"', 'special_string', b'in', b'x', 'x', b']', 'x', b')', 'x', 'x', b'', 'print', 'print', 'print', 'print', 'print']]
```

#

oh this actually works, I just used an underscore instead of a space

#

>>> gc.disable()
>>> def inner():
...     e = Exception()
...     try:
...             raise e
...     except: pass
... 
>>> def outer():
...     x = ["special string"]
...     inner()
... 
>>> outer()
>>> print([x for x in gc.get_objects() if type(x) is list and len(x) == 1 and "special string" in x])
[['special string']]

#

I think there is a reference cycle between the exception and the frame locals for inner

#

and that's leaving the frame and locals for outer alive

#

putting finally: del e in inner fixes it

alpine rose Sep 8, 2023, 3:27 AM

#

hmm i was doing something similar, but was doing print(gc.get_referrers(x)) in outer which ends up being empty

feral island Sep 8, 2023, 3:28 AM

#

does that just not work if gc is disabled?

#

no that's not it

raven ridge Sep 8, 2023, 3:37 AM

#

I think there is a reference cycle between the exception and the frame locals for inner
Right - the exception e has a reference to the most recent frame, which has a reference to that frame's locals, including e - so that's your reference cycle. And the reference to the most recent frame also holds a reference to the calling frame, so that's what's keeping x alive

#

So:
most recent frame -> locals() -> e -> most recent frame
most recent frame -> second most recent frame -> locals() -> x
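
The cycle described above, and the finally: del e fix, can be observed with a weak reference. This is a sketch that relies on CPython's reference counting and frame behaviour as discussed in this thread; Payload, inner_leaky, and inner_fixed are hypothetical names.

```python
import gc
import weakref

class Payload:
    """Hypothetical stand-in for a local that is costly to keep alive."""

def inner_leaky():
    e = Exception()
    try:
        raise e
    except Exception:
        pass
    # e is still a local here: e -> traceback -> this frame -> e

def inner_fixed():
    e = Exception()
    try:
        raise e
    except Exception:
        pass
    finally:
        del e  # drop the local so the frame no longer refers to e

def outer(inner):
    x = Payload()
    ref = weakref.ref(x)
    inner()
    return ref  # lets the caller check whether x was freed

gc.disable()
try:
    leaky_ref = outer(inner_leaky)
    alive_before_collect = leaky_ref() is not None
    gc.collect()  # the cycle collector is what finally frees x
    alive_after_collect = leaky_ref() is not None

    fixed_ref = outer(inner_fixed)
    alive_with_fix = fixed_ref() is not None
finally:
    gc.enable()

print(alive_before_collect, alive_after_collect, alive_with_fix)
```

With the cycle in place, x should only be freed once the collector runs; with the del, plain reference counting should be enough.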

feral island Sep 8, 2023, 3:40 AM

#

Yes. And I think gc.get_referrers doesn't work at that point because the reference is still owned by the frame, which doesn't participate in GC because the interpreter knows how to dispose of the reference

#

But after the function returns, the locals survive in a frame object. If I do gc.get_referrers() on the list afterwards, I see a reference from a frame object

#

[<frame at 0x101b55010, file '<stdin>', line 3, code outer>, [['special string']]]

alpine rose Sep 8, 2023, 3:40 AM

#

okay, thanks, the gc.get_referrers behaviour was what i didn't understand / confused me a little

#

i also don't know the answer to my original question: 1) is there anything i can do to resolve the cycle other than gc.collect(), 2) is there an easy change to redis that would remove the cycle? seems like i can't really weakref anything

feral island Sep 8, 2023, 3:42 AM

#

the change to redis would be to do try: raise response finally: del response

#

possibly you can fix it in user code by finding the cycle objects by trawling through gc.get_referents and then mutating something so that the cycle goes away? that would be very fragile though

alpine rose Sep 8, 2023, 5:27 AM

#

thank you!

cyan raven Sep 8, 2023, 3:24 PM

#

is it a good approach in CPython documentation to use links if I want to redirect the user to a category?
Or should I go with this:
:meth:`cancel() <asyncio.Task.cancel>`

so
:meth:`cancel() <asyncio.Task.cancel>`
vs

`cancelled <https://docs.python.org/3/library/asyncio-task.html?highlight=asyncio task#task-cancellation>`_

dusk comet Sep 8, 2023, 6:15 PM

#

```py
>>> from enum import Enum
>>> class X(Enum):
...   a = 1
...   b = 1.0
...
>>> X.a
<X.a: 1>
>>> X.b
<X.a: 1>
>>> X.a is X.b
True
```
is this the expected behaviour?

grave jolt Sep 8, 2023, 6:43 PM

#

dusk comet ```py >>> from enum import Enum >>> class X(Enum): ... a = 1 ... b = 1.0 ......

Hmm, the documentation is suspiciously vague about this

#

but I assume every item that compares equal to 1 will go as 1?..

#

I guess there's a hint here: https://docs.python.org/3/howto/enum.html#duplicatefreeenum that aliases are considered by equality

#

but that's an uh

#

stretch

grave jolt Sep 8, 2023, 6:53 PM

#

dusk comet ```py >>> from enum import Enum >>> class X(Enum): ... a = 1 ... b = 1.0 ......

Though why wouldn't this be expected behaviour?

dusk comet Sep 8, 2023, 6:53 PM

#

idk, i get why it is what it is
it is just a bit weird

#

and not explicitly documented

grave jolt Sep 8, 2023, 6:54 PM

#

dusk comet idk, i get why it is what it is it is just a bit weird

Would you expect it to explicitly differentiate values based on type?

#

Maybe a bit implementation-specific, but there's a thing called _value2member_map_, which is a dictionary

dusk comet Sep 8, 2023, 6:56 PM

#

>>> class X(IntEnum):
...  a = True
...  b = 1
...  c = 1.0
...
>>> X.a, X.b, X.c
(<X.a: 1>, <X.a: 1>, <X.a: 1>)
>>>
>>> class X(Enum):
...  a = True
...  b = 1
...  c = 1.0
...
>>> X.a, X.b, X.c
(<X.a: True>, <X.a: True>, <X.a: True>)

grave jolt Sep 8, 2023, 6:57 PM

#

well that's a certified bruh moment

dusk comet Sep 8, 2023, 6:58 PM

#

grave jolt Would you expect it to explicitly differentiate values based on type?

i was expecting that for a moment
but i dont think this feature would be very useful

dusk comet Sep 8, 2023, 6:58 PM

#

grave jolt well that's a certified bruh moment

IntEnum is a subclass of int, so it calls int(True), int(1), int(1.0) somewhere internally, and all of this becomes just 1

#

i think enums in python are overengineered

flat gazelle Sep 8, 2023, 7:00 PM

#

yeah, there is no real way for those values to be an integer subtype, due to the way the inheritance works out (which is an interesting limitation of inheritance I think may be worth expanding on)

eager ocean Sep 8, 2023, 7:01 PM

#

help me guys

dusk comet Sep 8, 2023, 7:01 PM

#

please open help thread #❓｜how-to-get-help

radiant garden Sep 8, 2023, 7:01 PM

#

Enum members can be arbitrary values

#

I think by default equal values are aliased too

#

but i agree that the standard enums are a bit odd

dusk comet Sep 8, 2023, 7:03 PM

#

>>> nan = float('nan')
>>>
>>> class X(Enum):
...   a = nan
...   b = nan
...
>>> X.a
<X.a: nan>
>>> X.b
<X.a: nan>
>>> X.a is X.b
True
>>>
>>> class X(Enum):
...   a = float('nan')
...   b = float('nan')
...
>>> X.a
<X.a: nan>
>>> X.b
<X.b: nan>
>>> X.a is X.b
False

#

there is a shortcut in most builtin collections: they first check for identity, and only then for equality

grave jolt Sep 8, 2023, 7:04 PM

#

ah yes it probably uses a dict for the lookups
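
A short sketch pulling the observations above together: members with equal values become aliases, enum.unique rejects such aliases, and the NaN results follow from containers checking identity before equality. Mixed and Strict are hypothetical names.

```python
from enum import Enum, unique

class Mixed(Enum):
    a = 1
    b = 1.0  # compares equal to 1, so b becomes an alias of a

print(Mixed.b is Mixed.a)
print(list(Mixed))              # iteration skips aliases
print(list(Mixed.__members__))  # __members__ still lists the alias name

# enum.unique turns aliases into an error at class-creation time.
try:
    @unique
    class Strict(Enum):
        a = 1
        b = 1.0
except ValueError as exc:
    duplicate_error = str(exc)
    print(duplicate_error)

# Containers check identity first, then equality, which is why the same
# nan object is "found" while a fresh nan is not.
nan = float("nan")
print(nan in [nan])
print(float("nan") in [nan])
```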

misty oxide Sep 8, 2023, 7:29 PM

#

Does anybody have a favorite library to use for cpython bytecode/CodeType assembly?

dusk comet Sep 8, 2023, 7:29 PM

#

dis + bytecode (from pypi)

#

!pypi bytecode

fallen slateBOT Sep 8, 2023, 7:29 PM

#

bytecode v0.15.0

Python module to generate and modify bytecode

misty oxide Sep 8, 2023, 7:29 PM

#

I'm at a point where I have a list of valid dis.Instructions, but I need to populate co_codestring

dusk comet Sep 8, 2023, 7:30 PM

#

you mean co_code? (which is a bytes array of opcodes)

misty oxide Sep 8, 2023, 7:30 PM

#

Ah, cool.

#

I'm going to doublecheck, but I think it legitimately was co_codestring very briefly in 3.10.

#

But yeah

#

co_code

#

It may just be the CodeType constructor that calls it that.

#

Thank ya, looks like bytecode was what I was looking for.

dusk comet Sep 8, 2023, 7:33 PM

#

misty oxide Ah, cool.

no, it always was co_code

#

even in python2

misty oxide Sep 8, 2023, 7:34 PM

#

Intellisense is a dirty liar

#

(3.10)

dusk comet Sep 8, 2023, 7:36 PM

#

hmm, it is indeed called __codestring in typeshed (the place where all typed signatures of stdlib live)

misty oxide Sep 8, 2023, 7:36 PM

#

Wack

feral island Sep 8, 2023, 7:36 PM

#

it's positional-only there though apparently

#

so have fun with that

misty oxide Sep 8, 2023, 7:36 PM

#

Very fun

#

We already have a big series of if/elif statements on version, so not that big a deal.

#

Perfectly willing to fill up a screen or two.

dusk comet Sep 8, 2023, 7:39 PM

#

there are other inconsistencies in naming and ordering in __init__ and replace

feral island Sep 8, 2023, 7:40 PM

#

dusk comet there are other inconsistencies in naming and ordering in `__init__` and `replac...

I'd accept a typeshed PR to make them match. It probably doesn't matter practically though because the names of the pos-only args to __init__ are ignored and the order of the kw-only arguments to replace doesn't matter
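
As a small illustration of the attribute name and of replace taking keyword arguments, here is a sketch using only the standard library (Python 3.8+ for CodeType.replace). The function add is a made-up example.

```python
import dis

def add(a, b):
    return a + b

code = add.__code__

# The raw bytecode lives in co_code, which is a bytes object.
print(type(code.co_code).__name__)

# dis can decode it into Instruction objects.
for instruction in dis.get_instructions(code):
    print(instruction.opname, instruction.argrepr)

# replace() takes keyword arguments named after the co_* attributes,
# so the positional parameter names of the constructor never come up.
renamed = code.replace(co_name="add_renamed")
print(renamed.co_name)
```

Going through replace avoids the version-specific positional order of the CodeType constructor when only a few fields need to change.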

misty oxide Sep 8, 2023, 7:44 PM

#

Are they supposed to match 1-1?

dusk comet Sep 8, 2023, 7:46 PM

#

no, but it would be nice

merry bramble Sep 8, 2023, 8:05 PM

#

dusk comet i think enums in python are overengineered

many people are saying this

swift imp Sep 8, 2023, 8:13 PM

#

merry bramble many people are saying this

In what way and why is that bad

feral island Sep 8, 2023, 8:15 PM

#

They change a lot from one version to another, which causes pain around releases and upgrades

grave jolt Sep 8, 2023, 8:23 PM

#

dusk comet i think enums in python are overengineered

Yes, absolutely

#

they are... way too extensible, or whatever

swift imp Sep 8, 2023, 8:23 PM

#

Gotcha. I think they're weird to be classes too. Like the way we define them, they're nothing like any other class def and it's difficult to tell what you can and cannot do

#

I also do not understand how enum.property works at all. Like this makes no sense to me

Note the property and the member must be defined in separate classes; for example, the value and name attributes are defined in the Enum class, and Enum subclasses can define members with the names value and name.

grave jolt Sep 8, 2023, 8:26 PM

#

This might sound crazy, but with typing being so popular, we could just use strings or whatever

#

Like
```py
ColorChannel = Literal["red", "green", "blue"]
```

#

Well, one thing it doesn't let you do is iterate over the members

#

actually

#

!e

from typing import get_args, Literal
ColorChannel = Literal["red", "green", "blue"]
print(get_args(ColorChannel))

fallen slateBOT Sep 8, 2023, 8:27 PM

#

@grave jolt :white_check_mark: Your 3.11 eval job has completed with return code 0.

('red', 'green', 'blue')

grave jolt Sep 8, 2023, 8:27 PM

#

boom

#

And if you want flags, you don't want flags, use a set instead

feral island Sep 8, 2023, 8:28 PM

#

and if you want this https://docs.python.org/3.12/howto/enum.html#enum-dataclass-support

Python documentation

Enum HOWTO

An Enum is a set of symbolic names bound to unique values. They are similar to global variables, but they offer a more useful repr(), grouping, type-safety, and a few other features. They are most ...

#

maybe you're doing it wrong

grave jolt Sep 8, 2023, 8:28 PM

#

I- ok that's strange

#

Well that page enumerates so many use cases

grave jolt Sep 8, 2023, 8:32 PM

#

grave jolt And if you want flags, you don't want flags, use a set instead

Flag = Literal["ignore_case", "multiline", "verbose"]

def re_search(pattern: str, haystack: str, flags: AbstractSet[Flag] = frozenset()) -> Match | None:
    ...

re_search("^[0-9]+  # yo are those digits??", "foo\n123\nbar", {"multiline", "verbose"})
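
The stub above can be filled in as a rough sketch. The `_RE_FLAGS` mapping and the body of `re_search` are assumptions made for illustration, not something posted in the chat; they simply translate the string flags into the stdlib's `re` flags.

```python
import re
from typing import AbstractSet, Literal

Flag = Literal["ignore_case", "multiline", "verbose"]

# Hypothetical mapping from the string flags to the stdlib's re flags.
_RE_FLAGS = {
    "ignore_case": re.IGNORECASE,
    "multiline": re.MULTILINE,
    "verbose": re.VERBOSE,
}

def re_search(
    pattern: str,
    haystack: str,
    flags: AbstractSet[Flag] = frozenset(),
) -> "re.Match[str] | None":
    combined = 0
    for flag in flags:
        combined |= _RE_FLAGS[flag]
    return re.search(pattern, haystack, combined)

match = re_search(
    "^[0-9]+  # yo are those digits??",
    "foo\n123\nbar",
    {"multiline", "verbose"},
)
```

A type checker can reject a misspelled flag name at the call site, which is the point of using `Literal` here.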

feral island Sep 8, 2023, 8:33 PM

#

swift imp I also do not understand how `enum.property` works at all. Like this makes no se...

I think the point is you define the property in an "abstract enum" class, a class that inherits from Enum but doesn't define members

#

then to define your actual enum, you inherit from that abstract class and then you can have members with the same name

dusk comet Sep 8, 2023, 8:34 PM

#

enums arent powerful enough
they cant represent values with two orthogonal properties
for example, my things are either red or green, and they can be round or rectangular
there is no way to represent that as enum conveniently

feral island Sep 8, 2023, 8:34 PM

#

!e

import enum

class Abs(enum.Enum):
    @property
    def prop(self):
        return 42

class E(Abs):
    prop = 3

print(E.prop.prop)

grave jolt Sep 8, 2023, 8:34 PM

#

dusk comet enums arent powerful enough they cant represent values with two orthogonal prope...

a pair of Color and Shape?..

#

which might be enums

fallen slateBOT Sep 8, 2023, 8:35 PM

#

@feral island :white_check_mark: Your 3.11 eval job has completed with return code 0.

feral island Sep 8, 2023, 8:35 PM

#

wait I didn't even use enum.property

grave jolt Sep 8, 2023, 8:35 PM

#

lmao

#

||bottom text||

#

Maybe eventually there will be enum2, which will be an adaptation of Rust enums. Which will also cover the trivial case

#

a.k.a. algebraic data type, union type, sum type, discriminated union, tagged union, variant record, sealed traits, disjoint union, variant, choice type, coproduct, disjoint coproduct, tagged variant, product dual, tagged product dual, discriminated coproduct, or intuitionistic logical disjunction under the Curry–Howard correspondence

dusk comet Sep 8, 2023, 8:39 PM

#

grave jolt a pair of `Color` and `Shape`?..

my use-case was a bit more complex
i had 5 file types:

```py
  1LangCache
2Lang  2Cache
HDLang HDCache
```
rows represent one property, columns - another
`1LangCache` has both column properties
i wanted to represent these 5 values as enum (or enumflag, i dont remember) in such a way, that i will be able to check if value has 1st or 2nd property, but was unable to do that

#

i wanted API that looks like this:
```py
e = MyEnum(...)
e.is_lang()
e.is_cache()
e.is_1()
e.is_2()
e.is_hd()
```

grave jolt Sep 8, 2023, 8:40 PM

#

grave jolt Maybe eventually there will be `enum2`, which will be an adaptation of Rust enum...

I mean, it's not hard to do. The hard part is convincing all the editors and type checkers to understand them

dusk comet Sep 8, 2023, 8:41 PM

#

dusk comet i wanted API that looks like this: ```py e = MyEnum(...) e.is_lang() e.is_cache(...

i could create 5 distinct enum values, and in each method perform manual check, but i didnt like that

grave jolt Sep 8, 2023, 8:41 PM

#

dusk comet my use-case was i bit more complex i had 5 file types: ```py 1LangCache 2Lang ...

I don't think I understand at all

dusk comet Sep 8, 2023, 8:42 PM

#

grave jolt I mean, it's not hard to do. The hard part is convincing all the editors and typ...

> The hard part is convincing all the editors and type checkers to understand them

just dont do weird trickery with them, and they will be able to infer everything from stubs

grave jolt Sep 8, 2023, 8:42 PM

#

Well, I can't do this without trickery

dusk comet Sep 8, 2023, 8:42 PM

#

grave jolt I don't think I understand at all

i suck at expressing my thoughts in english...

grave jolt Sep 8, 2023, 8:43 PM

#

and I suck at understanding thoughts in english

#

If you have two orthogonal properties, why can't you like, unite them into a tuple?

dusk comet Sep 8, 2023, 8:44 PM

#

grave jolt If you have two orthogonal properties, why can't you like, unite them into a tup...

because there is a thing that is red and green at the same time

#

i cant do (Color.RED & Color.GREEN, Shape.ROUND)

grave jolt Sep 8, 2023, 8:45 PM

#

({"red", "green"}, {"round"})

dusk comet Sep 8, 2023, 8:45 PM

#

then it is not enum!

#

so enums suck

radiant garden Sep 8, 2023, 8:45 PM

#

you want something that fits a pattern except for special cases and want it to be elegant to implement?

dusk comet Sep 8, 2023, 8:45 PM

#

dusk comet i cant do `(Color.RED & Color.GREEN, Shape.ROUND)`

or maybe i can with EnumFlag?

grave jolt Sep 8, 2023, 8:45 PM

#

({Color.RED, Color.GREEN}, {Shape.ROUND})

grave jolt Sep 8, 2023, 8:46 PM

#

dusk comet or maybe i can with `EnumFlag`?

yes you can, Color.Red | Color.Green
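
The same idea can be applied to the five file kinds described earlier. This is only a sketch: `FileKind` and its member names are made up for illustration, and nothing here stops someone from building a combination that is not one of the five valid ones.

```python
import enum

class FileKind(enum.Flag):
    # One axis (the rows in the layout described earlier).
    ONE = enum.auto()
    TWO = enum.auto()
    HD = enum.auto()
    # The other axis (the columns).
    LANG = enum.auto()
    CACHE = enum.auto()

    def is_lang(self) -> bool:
        return FileKind.LANG in self

    def is_cache(self) -> bool:
        return FileKind.CACHE in self

    def is_hd(self) -> bool:
        return FileKind.HD in self

# The five file types: one row bit OR-ed with one or two column bits.
ONE_LANG_CACHE = FileKind.ONE | FileKind.LANG | FileKind.CACHE
TWO_LANG = FileKind.TWO | FileKind.LANG
TWO_CACHE = FileKind.TWO | FileKind.CACHE
HD_LANG = FileKind.HD | FileKind.LANG
HD_CACHE = FileKind.HD | FileKind.CACHE
```

Each `is_*` method is a single membership test, so there is no per-value manual check.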

dusk comet Sep 8, 2023, 8:46 PM

#

radiant garden you want something that fits a pattern except for special cases and want it to b...

yes 😭

grave jolt Sep 8, 2023, 8:46 PM

#

dusk comet so enums suck

I thought you were complaining that enums are too complicated, no?

#

🙂

radiant garden Sep 8, 2023, 8:46 PM

#

i find myself just refactoring that away when possible shrug

grave jolt Sep 8, 2023, 8:47 PM

#

I just add ifs

#

makes my job veri secure

#

Like that file with a cyrillic с in the name instead of latin c.

#

True story.

#

Someone probably did break-dancing on their keyboard and accidentally inserted the wrong c into a file.

dusk comet Sep 8, 2023, 8:50 PM

#

grave jolt Well, I can't do this without trickery

why do you need trickery?
```py
class Color(enum2.Enum[int]): # can hold only ints
    red = 0
    green = 1
    blue = 2

Color.red # 0
Color(1) # 1 (new returns value of the same type)
0 in Color # True
```

grave jolt Sep 8, 2023, 8:50 PM

#

grave jolt Someone probably did break-dancing on their keyboard and accidentally inserted t...

Then they imported said file with auto-import, very convenient.

dusk comet Sep 8, 2023, 8:51 PM

#

grave jolt I thought you were complaining that enums are too complicated, no?

they are complicated, they are weak, and they suck
all at once 🙂

grave jolt Sep 8, 2023, 8:51 PM

#

grave jolt Then they imported said file with auto-import, very convenient.

Then someone "fixed" the filename when transferring the code to a different VCS/repo/whatever

#

but of course did not fix the import

#

and so when the code was attempted to build at the new system, there was a very nasty error message like No module named the_foo_and_с_things

#

It was there, just with a latin c

#

Now that's where VSCode's highlighting of suspicious characters would help. But everyone who has ever opened mixed cyrillic/latin text in VSCode just disables it

#

anyway...

grave jolt Sep 8, 2023, 8:53 PM

#

dusk comet why do you need trickery? ```py class Color(enum2.Enum[int]): # can hold only in...

I meant like Rust enums

dusk comet Sep 8, 2023, 8:54 PM

#

oh, that is indeed hard to do without magic

grave jolt Sep 8, 2023, 8:54 PM

#

...where each variant could hold some data, like
```rs
enum Color {
    Rgb { red: u8, green: u8, blue: u8 },
    Rgba { red: u8, green: u8, blue: u8, alpha: u8 },
    Variable { name: String },
}
```

#

I usually just make a union of dataclasses, but it's kinda verbose and might be a bit WTF-ish to the reader
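
For readers who have not seen the pattern, a union of dataclasses for the `Color` example might look roughly like this. The `to_css` helper is hypothetical and only there to show how the variants are told apart.

```python
from dataclasses import dataclass
from typing import Union

@dataclass(frozen=True)
class Rgb:
    red: int
    green: int
    blue: int

@dataclass(frozen=True)
class Rgba:
    red: int
    green: int
    blue: int
    alpha: int

@dataclass(frozen=True)
class Variable:
    name: str

# The "enum" is just a type alias over the variants.
Color = Union[Rgb, Rgba, Variable]

def to_css(color: Color) -> str:
    # isinstance checks narrow the union for type checkers.
    if isinstance(color, Rgb):
        return f"rgb({color.red}, {color.green}, {color.blue})"
    if isinstance(color, Rgba):
        return f"rgba({color.red}, {color.green}, {color.blue}, {color.alpha})"
    return f"var(--{color.name})"
```

The verbosity mentioned above is visible here: every variant is a full class definition.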

feral island Sep 8, 2023, 8:56 PM

#

ADTs are nice. I'm not sure they should be the same concept as enums though

#

I mostly use enums for things I want to make sure can go in an integer column in a database

grave jolt Sep 8, 2023, 8:56 PM

#

Yeah, I think Rust has some unfortunate naming here

#

what I meant was, if we had such a feature, it would technically also cover the trivial case of
```rs
enum Color { Red, Green, Blue }
```

radiant garden Sep 8, 2023, 8:59 PM

#

data goosedance

grave jolt Sep 8, 2023, 9:01 PM

#

feral island I mostly use enums for things I want to make sure can go in an integer column in...

Allow me to introduce you to https://www.postgresql.org/docs/current/datatype-enum.html

PostgreSQL Documentation

8.7. Enumerated Types

8.7. Enumerated Types 8.7.1. Declaration of Enumerated Types 8.7.2. Ordering 8.7.3. Type Safety 8.7.4. Implementation Details Enumerated (enum) types are data …

dusk comet Sep 8, 2023, 9:01 PM

#

grave jolt ...where each variant could hold some data, like ```rs enum Color { Rgb { re...

how would you store values inside enum members?
i can think of this:
```py
class Color(...):
    rgb: tuple[int, int, int]
    rgba: tuple[int, int, int, int]

c = Color(...)

# how to check if it is a rgb kind?
c.kind == Color.rgb

# how to get value from it?
c.value == (1,2,3) # ?

class IPAddrKind(...):
    V4, V6

k = IPAddrKind(...)
if k.kind == IPAddrKind.V4:
    print('v4 detected')
    k.value # what is this? None? AttributeError?
```

#

i think i dont quite understand rust enums right now

grave jolt Sep 8, 2023, 9:04 PM

#

dusk comet how would you store values inside enum members? i can think of this: ```py clas...

calling them enums was a mistake IMO, it is pretty confusing

#

something like
```py
class Color(ADT):
    @case
    def rgb(r: int, g: int, b: int): ...

    @case
    def rgba(r: int, g: int, b: int, alpha: int): ...

    @case
    def variable(name: str): ...

color1 = Color.rgb(255, 0, 255)
color2 = Color.variable("foo")
```

#

though this is still pretty verbose, idk

dusk comet Sep 8, 2023, 9:06 PM

#

can rust "enums" have methods?
if no, you can omit @case

grave jolt Sep 8, 2023, 9:06 PM

#

class Color(ADT):
    class Rgb(Case):
        r: int
        g: int
        b: int

    class Rgba(Case):
        r: int
        g: int
        b: int
        alpha: int

    class Var(Case):
        name: str

grave jolt Sep 8, 2023, 9:06 PM

#

dusk comet can rust "enums" have methods? if no, you can omit `@case`

it's complicated, but yes they can, but rust is a very different language

grave jolt Sep 8, 2023, 9:07 PM

#

grave jolt ```py class Color(ADT): class Rgb(Case): r: int g: int ...

I mean, it's pretty much as verbose as any current alternative 🤷

dusk comet Sep 8, 2023, 9:07 PM

#

how would this work at runtime?

#

Color.Rgb(1,2,3) - what is this object? is it an instance of Color or Rgb? Or both?
how to check if my thing is of kind Color.Rgb?

#

class IPAddrKind(ADT):
    class V4(Case): pass
    class V6(Case): pass

grave jolt Sep 8, 2023, 9:10 PM

#

dusk comet how would this work at runtime?

Well, it almost works now if you remove the inheritance. Only minor touches are needed, like:

- make it impossible to inherit from Color
- swap the bare classes so that Rgb, Rgba and Var all inherit from Color
- dataclassify the classes

#

So it would desugar to something like
```py
class Color:
    pass

@dataclass(frozen=True)
class __Rgb(Color):
    r: int
    g: int
    b: int

@dataclass(frozen=True)
class __Rgba(Color):
    r: int
    g: int
    b: int
    alpha: int

@dataclass(frozen=True)
class __Var(Color):
    name: str

Color.Rgb = __Rgb
Color.Rgba = __Rgba
Color.Var = __Var
prohibit_further_subclasses(Color)
```

dusk comet Sep 8, 2023, 9:12 PM

#

dusk comet ```py class IPAddrKind(ADT): class V4(Case): pass class V6(Case): pass ...

maybe cases with no fields should become singletons

grave jolt Sep 8, 2023, 9:13 PM

#

well, that's bikeshedding

dusk comet Sep 8, 2023, 9:13 PM

#

prohibit_further_subclasses is doable by patching cls.__flags__ (there is some flag for final classes, that is why you cant subclass NoneType or bool)
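
As a different route from patching `__flags__`, a pure-Python sketch of `prohibit_further_subclasses` can install an `__init_subclass__` hook once the cases exist. This is an assumption about how such a helper could be written, not the approach described above, and it also blocks subclassing the cases themselves because they inherit the hook.

```python
def prohibit_further_subclasses(cls):
    """Make any later attempt to subclass `cls` (or its cases) raise TypeError."""
    def __init_subclass__(subcls, **kwargs):
        raise TypeError(f"{cls.__name__} is sealed; cannot create {subcls.__name__}")
    # Outside a class body the implicit classmethod conversion does not happen,
    # so wrap the hook explicitly.
    cls.__init_subclass__ = classmethod(__init_subclass__)

class Color:
    pass

class Rgb(Color):   # defined before sealing, so this is allowed
    pass

prohibit_further_subclasses(Color)

try:
    class Hsl(Color):
        pass
except TypeError as exc:
    error = str(exc)
else:
    error = None
```

Unlike a real final flag, this hook can be removed again by anyone who reassigns `Color.__init_subclass__`.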

swift imp Sep 8, 2023, 9:13 PM

#

feral island !e ```import enum class Abs(enum.Enum): @property def prop(self): ...

That's a lot of boiler plate

dusk comet Sep 8, 2023, 9:13 PM

#

grave jolt So it would desugar to something like ```py class Color: pass @dataclass(fr...

i like that 👍

grave jolt Sep 8, 2023, 9:14 PM

#

This still requires some changes to type checkers so that they understand exhaustiveness of matching a Color against all 3 cases. I have no idea how easy or hard it is to implement

#

and the further question is: do we really need this? are ADTs common in Python code? wouldn't a union of dataclasses work just as well?

#

Though asking if ADTs are common in Python when the only way to have them is my butt-backwards union is perhaps not very fair. It's like asking why the city should build a bike path if nobody cycles on the highway

#

but maybe it's not that backwards

#

It is composing already existing constructs, and it's not totally clear what a new "official" construct would add

feral island Sep 8, 2023, 9:18 PM

#

grave jolt ```py class Color(ADT): class Rgb(Case): r: int g: int ...

I wrote this thing when I was overexcited about things you can do with __prepare__: https://github.com/JelleZijlstra/taxonomy/blob/master/taxonomy/db/models/name.py#L2278

fallen slateBOT Sep 8, 2023, 9:18 PM

#

taxonomy/db/models/name.py line 2278

```py
class NameTag(adt.ADT):
```

feral island Sep 8, 2023, 9:18 PM

#

it does work but is um not type-checker friendly

grave jolt Sep 8, 2023, 9:19 PM

#

ah I did something similar

#

i don't remember why but it required meta-metaclasses

#

it was many years ago...

#

oh no, I am old

#

😦

swift imp Sep 8, 2023, 11:13 PM

#

feral island I wrote this thing when I was overexcited about things you can do with `__prepar...

the history on __prepare__ was that it was introduced for enum.Enum right?

#

I'm reading pep 3115 again, and I noticed this bit. Does it mean you can use __prepare__ as an instance method and have different effects?

The __prepare__ method will most often be implemented as a class method rather than an instance method because it is called before the metaclass instance (i.e. the class itself) is created.
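
A short sketch of the usual classmethod form, for context. `NoRedefinition` and `Meta` are hypothetical names; the namespace object returned by `__prepare__` is what the class body writes into.

```python
class NoRedefinition(dict):
    """Class-body namespace that rejects a second assignment to the same name."""
    def __setitem__(self, key, value):
        if key in self:
            raise TypeError(f"{key!r} is defined twice")
        super().__setitem__(key, value)

class Meta(type):
    @classmethod
    def __prepare__(mcls, name, bases, **kwargs):
        # Runs before the class body executes; the class does not exist yet,
        # so there is no instance of the metaclass to bind a method to.
        return NoRedefinition()

    def __new__(mcls, name, bases, namespace, **kwargs):
        # Hand type.__new__ a plain dict copy of the collected namespace.
        return super().__new__(mcls, name, bases, dict(namespace), **kwargs)

class Fine(metaclass=Meta):
    x = 1

try:
    class Broken(metaclass=Meta):
        x = 1
        x = 2
except TypeError as exc:
    error = str(exc)
else:
    error = None
```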

merry bramble Sep 8, 2023, 11:39 PM

#

grave jolt i don't remember why but it required meta-metaclasses

okay I've done some crazy things with metaclasses in my time (we all went through that phase) but I've never needed a meta-metaclass

#

Colour me intrigued

grave jolt Sep 9, 2023, 1:49 PM

#

merry bramble Colour me intrigued

ok

#

the secret dies with me

cyan raven Sep 10, 2023, 6:44 PM

#

is it a bad idea to establish a new attribute for a task(asyncio) which describes the cancelled message? At the moment this is what I have found but this is strange and probably doing some unexpected stuff under the hood.

task._cancel_message  # users shouldnt use it

Basically, this attribute would have the same behaviour.

 except asyncio.CancelledError as e:
     print(e.args[0])
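
For reference, the supported way to see the message today is sketched below: pass `msg` to `cancel()` and read it from the exception inside the task. The `worker` and `main` coroutines are illustrative only.

```python
import asyncio

async def worker(seen):
    try:
        await asyncio.sleep(10)
    except asyncio.CancelledError as e:
        # The message passed to cancel() arrives as the exception's first arg.
        seen.append(e.args[0] if e.args else None)
        raise  # keep the cancellation going

async def main():
    seen = []
    task = asyncio.create_task(worker(seen))
    await asyncio.sleep(0)              # let the worker reach its await
    task.cancel(msg="shutting down")    # msg is accepted since Python 3.9
    try:
        await task
    except asyncio.CancelledError:
        pass
    return seen

messages = asyncio.run(main())
```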

cyan raven Sep 10, 2023, 7:13 PM

#

So we could implement a public attribute, something like this: -> task.cancel_message

grave jolt Sep 11, 2023, 8:12 PM

#

I just stumbled upon this thread https://discuss.python.org/t/traceback-showing-local-variable-values-at-call-site-hacking-frame-f-locals-frame-f-lineno-etc/21411
I wonder if this idea gets some traction or if it's mostly forgotten?

#

My personal issue is that if this becomes default, it could have unintended side effects

#

Suppose you have a web application that logs every traceback. There might be some information you really do not want to log, like a credit card number together with its holder name.
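
The concern is easy to reproduce with the opt-in support the stdlib already has, `capture_locals=True` on `traceback.TracebackException`. The `charge` and `handle_request` functions and the card number are made up for this sketch.

```python
import traceback

def charge(card_number):
    raise RuntimeError("declined")

def handle_request():
    try:
        charge("4111 1111 1111 1111")
    except RuntimeError as exc:
        # capture_locals=True stores the repr of every local in each frame.
        te = traceback.TracebackException.from_exception(exc, capture_locals=True)
        return "".join(te.format())

report = handle_request()
```

The formatted report contains the card number verbatim, which is exactly what a logging pipeline would then persist.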

misty oxide Sep 11, 2023, 11:17 PM

#

How can I unbind a cellvar?

#

def make_cell() -> CellType:
    unbound: None # Unbound cellvar.
    return (lambda: unbound).__closure__[0]

cell = make_cell()
cell.cell_contents = 5
del cell.cell_contents # Is this sufficient?
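
A quick check of what `del cell.cell_contents` does, using `types.CellType()` (Python 3.8+) to create an empty cell directly instead of the lambda trick. Reading an empty cell raises `ValueError`.

```python
import types

cell = types.CellType()      # a fresh, empty cell
cell.cell_contents = 5       # bind it
del cell.cell_contents       # unbind it again

try:
    cell.cell_contents
except ValueError:
    is_empty = True          # reading an empty cell raises ValueError
else:
    is_empty = False
```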

misty oxide Sep 12, 2023, 3:01 AM

#

Also, how can I set fn.__closure__ to a new tuple?
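
The chat does not answer this, so the following is only one possible approach: `__closure__` is a read-only attribute, but a new function object can be built around the same code with a different closure tuple. `make_getter` and `get` are illustrative names.

```python
import types

def make_getter():
    value = "original"
    def get():
        return value
    return get

get = make_getter()

# __closure__ itself is read-only, so build a new function object instead.
rebuilt = types.FunctionType(
    get.__code__,
    get.__globals__,
    get.__name__,
    get.__defaults__,
    (types.CellType("replaced"),),  # one cell per name in co_freevars
)
```

The closure tuple must have exactly as many cells as the code object has free variables.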

quiet crane Sep 12, 2023, 4:52 AM

#

Is it possible to use threads/multiprocessing in python? I'm surprised and disappointed of the poor quality assurance. Is all of python like this?

https://github.com/python/cpython/issues/105829#issuecomment-1714593169

GitHub

`concurrent.futures.ProcessPoolExecutor` pool deadlocks when submit...

Bug report Submitting many tasks to a concurrent.futures.ProcessPoolExecutor pool deadlocks with all three start methods. When running the same example with multiprocessing.pool.Pool we have NOT be...

hybrid relic Sep 12, 2023, 12:24 PM

#

heard that python's getting a JIT compiler in 3.13, has it landed yet? Looking up optimizations on cpython on google hasn't yielded much other than the initial announcement about the compiler

rose schooner Sep 12, 2023, 12:35 PM

#

hybrid relic heard that python's getting a JIT compiler in 3.13, has it landed yet? Looking u...

there's now a tier 2 optimization thing in cpython + branch prediction but not much about converting to native machine code

frigid bison Sep 12, 2023, 2:49 PM

#

rose schooner there's now a tier 2 optimization thing in cpython + branch prediction but not m...

What does tier 2 include?

jade raven Sep 12, 2023, 6:32 PM

#

quiet crane Is it possible to use threads/multiprocessing in python? I'm surprised and disap...

> Is it possible to use threads/multiprocessing in python?

#

yes it is

#

> I'm surprised and disappointed of the poor quality assurance. Is all of python like this?

bugs are often fixed when people report them, so if people don't report them or the issue isn't noticed by the core devs, then it might not be fixed promptly. it's the same in any other OSS language

paper echo Sep 13, 2023, 4:00 AM

#

quiet crane Is it possible to use threads/multiprocessing in python? I'm surprised and disap...

Of course it's possible, many many applications are built this way. And all software has bugs

paper echo Sep 13, 2023, 4:01 AM

#

quiet crane Is it possible to use threads/multiprocessing in python? I'm surprised and disap...

oh i see this is your own open issue

#

consider paying a consultant to fix the bug for you

formal wyvern Sep 13, 2023, 10:23 AM

#

wsg

#

im having a problem in the most basic thing

#

i dont want to take help from the forums

#

can anyone of yall help me?

#

dm me if you can help

grave jolt Sep 13, 2023, 11:16 AM

#

formal wyvern i dont want to take help from the forums

If you have a question, you should see #❓｜how-to-get-help and make a help post

formal wyvern Sep 13, 2023, 12:10 PM

#

grave jolt If you have a question, you should see <#704250143020417084> and make a help pos...

That's what I don't want to do

grave jolt Sep 13, 2023, 12:17 PM

#

formal wyvern That's what I don't want to do

why?

formal wyvern Sep 13, 2023, 12:22 PM

#

It's a dumb problem

grave jolt Sep 13, 2023, 12:24 PM

#

formal wyvern It's a dumb problem

Help posts are fine for any kind of question. You will get help much quicker if you open a help post. Very few people are willing to help via DMs, especially when you haven't described your problem

formal wyvern Sep 13, 2023, 12:25 PM

#

Sure

unkempt rock Sep 13, 2023, 10:17 PM

#

Bros the pyi trusted publishers docs have this link to octo-org/sample-project on GitHub which doesn’t exist and I’m crying

spark magnet Sep 13, 2023, 10:30 PM

#

unkempt rock Bros the pyi trusted publishers docs have this link to octo-org/sample-project o...

what's the link exactly?

unkempt rock Sep 13, 2023, 10:42 PM

#

https://github.com/octo-org/sampleproject
https://docs.pypi.org/trusted-publishers/creating-a-project-through-oidc/

#

I’m not quite sure if pypi warehouse wants the GitHub issue because it’s not a pypi bug.. I got legacy upload for me personally just now

mild moss Sep 14, 2023, 3:13 AM

#

unkempt rock I’m not quite sure if pypi warehouse wants the GitHub issue because it’s not a p...

https://docs.github.com/en/search?query=octo-org
github docs have a lot of fake links for octo-org too, they're merely serving as placeholders like how you might write https://example.com/ in place of a real website

unkempt rock Sep 14, 2023, 3:35 AM

#

Ah I see, that’s too bad the release.yml thing seemed cool

clear kindle Sep 14, 2023, 10:12 AM

#

is there a clean way to use logging lib to output to a different file every log? basically RotatingFileHandler but not based on time or size

dusk comet Sep 14, 2023, 10:17 AM

#

You can reimplement RotatingFileHandler with small changes
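
As an alternative to adapting `RotatingFileHandler`, a simpler sketch subclasses `logging.Handler` directly and opens a new file for every record. `FilePerRecordHandler`, its file-naming scheme, and the logger name are all assumptions made for illustration.

```python
import itertools
import logging
import os
import tempfile

class FilePerRecordHandler(logging.Handler):
    """Hypothetical handler that writes every record to its own numbered file."""

    def __init__(self, directory, prefix="record"):
        super().__init__()
        self.directory = directory
        self.prefix = prefix
        self._counter = itertools.count(1)

    def emit(self, record):
        try:
            name = f"{self.prefix}-{next(self._counter):04d}.log"
            path = os.path.join(self.directory, name)
            with open(path, "w", encoding="utf-8") as f:
                f.write(self.format(record) + "\n")
        except Exception:
            self.handleError(record)

# Usage: two log calls produce two separate files.
with tempfile.TemporaryDirectory() as directory:
    logger = logging.getLogger("file_per_record_demo")
    logger.setLevel(logging.INFO)
    logger.propagate = False
    handler = FilePerRecordHandler(directory)
    logger.addHandler(handler)
    logger.info("first")
    logger.info("second")
    logger.removeHandler(handler)
    created = sorted(os.listdir(directory))
```

The file name could just as well come from a field on the record; the counter is only the simplest choice.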

quiet crane Sep 14, 2023, 7:25 PM

#

jade raven > I'm surprised and disappointed of the poor quality assurance. Is all of pytho...

there was no stress test at all 😳

quiet crane Sep 14, 2023, 7:26 PM

#

paper echo consider paying a consultant to fix the bug for you

We have already contributed a fix. I'm just surprised the level of testing was so low.

raven ridge Sep 14, 2023, 8:29 PM

#

quiet crane there was no stress test at all 😳

it has tens of thousands of users who use it in production, and no one else reported the bug you encountered in the many years since it was introduced. I think you overestimate how common the conditions required to trigger the bug are. If this is your first time finding a bug in a language's standard library, welcome to the club! All software has bugs, and I think it's quite sad that your attitude here is "how could a bug make it into the standard library?", rather than focusing on the good things - that there's an immediate workaround (using multiprocessing.pool instead), that there's an easy patch to apply (just affecting Python code, which you can patch without needing to recompile anything), that there's no security implications, etc, etc

#

You're acting as though this is a major issue, when in fact it's quite minor, as far as issues in a language's internals go. At least there's no CVE attached, heh

#

also, if you're disappointed by the level of testing that some particular module receives, note that the test suite is open source and you're welcome to contribute enhancements. Trying to assign blame to people who worked on this module in the past seems much less productive than trying to improve its quality going forward.

#

Though of course, note that race conditions are by definition non-deterministic, and are just about the hardest type of bug for a test suite to detect.

quiet crane Sep 15, 2023, 12:11 AM

#

I don't mean to put the blame on a person. I'm just bummed I got bit by a bug and needed to ventilate my frustrations.

quiet crane Sep 15, 2023, 12:14 AM

#

raven ridge Though of course, note that race conditions are by definition non-deterministic,...

This is why a multithreading library should have a lot of testing. Random testing or stress testing. Anyway this bug seems like it will be resolved soon 😊👍

raven ridge Sep 15, 2023, 12:15 AM

#

quiet crane I don't mean to put the blame on a person. I'm just bummed I got bit by a bug an...

I think you ought to reconsider the way that you do that. If you look at the responses you got from people, it's very evident that people found your chosen method of venting to be rude.

quiet crane Sep 15, 2023, 12:37 AM

#

Yes, I'll be more mindful of my approach and tone. Sorry 🙏

hybrid relic Sep 15, 2023, 2:59 AM

#

frigid bison What does tier 2 include?

guess we'll never know

little bloom Sep 15, 2023, 4:24 AM

#

Hey Everyone
I'm starting a course on udemy named 'The complete 2023 web development boot camp'.
I am searching for a buddy to join me on the course.
Let's learn together and help each other.
Please drop me a message if anybody's interested. Thank you!

jade raven Sep 15, 2023, 5:14 AM

#

@hybrid relic @frigid bison https://docs.google.com/presentation/d/1_cvQUwO2WWsaySyCmIy9nj9by4JKnkbiPCqtluLP3Mg

Google Docs

Tiers of Execution

Tiers of Execution Making CPython execute efficiently

hybrid relic Sep 15, 2023, 8:59 AM

#

In the JVM ... and other adaptive VMs, switching between tiers can be expensive

#

surprisingly accurate haha

dusk comet Sep 15, 2023, 9:38 AM

#

Where are all CPython branches? I dont see them on github, i see only branches for major versions...

rose schooner Sep 15, 2023, 9:39 AM

#

dusk comet Where are all CPython branches? I dont see them on github, i see only branches f...

wdym "all cpython branches"?

dusk comet Sep 15, 2023, 9:40 AM

#

Im pretty sure in the past there were a lot of different branches

#

Maybe im wrong

steel solstice Sep 15, 2023, 9:41 AM

#

They could've just been deleted

burnt rose Sep 15, 2023, 9:41 AM

#

hey

steel solstice Sep 15, 2023, 9:41 AM

#

I do remember there being a few branches that were feature related

burnt rose Sep 15, 2023, 9:41 AM

#

is there someone i could talk?

rose schooner Sep 15, 2023, 9:41 AM

#

dusk comet Im pretty sure in the past there were a lot of different branches

yea actually

#

they're still there but they don't show up for some reason

#

oh

#

i think they're tags now

worthy salmon Sep 15, 2023, 11:18 AM

#

hey

#

i am learning python but sometime can't do a simple problem if it is new

#

can you please help me

#

and how to make coding notes? if anyone knows, please give me suggestions

cyan raven Sep 15, 2023, 2:31 PM

#

are there any peps about __call__ and __new__?

spark magnet Sep 15, 2023, 2:34 PM

#

cyan raven are there any peps about `__call__` and `__new__`?

those probably predated PEPs. Not everything in the language is covered by a PEP. What information are you looking for?

cyan raven Sep 15, 2023, 3:01 PM

#

spark magnet those probably predated PEPs. Not everything in the language is covered by a PE...

well, I just want to learn a bunch of low-level stuff about them.

merry bramble Sep 15, 2023, 3:13 PM

#

cyan raven well, I just want to learn a bunch of low-level stuff about them.

The data model docs are often quite good for this kind of thing. They don't have much on __call__, but they have information on __new__ and on metaclasses:

Python documentation

3. Data model

Objects, values and types: Objects are Python’s abstraction for data. All data in a Python program is represented by objects or by relations between objects. (In a sense, and in conformance to Von ...

cyan raven Sep 15, 2023, 3:16 PM

#

merry bramble The data model docs are often quite good for this kind of thing. They don't have...

Thank you!

merry bramble Sep 15, 2023, 3:32 PM

#

Often you can learn a lot just by playing around with code, as well. Here's a little script to illustrate the order in which various methods are called when classes are created and called:

class Meta(type):
    def __new__(mcls, name, *args, **kwargs):
        print(f'{name}: entering metaclass __new__')
        cls = super().__new__(mcls, name, *args, **kwargs)
        print(f'{name}: exiting metaclass __new__')
        return cls

    def __init__(cls, *args, **kwargs):
        print(f'{cls.__name__}: entering metaclass __init__')
        super().__init__(*args, **kwargs)
        print(f'{cls.__name__}: exiting metaclass __init__')

    def __call__(cls, *args, **kwargs):
        print(f'{cls.__name__}: entering metaclass __call__')
        new = super().__call__(*args, **kwargs)
        print(f'{cls.__name__}: exiting metaclass __call__')
        return new


class Klass(metaclass=Meta):
    def __init_subclass__(cls, *args, **kwargs):
        print(f'{cls.__name__}: entering class __init_subclass__')
        super().__init_subclass__(*args, **kwargs)
        print(f'{cls.__name__}: exiting class __init_subclass__')

    def __new__(cls, *args, **kwargs):
        print(f'{cls.__name__}: entering class __new__')
        obj = super().__new__(cls, *args, **kwargs)
        print(f'{cls.__name__}: exiting class __new__')
        return obj

    def __init__(self, *args, **kwargs):
        print(f'{self.__class__.__name__}: entering class __init__')
        super().__init__(*args, **kwargs)
        print(f'{self.__class__.__name__}: exiting class __init__')

    def __call__(self, *args, **kwargs):
        print(f'{self.__class__.__name__}: entering class __call__')
        print(f'{self.__class__.__name__}: exiting class __call__')
        return 42


class SubKlass(Klass): pass

obj = Klass()
obj()

#

idk how to do the bot command thing but here's the output:

Klass: entering metaclass __new__
Klass: exiting metaclass __new__
Klass: entering metaclass __init__
Klass: exiting metaclass __init__
SubKlass: entering metaclass __new__
SubKlass: entering class __init_subclass__
SubKlass: exiting class __init_subclass__
SubKlass: exiting metaclass __new__
SubKlass: entering metaclass __init__
SubKlass: exiting metaclass __init__
Klass: entering metaclass __call__
Klass: entering class __new__
Klass: exiting class __new__
Klass: entering class __init__
Klass: exiting class __init__
Klass: exiting metaclass __call__
Klass: entering class __call__
Klass: exiting class __call__

cyan raven Sep 15, 2023, 3:46 PM

#

!e

class Meta(type):
    def __new__(mcls, name, *args, **kwargs):
        print(f'{name}: entering metaclass __new__')
        cls = super().__new__(mcls, name, *args, **kwargs)
        print(f'{name}: exiting metaclass __new__')
        return cls

    def __init__(cls, *args, **kwargs):
        print(f'{cls.__name__}: entering metaclass __init__')
        super().__init__(*args, **kwargs)
        print(f'{cls.__name__}: exiting metaclass __init__')

    def __call__(cls, *args, **kwargs):
        print(f'{cls.__name__}: entering metaclass __call__')
        new = super().__call__(*args, **kwargs)
        print(f'{cls.__name__}: exiting metaclass __call__')
        return new


class Klass(metaclass=Meta):
    def __init_subclass__(cls, *args, **kwargs):
        print(f'{cls.__name__}: entering class __init_subclass__')
        super().__init_subclass__(*args, **kwargs)
        print(f'{cls.__name__}: exiting class __init_subclass__')

    def __new__(cls, *args, **kwargs):
        print(f'{cls.__name__}: entering class __new__')
        obj = super().__new__(cls, *args, **kwargs)
        print(f'{cls.__name__}: exiting class __new__')
        return obj

    def __init__(self, *args, **kwargs):
        print(f'{self.__class__.__name__}: entering class __init__')
        super().__init__(*args, **kwargs)
        print(f'{self.__class__.__name__}: exiting class __init__')

    def __call__(self, *args, **kwargs):
        print(f'{self.__class__.__name__}: entering class __call__')
        print(f'{self.__class__.__name__}: exiting class __call__')
        return 42


class SubKlass(Klass): pass

obj = Klass()
obj()

fallen slateBOT Sep 15, 2023, 3:46 PM

#

@cyan raven :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | Klass: entering metaclass __new__
002 | Klass: exiting metaclass __new__
003 | Klass: entering metaclass __init__
004 | Klass: exiting metaclass __init__
005 | SubKlass: entering metaclass __new__
006 | SubKlass: entering class __init_subclass__
007 | SubKlass: exiting class __init_subclass__
008 | SubKlass: exiting metaclass __new__
009 | SubKlass: entering metaclass __init__
010 | SubKlass: exiting metaclass __init__
011 | Klass: entering metaclass __call__
... (truncated - too many lines)

Full output: https://paste.pythondiscord.com/RMNJCU4Y64IGDAQTEAKUBR2THE

worthy salmon Sep 15, 2023, 4:44 PM

#

can we make

cyan raven Sep 15, 2023, 6:46 PM

#

merry bramble Often you can learn a lot just by playing around with code, as well. Here's a li...

thank you btw how hard is it to understand the genobject.c file? I mean I'm not a professional in C, but I'd love to see what's going on in the background.

feral island Sep 15, 2023, 6:47 PM

#

cyan raven thank you btw how hard is it to understand the `genobject.c` file? I mean Im not...

that file covers coroutines and async generators, which I hear are pretty complicated under the hood

#

I have never had to touch that file myself though

#

Mark Shannon described async generators as a Jenga tower of state machines

flat gazelle Sep 15, 2023, 6:49 PM

#

That does check out

naive saddle Sep 15, 2023, 6:49 PM

#

aren't coroutines mostly just a generator under the hood though? with extra state to make the "generator" awaitable

#

or is that assumption outdated nowadays?

flat gazelle Sep 15, 2023, 6:50 PM

#

If you stare at the file enough it will start to make sense, C or no C knowledge

#

A coroutine is internally very similar to a generator, though they differ in one field each IIRC.

#

Most C functions that work on generators also work on coroutines

cyan raven Sep 15, 2023, 7:11 PM

#

feral island that file covers coroutines and async generators, which I hear are pretty compli...

huh, okay. I'm scared.

random thistle Sep 15, 2023, 8:55 PM

#

naive saddle aren't coroutines mostly just a generator under the hood though? with extra stat...

I would say, generators were initially introduced as a cut-down form of coroutine. The asyncio module was first introduced in 3.4, and implemented in what I thought was a really horrible way, entirely based on generators. Thankfully, the language designers realized that there was a need for proper coroutines, and so async/await was added in 3.5, and asyncio reworked to use it.

I did this diagram I call “Van Rossum’s Triangle” https://www.deviantart.com/default-cube/art/Van-Rossum-s-Triangle-679791228 which tries to illustrate the ways that control gets transferred between generators, coroutines and regular “mainline” code.


wary fern Sep 16, 2023, 12:24 AM

#

Hey folks, is there a way to mark a test as 'not multiprocess safe' when run via test.regrtest?

jaunty steeple Sep 17, 2023, 8:21 AM

#

A function to kill a thread would be nice, similar to multiprocessing.Process.terminate()

urban sandal Sep 17, 2023, 8:27 AM

#

generally speaking, you really really don't want to be trying to kill threads. design your threads such that you can signal to them to end what they are doing if you need that.

#

and that isn't python specific

verbal escarp Sep 17, 2023, 8:29 AM

#

i was just re-thinking my approach in https://github.com/Python-Fuzzylogic/fuzzylogic/blob/master/src/fuzzylogic/functions.py to initialize functions and pre-computing things, then return an inner function for work. i realized it's basically the factory pattern for functions


#

however there's the issue of applying numba to speed those up, so now i was wondering how to keep the code in an un-executed state until they are officially initialized

#

i was contemplating .py modules but only compile them to ast, then apply numba to that instead of importing things directly

#

keeping the inner functions as strings is a big no-no because that would negate any help from IDEs etc

#

so, any idea how to have python code parseable but not immediately executed at runtime?

jaunty steeple Sep 17, 2023, 9:11 AM

#

urban sandal generally speaking, you really **really** don't want to be trying to kill thread...

I tried implementing signal to gracefully shutdown but gunicorn suppresses signals and kills the process hard

paper echo Sep 18, 2023, 12:37 AM

#

urban sandal generally speaking, you really **really** don't want to be trying to kill thread...

tbh i think this is a bit of FUD, most programs don't do things with external resources that would be unsafe to terminate suddenly

#

dealing with graceful shutdown can get hard very quickly in even trivial situations

urban sandal Sep 18, 2023, 12:45 AM

#

paper echo tbh i think this is a bit of FUD, *most* programs don't do things with external ...

I'd rather give someone generally true advice that errs on the side of them doing it correctly while steering them to the right means of doing it, than add something that's more likely than not, a footgun for someone else down the line. This isn't a new problem, it isn't python specific, it's a question that comes up somewhat regularly across languages from people who usually don't have the experience about os threads and concurrency to know when it is or isn't "safe".

spark magnet Sep 18, 2023, 1:14 AM

#

the reason you can't kill threads isn't because of external resources. It's because they could be holding locks.

maiden dune Sep 18, 2023, 2:13 AM

#

qt's QThreads have a forcible terminate so if you absolutely need killable threads those are an option via pyqt/pyside

raven ridge Sep 18, 2023, 2:18 AM

#

if you terminate a QThread while it holds the GIL, no other thread will ever be able to acquire the GIL, and you'll deadlock your Python process. That sounds super unwise to me.

maiden dune Sep 18, 2023, 2:51 AM

#

yeah, that is an issue. i guess one way to handle that could be to make sure the termination is run later in the event loop, like with a QTimer.singleShot. so e.g. if you need to terminate a QThread within itself QTimer.singleShot(0, self.terminate). would that be ok?

#

could there actually even be any case where a QThread is terminated from the outside while holding the GIL?

#

by the time you reach the .terminate call in another thread, the GIL would have already been handed over, and it wouldnt be released again until the terminate call is completed, right?

raven ridge Sep 18, 2023, 3:03 AM

#

maiden dune by the time you reach the `.terminate` call in another thread, the GIL would hav...

you'd need to read the code of pyside or pyqt to be sure. It wouldn't surprise me if they drop the GIL before calling terminate on QThread. You usually drop the GIL whenever you're doing anything where you'd like another thread to do something

maiden dune Sep 18, 2023, 6:26 AM

#

raven ridge you'd need to read the code of pyside or pyqt to be sure. It wouldn't surprise m...

based on sip file for QThread, terminate wont drop the GIL:

public slots:
    void start(QThread::Priority priority = QThread::InheritPriority) /ReleaseGIL/;
    void terminate();
    void quit();

public:
    bool wait(unsigned long msecs = ULONG_MAX) /ReleaseGIL/;

same for pyside's shiboken xml, where these are the only methods with allow-thread=yes modifier, which according to here means the call gets wrapped in a Py_BEGIN/END_ALLOW_THREADS:

  <object-type name="QThread">
    <enum-type name="Priority"/>
    <modify-function signature="run()" thread="yes" />
    <modify-function signature="exec()" rename="exec_" allow-thread="yes" />
    <modify-function signature="msleep(unsigned long)" allow-thread="yes" />
    <modify-function signature="sleep(unsigned long)" allow-thread="yes" />
    <modify-function signature="usleep(unsigned long)" allow-thread="yes" />
    <modify-function signature="wait(unsigned long)" allow-thread="yes" />
    <modify-function signature="start(QThread::Priority)" allow-thread="yes">
      <modify-argument index="1">
        <rename to="priority"/>
      </modify-argument>
    </modify-function>
    <modify-function signature="exit(int)" allow-thread="yes" />
  </object-type>

so tl;dr is: seems like both in pyside and pyqt, QThread.terminate from the outside wont cause a deadlock since the GIL wont be handed back to the QThread before its termination is complete.

raven ridge Sep 18, 2023, 6:27 AM

#

Well, it won't cause a deadlock on the GIL, at least. It could still cause a deadlock on a different mutex or semaphore, though

maiden dune Sep 18, 2023, 6:30 AM

#

sure, but you specifically highlighted a deadlock involving the GIL, and that was what i was referring to

maiden dune Sep 18, 2023, 12:33 PM

#

actually, circling back around to the original idea of adding a terminate to threading. how about a soft interruption instead? something that just sets a queryable flag on the thread. would reduce the rigamarole of setting up a graceful exit. something like

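import time
from threading import interruption_requested, Thread  # interruption_requested is the proposed API, it does not exist today

def loop():
  while not interruption_requested():
    time.sleep(1)
  print('thread interruption requested!')

thread = Thread(target=loop)
thread.start()
time.sleep(10)
thread.interrupt()  # proposed method, also hypothetical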

and maybe it could automatically be set for any child threads after things like KeyboardInterrupt/SIGINT, SIGHUP, sys.exit, etc.

quick snow Sep 18, 2023, 12:47 PM

#

What would be interesting as a "soft" way to stop a thread is Thread.throw. Right now it is only possible via ctypes, AFAIK.

GitHub

gofuncyourself/gofuncyourself.py at master · L3viathan/gofuncyourself

Go-style errors in Python, with a twist. Contribute to L3viathan/gofuncyourself development by creating an account on GitHub.
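
For readers wondering what "only possible via ctypes" looks like in practice, here is a minimal sketch. It assumes CPython and relies on the C API function `PyThreadState_SetAsyncExc`; the names `StopRequested` and `throw_in_thread` are made up for this illustration and are not taken from the linked repository. The exception is only delivered once the target thread is back in Python bytecode, so a thread blocked in C code will not see it until that call returns.

```python
import ctypes
import threading
import time


class StopRequested(Exception):
    """Hypothetical exception type used only in this sketch."""


def throw_in_thread(thread, exc_type):
    """Ask CPython to raise exc_type in `thread`; True if exactly one thread was targeted."""
    affected = ctypes.pythonapi.PyThreadState_SetAsyncExc(
        ctypes.c_ulong(thread.ident), ctypes.py_object(exc_type)
    )
    return affected == 1


outcome = []
ready = threading.Event()


def worker():
    try:
        ready.set()
        while True:
            time.sleep(0.01)  # short sleeps, so control returns to the interpreter often
    except StopRequested:
        outcome.append("stopped")


t = threading.Thread(target=worker)
t.start()
ready.wait(timeout=5)          # make sure the worker is inside its try block
delivered = throw_in_thread(t, StopRequested)
t.join(timeout=5)
```

This is the same mechanism the thread above calls "funky": nothing in the worker marks where the exception may arrive, so it can surface in the middle of any statement.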

peak spoke Sep 18, 2023, 12:49 PM

#

That is somewhat funky because the thread can just be stuck in C code with no way of receiving the exception in a reasonable time if it's not made to expect it

quick snow Sep 18, 2023, 12:50 PM

#

I haven't tested this in the real world, of course, but I think it should just raise at the next possible Python moment.

peak spoke Sep 18, 2023, 1:04 PM

#

yeah, just that it could be far away depending on what it's doing. I also had some weird issue where I had delayed exceptions with pyside because some python code didn't check for exceptions

static hinge Sep 18, 2023, 3:06 PM

#

quick snow What would be interesting as a "soft" way to stop a thread is `Thread.throw`. Ri...

!pip result sounds like this

fallen slateBOT Sep 18, 2023, 3:06 PM

#

result v0.13.1

A Rust-like result type for Python

quick snow Sep 18, 2023, 3:53 PM

#

static hinge !pip result sounds like this

Yes, but less serious, mine "ensures" you're handling the error :D

dusk comet Sep 18, 2023, 4:48 PM

#

Cant you do the same in Err.__del__?
Doing this by spawning threads feels weird

quick snow Sep 18, 2023, 7:24 PM

#

dusk comet Cant you do the same in `Err.__del__`? Doing this by spawning threads feels weir...

not for my usecase, because I want to prevent assigning to err but never actually doing anything with it

unkempt rock Sep 18, 2023, 7:46 PM

#

Result > *, error as value

urban sandal Sep 18, 2023, 8:34 PM

#

maiden dune actually, circling back around to the original idea of adding an `terminate` to ...

This would be significantly worse in many real world code bases, and definitely shouldn't be done automatically. If you have a thread that's handling a queue, just send a special value for the queue to finish work, and then you also allow gracefully closing the queue and aren't constantly busy looping for "maybe they want that cancelled"
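
A minimal sketch of the "special value" approach described in this message, assuming a single worker and a plain `queue.Queue`. The sentinel name `STOP` and the doubling "work" are placeholders chosen for the example.

```python
import queue
import threading

STOP = object()  # sentinel: a unique object that can never be a real work item


def worker(jobs, results):
    while True:
        item = jobs.get()      # blocks until something arrives, no polling
        if item is STOP:
            break              # told to finish: leave the loop and clean up
        results.append(item * 2)


jobs = queue.Queue()
results = []
t = threading.Thread(target=worker, args=(jobs, results))
t.start()

for n in (1, 2, 3):
    jobs.put(n)
jobs.put(STOP)                 # queued after the real work, so that is drained first
t.join(timeout=5)
```

Because the sentinel travels through the same queue as the work, the worker finishes everything queued before it and never has to wake up just to check a flag.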

#

I really think this comes down to teaching people to design concurrent code to handle gracefully shutting down and error handling, rather than trying to come up with a "one-size can't quite fit all" solution.

flat gazelle Sep 18, 2023, 8:38 PM

#

I do think there is value in having some better abstraction for thread exits than "if should_terminate:stop" in a loop every once in a while. But I am not really aware of one

urban sandal Sep 18, 2023, 8:43 PM

#

when the best possible abstraction is worse than not abstracting it, it probably shouldn't be abstracted.

daemon threads already close with the interpreter, because there you don't care if the (interpreter's) locks remain held

for 1-shot things in a thread, like moving blocking fileio to a thread in an async program, you probably just want to let the fileio finish most of the time

for long lived background threads, you probably should have a work queue or some other means for the background thread to communicate, and this becomes a viable means to also send "okay finish up" and handle that as appropriate to your application, without looping on that.

maiden dune Sep 18, 2023, 11:06 PM

#

urban sandal This would be significantly worse in many real world code bases, and definitely ...

This would be significantly worse in many real world code bases, and definitely shouldn't be done automatically.

Could you elaborate on why that would be the case? With this approach, a thread would be free to completely ignore and never use the interruption flag. Having it be set automatically wouldn't change anything about existing code, while making it easier to cover some commonly encountered exit conditions for code that wants it.

If you have a thread that's handling a queue, just send a special value for the queue to finish work, and then you also allow gracefully closing the queue and aren't constantly busy looping for "maybe they want that cancelled"

Not all threads are handling queues.

I really think this comes down to teaching people to design concurrent code to handle gracefully shutting down and error handling, rather than trying to come up with a "one-size can't quite fit all" solution.

I agree with promoting better code design, but how would that be achieved without teaching the use of some kind of signalling mechanism? Whether dequeuing a sentinel, or polling a flag, the goal is to provide some update to the thread about the wider program state, right? They're variations of the same thing. The threading.Event case in particular is a fairly common approach, and often seen as a solution to many questions about graceful shutdowns. This would be a more convenient version of that.
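
For comparison, the `threading.Event` approach mentioned here can be sketched with what the standard library already offers. The names `stop_requested`, `loop` and `log` are placeholders; the `wait(timeout=...)` call stands in for real work and returns early as soon as the event is set.

```python
import threading

stop_requested = threading.Event()
log = []


def loop():
    while not stop_requested.is_set():
        log.append("tick")                   # placeholder for a unit of work
        stop_requested.wait(timeout=0.01)    # pauses, but wakes immediately on set()
    log.append("cleaned up")                 # graceful exit on the thread's own terms


t = threading.Thread(target=loop)
t.start()
stop_requested.set()                         # the "soft interruption"
t.join(timeout=5)
```

The proposed `interruption_requested()` would roughly replace the explicit `Event` that has to be created and passed around here.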

maiden dune Sep 18, 2023, 11:07 PM

#

urban sandal when the best possible abstraction is worse than not abstracting it, it probably...

when the best possible abstraction is worse than not abstracting it, it probably shouldn't be abstracted

An explanation for why it's worse would be more helpful.

for long lived background threads, you probably should have a work queue or some other means for the background thread to communicate, and this becomes a viable means to also send "okay finish up" and handle that as appropriate to your application, without looping on that.

Unless, again, the thread isn't doing anything queue oriented, in which case forcing a queue into the mix would result in the same looped polling behaviour but now with even more overhead and complexity. And wouldn't any other means, short of injecting exceptions into the thread, also reduce to the same thing?

maiden dune Sep 18, 2023, 11:08 PM

#

flat gazelle I do think there is value in having some better abstraction for thread exits tha...

Yeah, I think the threading module could do with some sprucing up in general, especially with GIL removal on the horizon.

maiden dune Sep 18, 2023, 11:12 PM

#

quick snow What would be interesting as a "soft" way to stop a thread is `Thread.throw`. Ri...

that's cool, i like the idea of interruptions via injecting exceptions, though only if there's some explicit control over exactly where that could occur, like being able to catch a asyncio.CancelledError on an await in coroutines, or with generator.throw on a yield. maybe with a context manager? e.g.

from threading import allow_interruptions, InterruptionError

def loop():
  # do some stuff to set up...
  # so far, the code out here is guaranteed to not be interrupted by any injected exception

  while True:
    try:
      with allow_interruptions:
        # but the code inside here can be interrupted

        # do some stuff that blocks...
        info = queue.get()
    except InterruptionError:
      break
    else:
      # do uninterruptable stuff with dequeued info...

  # do clean up stuff
      
thread = Thread(target=loop)
thread.start()

# then later on:
thread.throw(CustomException) # raise a custom exception at the next interruption point

# or
thread.interrupt() # now equivalent to thread.throw(InterruptionError)

thread.join()

urban sandal Sep 18, 2023, 11:22 PM

#

maiden dune > This would be significantly worse in many real world code bases, and definitel...

you're adding a "way to do things" that encourages a specific way as the way to do it which leads to looping on the "should I stop" rather than it being driven by being told to stop. API design encourages code design. Sometimes, checking something like that might be the only way, but I would rather not encourage the worst way to check this with API design, there are already enough pitfalls in concurrency for API design to lead someone to thinking this is a good way to do it.

#

This would be better handled by a section in threading docs showing basic ways to handle thread shutdown if it's that common, and then letting people pick the one that fits what matches their needs best. (and building more on it from there based on their needs)

urban sandal Sep 18, 2023, 11:28 PM

#

maiden dune > when the best possible abstraction is worse than not abstracting it, it probab...

if the thread has no need to communicate already, it probably shouldn't be terminated abruptly as it has no way to communicate anything about the cancellation back. Unlike asyncio, which has builtin ways to still handle this in done_callback (if you're at low level cancellation), there's no equivalent in threading without already needing a means of communicating. Event driven code performs significantly better in concurrent systems than code that busy loops or polls, and the latter should be avoided when possible (yes, it isn't always possible)

maiden dune Sep 19, 2023, 2:13 AM

#

urban sandal you're adding a "way to do things" that encourages a specific way as the way to ...

you're adding a "way to do things" that encourages a specific way as the way to do it which leads to looping on the "should I stop" rather than it being driven by being told to stop. API design encourages code design. Sometimes, checking something like that might be the only way, but I would rather not encourage the worst way to check this with API design, there are already enough pitfalls in concurrency for API design to lead someone to thinking this is a good way to do it.

this argument could be made for literally any feature. by this logic nothing new should ever be added because someone somewhere might misunderstand how to use it or assume it's automatically better than every alternative in every case. that's not an issue with the feature itself, it's at most an issue with presentation, or just plain old user error. also not sure how this explains why it would be 'significantly worse for many real world code bases'; the flag poll method is already common for code that doesnt do any queue oriented work and a built-in flag would be a more convenient way of doing that.

This would be better handled by a section in threading docs showing basic ways to handle thread shutdown if it's that common, and then letting people pick the one that fits what matches their needs best. (and building more on it from there based on their needs)

there's nothing about this feature which would prevent the addition of such a section. a new feature isn't in competition or mutually exclusive with more docs.

maiden dune Sep 19, 2023, 2:20 AM

#

urban sandal if the thread has no need to communicate already, it probably shouldn't be termi...

if the thread has no need to communicate already, it probably shouldn't be terminated abruptly as it has no way to communicate anything about the cancellation back.

not sure i follow the logic here. if a thread already doesn't need to communicate overall, then not being able to communicate about a cancellation wont be an issue either, since it's already been established that there's no need to. also to be clear, this feature is about enabling a graceful termination, as in allowing the thread to do cleanup, etc. on its own terms. not abrupt as in killing a process.

Unlike asyncio, which has builtin ways to still handle this in done_callback (if you're at low level cancellation), there's no equivalent in threading without already needing a means of communicating.

if the problem is recovering information from threads after theyre done, thats a separate issue which has always been there with threads (though I suppose ThreadPoolExecutor addresses this to an extent with concurrent.futures.Future). dont see how making one method of interruption more convenient could make this worse.

in any case, something like this could suffice:

from threading import Thread, interruption_requested  # interruption_requested is the proposed API

def loop():
    while not interruption_requested():
        ...  # do stuff
    # clean up
    return 'cool result'

class ThreadWithResult(Thread):
    def __init__(self, target, args=None, kwargs=None):
        super().__init__()
        self.target = target
        self.args = args or ()
        self.kwargs = kwargs or {}
        self.result = None
        self.error = None

    def run(self):
        # capture either the return value or the uncaught exception
        try:
            self.result = self.target(*self.args, **self.kwargs)
        except Exception as e:
            self.error = e

thread = ThreadWithResult(loop)
thread.start()
# ... later on
thread.interrupt()  # proposed method
thread.join()
if thread.error is None:
    ...  # do things with thread.result

(or alternatively add these attributes to the default Thread class to capture run's return value and uncaught exceptions)
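
The `ThreadPoolExecutor` route mentioned a few lines earlier already captures return values and uncaught exceptions. A minimal sketch, using a `threading.Event` in place of the proposed interruption flag (the names `stop`, `loop` and `broken` are placeholders):

```python
import threading
from concurrent.futures import ThreadPoolExecutor

stop = threading.Event()


def loop():
    while not stop.wait(timeout=0.01):   # wait() returns True once stop is set
        pass                             # do stuff
    # clean up
    return "cool result"


def broken():
    raise ValueError("boom")


with ThreadPoolExecutor(max_workers=2) as pool:
    ok_future = pool.submit(loop)
    bad_future = pool.submit(broken)
    stop.set()                                   # request a graceful stop
    result = ok_future.result(timeout=5)         # the value returned by loop()
    error = bad_future.exception(timeout=5)      # the ValueError, returned rather than raised
```

`Future.result()` would re-raise the stored exception, while `Future.exception()` hands it back as a value, which is close to the `result` and `error` attributes sketched above.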

#

if it specifically has to involve a callback, i suppose that could also be set with another method, or just as another attribute on the thread.

from threading import interruption_requested, interruption_callback, Thread  # proposed API

def loop():
    while not interruption_requested():
        ...  # do stuff

    # get callback
    callback = interruption_callback()

    if callback:
        callback(...)

thread = Thread(target=loop)
thread.start()
# ... later on
thread.set_interruption_callback(print)
thread.interrupt()
thread.join()

urban sandal Sep 19, 2023, 2:22 AM

#

The argument I made doesn't apply to all features, it's saying the base case of not adding it is a better state than the API provided in the standard library encouraging the worst way to do it, especially when you can already do what you want there yourself without it being at the language level. Sometimes abstractions aren't necessary, and the social impacts of them are negative.

maiden dune Sep 19, 2023, 2:22 AM

#

urban sandal if the thread has no need to communicate already, it probably shouldn't be termi...

Event driven code performs significantly better in concurrent systems than code that busy loops or polls, and the latter should be avoided when possible (yes, it isn't always possible)

Far as I know, it's possible to have event driven code that involves busy waiting or polling; these aren't mutually exclusive concepts. Also I feel like characterizing this kind of loop as a busy wait would be inaccurate since there would be work being done between each poll. It wouldn't be like some kind of spinlock, just sitting there repeatedly checking for an interruption doing nothing else. I think maybe the use of sleep in the original example loop, intended as a placeholder for work being done, might have given a misleading impression.

paper echo Sep 20, 2023, 11:47 AM

#

spark magnet the reason you can't kill threads isn't because of external resources. It's beca...

isn't that kind of an external resource? other than the GIL

spark magnet Sep 20, 2023, 1:27 PM

#

paper echo isn't that kind of an external resource? other than the GIL

I wouldn't call an in-process lock an external resource. What is it external to?

urban sandal Sep 20, 2023, 3:35 PM

#

They could be holding external locks too like a named semaphore but the problem remains the same whether you consider the resources being held as internal or external: killing a thread instead of communicating to close it prevents any necessary cleanup and releasing of resources from happening. The same is not true with killing a process, as they receive a signal from the OS.

charred fulcrum Sep 20, 2023, 3:35 PM

#

An in-process lock may not be external to the system, but it is still a resource that is external to the current thread.

spark magnet Sep 20, 2023, 3:43 PM

#

i guess i would say, "clean up held resources, especially locks". Internal vs external is vague and irrelevant.

paper echo Sep 20, 2023, 7:54 PM

#

spark magnet I wouldn't call an in-process lock an external resource. What is it external to...

right, but a thread holding an internal lock (i.e. one that i deliberately created inside my application) falls within the territory of "i know i'm not doing that, please let me just kill the thread"

spark magnet Sep 20, 2023, 8:00 PM

#

paper echo right, but a thread holding an _internal_ lock (i.e. one that i deliberately cre...

if the thread is holding the lock, your app would be wrong to terminate the thread.

#

now anything else needing the lock is deadlocked.

paper echo Sep 20, 2023, 8:00 PM

#

right, but i can also deadlock my app in 100 other ways

#

actually i manage to deadlock my scripts almost guaranteed every time any time i have to write while (item := queue.get()) is not None: ...

#

with asyncio there's a workaround:

while True:
    queue_get_task = asyncio.create_task(queue.get())
    shutdown_signal_task = asyncio.create_task(shutdown_event.wait())
    tasks = (shutdown_signal_task, queue_get_task)
    done, _ = await asyncio.wait(tasks, return_when=asyncio.FIRST_COMPLETED)
    if queue_get_task in done:
        ...
    if shutdown_signal_task in done:
        break

but i don't know of any equivalent with threads, other than "more threads" which seems kind of.. sketchy? bad?

with concurrent.futures.ThreadPoolExecutor(2) as local_executor:
    while True:
        queue_get_fut = local_executor.submit(queue.get)
        shutdown_signal_fut = local_executor.submit(shutdown_event.wait)
        futures = (shutdown_signal_fut, queue_get_fut)
        done, _ = concurrent.futures.wait(futures, return_when=concurrent.futures.FIRST_COMPLETED)

maybe this is just me being bad at software design, but i feel like this is not exactly a great experience for people who need to do a bunch of i/o and don't have an async-ready library available for that purpose
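
One commonly used threads-only alternative, sketched here under the assumption that a short polling interval is acceptable: give `queue.get` a timeout and re-check a shutdown event whenever it expires. The names `consume`, `jobs`, `shutdown` and `handled` are placeholders. This is exactly the kind of polling criticised earlier in the discussion, so it is a trade-off rather than a clean equivalent of the asyncio version.

```python
import queue
import threading


def consume(jobs, shutdown, handled):
    while not shutdown.is_set():
        try:
            item = jobs.get(timeout=0.05)   # block briefly instead of forever
        except queue.Empty:
            continue                        # nothing arrived: re-check the shutdown flag
        handled.append(item)
        jobs.task_done()


jobs = queue.Queue()
shutdown = threading.Event()
handled = []
t = threading.Thread(target=consume, args=(jobs, shutdown, handled))
t.start()

jobs.put("a")
jobs.put("b")
jobs.join()        # wait until both items have been marked done
shutdown.set()     # the worker notices within one timeout interval
t.join(timeout=5)
```

Putting a sentinel on the queue avoids the timeout entirely, but only works when whoever requests the shutdown has access to the queue.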

#

another option was floated in #async-and-concurrency when i last brought this up, which was to create my external resource handle in the main thread, but never actually use it there, and keep passing it off to run in worker threads, which is also kind of scary because such handles are often not even close to thread-safe

raven ridge Sep 20, 2023, 9:15 PM

#

paper echo right, but a thread holding an _internal_ lock (i.e. one that i deliberately cre...

you've conflated two different things here. Not every mutex your thread might be holding is one that you deliberately created inside your application. There's one inside of the libc stdio objects used by CPython to print to stdout, for instance. There's almost certainly one inside the libc malloc function that CPython uses to allocate memory. There's one in the C code for initializing a block scoped static variable, and possibly one for initializing a thread-local variable inside an extension module. And those are just examples of mutexes in libc / libpthread. There's also mutexes inside of, for instance, the logging module, so if your thread was in the middle of logging when you killed it, it could die holding a mutex that could cause a deadlock on any future attempt to log anything from any thread.

#

in other words, "process-local mutexes are an internal resource rather than an external one" does not imply "internal resources were deliberately created by me in my own application"

paper echo Sep 20, 2023, 10:01 PM

#

raven ridge in other words, "process-local mutexes are an internal resource rather than an e...

i see, thanks for clarifying

flat gazelle Sep 20, 2023, 10:08 PM

#

Another interesting thing to keep in mind is that there are synchronization scenarios where releasing a lock on exit will also cause a bug.

grand rain Sep 21, 2023, 2:12 AM

#

o/
https://peps.python.org/pep-0492/#await-expression

Any yield from chain of calls ends with a yield.
Isn't that false?


#

I mean

dusk comet Sep 21, 2023, 2:45 AM

#

yield from () also doesn't use yield under the hood, because tuple iterator is not a generator

#

yielding from if False: yield function will also do the thing
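
Both counterexamples can be checked directly. In this sketch (`never_yields` and `outer` are illustrative names) a `yield from` chain runs to completion without a single `yield` ever executing:

```python
def never_yields():
    if False:
        yield              # makes this a generator function, but never executes
    return "inner done"


def outer():
    yield from ()          # a tuple iterator: exhausted at once, no generator involved
    value = yield from never_yields()
    return value


gen = outer()
try:
    next(gen)              # runs outer() all the way to its return
except StopIteration as finished:
    finished_with = finished.value
else:
    finished_with = None   # would mean something was actually yielded
```

The first `next()` call raises `StopIteration` immediately, carrying the return value, which is the "returns stop instantly" behaviour described in the next messages.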

grand rain Sep 21, 2023, 2:49 AM

#

I mean that actually executes

#

but returns stop instantly, and looks same in asyncio

uncut ridge Sep 21, 2023, 2:56 AM

#

dusk comet `yield from ()` also doesn't use yield under the hood, because tuple iterator is...

so yield from not always leads to yield?

#

and await is maybe await maybe block

uncut ridge Sep 21, 2023, 3:13 AM

#

dusk comet `yield from ()` also doesn't use yield under the hood, because tuple iterator is...

yield from iterable generally?

static hinge Sep 21, 2023, 3:36 PM

#

yield from is also how you access the return value of a generator.

#

very niche feature

#

at least since async/await came along
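
A small sketch of that feature, with placeholder names: the value of a `yield from` expression is whatever the inner generator returns.

```python
def inner():
    yield 1
    yield 2
    return "summary"        # carried on StopIteration, invisible to a plain for loop


def outer():
    result = yield from inner()   # re-yields 1 and 2, then evaluates to "summary"
    yield f"inner returned {result!r}"


items = list(outer())
```

Iterating `inner()` directly with `for` or `list()` would silently discard `"summary"`.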

grand rain Sep 21, 2023, 6:01 PM

#

well, await is actually a synonym for yield from

#

This is more about coming of async in python in general

feral island Sep 21, 2023, 6:07 PM

#

grand rain well, `await` is actually a synonym for `yield from`

no, it's not

#

They can be used in a similar way semantically, but they do not do the same thing under the hood

dusk comet Sep 21, 2023, 6:22 PM

#

i guess they do similar thing
historically, yield from was used instead of await. Then await was introduced and it replaced yield from usage for async functions
there is still a function in stdlib that converts generator to async function
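
The stdlib function meant here is presumably `types.coroutine` (that is an assumption, the message does not name it). It marks a generator function so that its result can be awaited. A minimal sketch with placeholder names:

```python
import asyncio
import types


@types.coroutine
def yield_to_loop():
    yield                  # a bare yield hands control back to the event loop once


async def main():
    await yield_to_loop()  # awaiting a generator-based coroutine
    return "resumed"


answer = asyncio.run(main())
```

Without the decorator, `await yield_to_loop()` would fail with a `TypeError`, since a plain generator object is not awaitable.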

grand rain Sep 21, 2023, 6:34 PM

#

feral island no, it's not

in case of coroutines also?

#

also, in futures __iter__ is bound to __await__

static hinge Sep 21, 2023, 7:50 PM

#

dusk comet i guess they do similar thing historically, `yield from` was used instead of `aw...

yield from was never used in async functions. It was used with @asyncio.coroutine.

#

yield from is also a way you can implement __await__
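
A sketch of that last point: `__await__` must return an iterator, and writing it as a generator that delegates with `yield from` is one way to do so. `Delayed` is a made-up class for illustration.

```python
import asyncio


class Delayed:
    """Awaitable wrapper that yields to the event loop once, then produces a value."""

    def __init__(self, value):
        self.value = value

    def __await__(self):
        # Delegate to another awaitable's iterator, then return our own result.
        yield from asyncio.sleep(0).__await__()
        return self.value


async def main():
    return await Delayed(42)


got = asyncio.run(main())
```

Note that `await` itself is not allowed inside `__await__`, because it is an ordinary (non-async) method, which is why `yield from` is used there.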

#

that's right, remove your 👎 reaction

uncut ridge Sep 21, 2023, 7:55 PM

#

replaced for async functions
not in async functions

static hinge Sep 21, 2023, 7:55 PM

#

lol.

cyan raven Sep 21, 2023, 8:09 PM

#

uncut ridge replaced for async functions not in async functions

you must be a really smart person, thank you for helping this conversation going forward.

grand rain Sep 21, 2023, 8:17 PM

#

cyan raven you must be a really smart person, thank you for helping this conversation going...

wdym

uncut ridge Sep 21, 2023, 8:18 PM

#

cyan raven you must be a really smart person, thank you for helping this conversation going...

where am I wrong?
unalivejoy simply misinterpreted denball's message, and is disputing an imagined statement
of course yield from was not used in async functions, it's just a syntax error by pep.
it was used in generator based coroutines before async await
no?

feral island Sep 21, 2023, 8:25 PM

#

Seems like people are mostly in agreement but not always using the most precise language

grand rain Sep 21, 2023, 8:26 PM

#

grand rain o/ https://peps.python.org/pep-0492/#await-expression > Any **yield from** chain...

but still, what about that?

feral island Sep 21, 2023, 8:27 PM

#

grand rain but still, what about that?

Yeah I don't know what to make of that sentence either.

uncut ridge Sep 21, 2023, 8:36 PM

#

we need to go back to 09-Apr-2015

#

damn

#

ModuleNotFoundError: No module named 'timetravel'

grand rain Sep 22, 2023, 2:14 AM

#

man

neat delta Sep 22, 2023, 2:54 AM

#

!rule 9 6 perhaps you should re-read the channel description. this channel is about python internals, which your message isn't. and in case you're wondering, nowhere in this entire server do we allow resumes

fallen slateBOT Sep 22, 2023, 2:54 AM

#

Rules

6. Do not post unapproved advertising.

9. Do not offer or ask for paid work of any kind.

raven ridge Sep 22, 2023, 4:20 AM

#

!warn 1129321197448986636 Please don't attempt to solicit work here, per rules 6 and 9.

fallen slateBOT Sep 22, 2023, 4:20 AM

#

:incoming_envelope: :ok_hand: applied warning to @gaunt sleet.

raven ridge Sep 22, 2023, 4:21 AM

#

I've deleted your message accordingly

soft drum Sep 22, 2023, 11:01 AM

#

lis = [1,2,3,4,5,6]
print(lis[5:65])

Why does this not result in an error?

dusk comet Sep 22, 2023, 11:24 AM

#

because builtin sequences ignore index errors in case of slicing
otherwise it would be VERY annoying

#

imagine x[:5] erroring because there are less than 5 elements
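
To make the difference concrete, a short sketch using the list from the question: slice bounds are clamped to the length of the sequence, while indexing a single out-of-range position still raises.

```python
lis = [1, 2, 3, 4, 5, 6]

# Slicing clamps the stop value to len(lis) instead of raising.
print(lis[5:65])
print(lis[10:20])

# slice.indices() shows the clamped (start, stop, step) for a given length.
print(slice(5, 65).indices(len(lis)))

# Indexing a single element out of range is still an error.
try:
    lis[65]
except IndexError as exc:
    error = exc
    print("IndexError:", exc)
```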

cyan raven Sep 22, 2023, 1:12 PM

#

where is the source code of the python discourse site (discuss.python.org), or is that just a fork of the original one?

#

https://github.com/discourse/discourse

GitHub

GitHub - discourse/discourse: A platform for community discussion. ...

A platform for community discussion. Free, open, simple. - GitHub - discourse/discourse: A platform for community discussion. Free, open, simple.

soft drum Sep 23, 2023, 1:54 PM

#

dusk comet because builtin sequences ignore index errors in case of slicing otherwise it wo...

But Golang shows errors on this !

dusk comet Sep 23, 2023, 2:38 PM

#

golang is not python

deep jolt Sep 24, 2023, 4:25 AM

#

cyan raven https://github.com/discourse/discourse

looks interesting 🙂

cyan raven Sep 24, 2023, 10:25 AM

#

https://discuss.python.org/t/official-list-of-core-developers/924/4
Any thoughts on this, I'm not sure if this git repository was created.

Discussions on Python.org

Official list of core developers

I can be wrong. If you are aware of someone else having an “OSS day” (@emily maybe?), tell me 😉 Yeah, I tried to put this number in perspective with the popularity of the Python language: #3 most popular language in the world according to TIOBE index… and only 2 full-time paid developers… Python is not a product. It’s hard to justify to your m...

spark magnet Sep 24, 2023, 12:08 PM

#

cyan raven https://discuss.python.org/t/official-list-of-core-developers/924/4 Any thoughts...

what git repo are you asking about?

cyan raven Sep 24, 2023, 12:20 PM

#

spark magnet what git repo are you asking about?

I mean, the person who wrote the post mentioned there's no official list of core developers and asked a bunch of other stuff. I'm kinda curious if anything's changed since then or what's up with it now.

unkempt rock Sep 24, 2023, 2:37 PM

#

Is there a PEP for implementation of package managers?

cyan raven Sep 24, 2023, 2:42 PM

#

unkempt rock Is there a PEP for implementation of package managers?

you mean pip?

#

pep about pip?

unkempt rock Sep 24, 2023, 2:44 PM

#

cyan raven pep about pip?

No, just in general

safe basalt Sep 24, 2023, 2:52 PM

#

!pep 518

fallen slateBOT Sep 24, 2023, 2:52 PM

#

**PEP 518 - Specifying Minimum Build System Requirements for Python Projects**

Status

Final

Created

10-May-2016

Type

Standards Track

safe basalt Sep 24, 2023, 2:53 PM

#

no wait

#

!pep 517

fallen slateBOT Sep 24, 2023, 2:53 PM

#

**PEP 517 - A build-system independent format for source trees**

Status

Final

Created

30-Sep-2015

Type

Standards Track

dusk comet Sep 24, 2023, 2:53 PM

#

there are some packaging-related peps, iirc

safe basalt Sep 24, 2023, 2:53 PM

#

There are several

dusk comet Sep 24, 2023, 2:54 PM

#

there is a lot of peps with "metadata" in their names

unkempt rock Sep 24, 2023, 2:54 PM

#

safe basalt !pep 517

What does "build-system independent system" mean here?

dusk comet Sep 24, 2023, 2:57 PM

#

"build-system independent format"

#

it is the universal format for describing building process, i guess

#

like setup.py or pyproject.toml

safe basalt Sep 24, 2023, 2:58 PM

#

not like a setup.py

#

!pep 621 provides the specification for a pyproject.toml

fallen slateBOT Sep 24, 2023, 2:58 PM

#

**PEP 621 - Storing project metadata in pyproject.toml**

Status

Final

Created

22-Jun-2020

Type

Standards Track

dusk comet Sep 24, 2023, 2:58 PM

#

then maybe setup.cfg?

safe basalt Sep 24, 2023, 2:58 PM

#

no

dusk comet Sep 24, 2023, 2:58 PM

#

ok 👍

#

so pyproject.toml is the only build-system independent format

safe basalt Sep 24, 2023, 2:59 PM

#

PEP 517 creates a specification for how to turn a pyproject.toml into a correct package

safe basalt Sep 24, 2023, 3:03 PM

#

dusk comet so pyproject.toml is the only build-system independent format

Yes.
setup.py and setup.cfg are a part of setuptools. They are not a part of any other build system.
And even the setuptools maintainers have started talking about getting rid of the setup.{py,cfg} now that they've added support for pyproject.tomls.

#

They're still faaar too prevalent to actually do that, but the talk is there

#

And not all pyproject.tomls are created equal, either.
Poetry uses the same file name, and it even looks almost the same, but they use their own structure and their own process that is not standards compliant

cyan raven Sep 24, 2023, 4:26 PM

#

is this how Cython implements class methods?
https://github.com/cython/cython/blob/9827c6085e2141db71c55ae231a4a09a878dd524/Cython/Compiler/Symtab.py#L2175

I suppose this module refers to the symbol table inside the compiler.

fallen slateBOT Sep 24, 2023, 4:26 PM

#

Cython/Compiler/Symtab.py line 2175

```py
if name == "classmethod":
```

cyan raven Sep 24, 2023, 5:12 PM

#

safe basalt And not all `pyproject.toml`s are created equal, either. Poetry uses the same fi...

I think the best combo for someone who wants to go with setuptools is to use the setuptools backend in pyproject.toml.

#

build-backend = 'setuptools.build_meta'

alpine rose Sep 24, 2023, 5:16 PM

#

cyan raven I mean, the person who wrote the post mentioned there's no official list of core...

the official list that i’m aware of is not public, presumably because it contains contact details

cyan raven Sep 24, 2023, 5:17 PM

#

alpine rose the official list that i’m aware of is not public, presumably because it contain...

oh okay, is there a discord server or something where core developers are talking with each other, just wondering tho.

merry bramble Sep 24, 2023, 5:47 PM

#

cyan raven oh okay, is there a discord server or something where core developers are talkin...

yes, but many core developers aren't on discord

#

And most important discussions take place in public

cyan raven Sep 24, 2023, 6:11 PM

#

merry bramble And _most_ important discussions take place in public

fair enough.

merry bramble Sep 24, 2023, 6:15 PM

#

The aim is of course that all important discussions should take place in public (either on GitHub or at discuss.python.org) — it's open source, after all. But occasionally there are things where a quick back-and-forth on an instant-messaging platform is really useful

cyan raven Sep 24, 2023, 6:16 PM

#

merry bramble The aim is of course that _all_ important discussions should take place in publi...

yes, but I'd love to see a list in public so I can see all core developers and their work, I don't think some public information can cause issues.

merry bramble Sep 24, 2023, 6:29 PM

#

cyan raven yes, but I'd love to see a list in public so I can see all core developers and t...

https://devguide.python.org/core-developers/developer-log/ is public

Python Developer's Guide

Developer log

This page lists the historical members of the Python development team. (The master list is kept in a private repository due to containing sensitive contact information.),,,,, Name, GitHub username,...

cyan raven Sep 24, 2023, 6:32 PM

#

merry bramble https://devguide.python.org/core-developers/developer-log/ is public

Thank you, is there a requirement about how many hours you should spend developing per day as a core developer?

merry bramble Sep 24, 2023, 6:34 PM

#

no

merry bramble Sep 24, 2023, 6:37 PM

#

merry bramble https://devguide.python.org/core-developers/developer-log/ is public

Some of the people listed here have not made contributions for many years, but are still officially core developers

cyan raven Sep 24, 2023, 6:38 PM

#

merry bramble Some of the people listed here have not made contributions for many years, but a...

so you can have commit privileges if you are a core dev I assume.

merry bramble Sep 24, 2023, 6:39 PM

#

yes

cyan raven Sep 24, 2023, 6:39 PM

#

merry bramble yes

age doesn't matter, does it?

alpine rose Sep 24, 2023, 6:44 PM

#

no, why would it?

cyan raven Sep 24, 2023, 6:48 PM

#

alpine rose no, why would it?

well, I'm thinking about making more contributions to CPython and if I have enough experience I might try being a core developer. I wasn't sure whether my age would fit(this is why I asked).

spark magnet Sep 24, 2023, 7:15 PM

#

cyan raven well, I'm thinking about making more contributions to CPython and if I have enou...

The release manager for 2.7 was a teenager I believe. Age doesn't matter.

merry bramble Sep 24, 2023, 7:16 PM

#

Core developers need to be people who have demonstrated commitment to the project, people we're confident will work well as part of the team and people whose judgement we have confidence in. But there's certainly no age requirement

spark magnet Sep 24, 2023, 7:24 PM

#

or time commitment.

upper timber Sep 24, 2023, 8:23 PM

#

Hi, is this a good place to ask about pypy?

#

I couldn't find any pypy related discussion in python-help forum (or maybe it's just that discord is autocorrecting it to pypi ? hmm)

cyan raven Sep 24, 2023, 8:56 PM

#

upper timber Hi, is this a good place to ask about pypy?

Yes.

upper timber Sep 24, 2023, 8:58 PM

#

I was wondering what would be the best way of studying machine codes emitted by pypy JIT.

It seems like my options are vmprof (which functions as both profiler and JIT log visualizer) and jitviewer. Am I missing anything?

I was just being wary because vmprof.com is down and jitviewer was not updated for few years.

#

I want some Godbolt-esque tool that I could use to study how pypy is responding to my attempt at optimizing my code

cyan raven Sep 24, 2023, 9:08 PM

#

upper timber I want some Godbolt-esque tool that I could use to study how pypy is responding ...

I'm sure that the others will be able to help, I'm not too familiar with Pypy.

unkempt rock Sep 25, 2023, 1:29 AM

#

are each pep styles attributed to different python versions and does this mean each has linguistic differences or syntax variety

spark magnet Sep 25, 2023, 1:42 AM

#

unkempt rock are each pep styles attributed to different python versions and does this mean e...

what styles do you mean?

cyan raven Sep 25, 2023, 1:34 PM

#

how is pypy getting on with the latest features, is it still up-to-date?

spark magnet Sep 25, 2023, 1:46 PM

#

cyan raven how is pypy getting on with the latest features, is it still up-to-date?

they have a 3.10 version.

cyan raven Sep 25, 2023, 2:17 PM

#

spark magnet they have a 3.10 version.

so it's not that active?

merry bramble Sep 25, 2023, 3:05 PM

#

cyan raven so it's not that active?

what do these bars mean?

cyan raven Sep 25, 2023, 3:09 PM

#

merry bramble what do these bars mean?

the activity of pypy on GitLab. I was just looking at the project. I was wondering why it doesn't support Python 3.11.

spark magnet Sep 25, 2023, 3:24 PM

#

cyan raven the activity of pypy on GitLab. I was just looking at the project. I was wonderi...

3.10 isn't too far behind current.

upper timber Sep 25, 2023, 3:31 PM

#

An impression I got is that many previous core developers are no longer active but bug fixes are still being resolved and all. Wasn’t there HN thread recently where large amount of people came out of woodwork and explained how they are deploying pypy at work?

static hinge Sep 25, 2023, 4:33 PM

#

safe basalt !pep 621 provides the specification for a `pyproject.toml`

It's so easy to remember. 621 is what to do when attending a convention. 6 hours of sleep, 2 meals, 1 shower, every day.

#

||totally not because of the famous site ending in 621||

cyan raven Sep 25, 2023, 4:49 PM

#

spark magnet 3.10 isn't too far behind current.

I didn't see any tasks in the todos text file, but I suppose they are working on it.

cyan raven Sep 25, 2023, 10:28 PM

#

Could someone link me to the code where the self is being passed as the first argument to the methods under the hood?

feral island Sep 25, 2023, 10:31 PM

#

cyan raven Could someone link me to the code where the self is being passed as the first ar...

there's two levels of that. One is that FunctionType has a __get__ implementation that returns a bound method object that inserts the extra argument

#

The other level is that in practice, as an optimization, we usually bypass that and the bytecode calls the function object directly with the extra argument added
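
The first level can be seen from pure Python. A minimal sketch (the `Greeter` class is made up): calling the function's `__get__` by hand produces the same kind of bound method object that normal attribute lookup gives you, and calling it passes the instance as the first argument.

```python
import types


class Greeter:
    def greet(self, name):
        return f"{type(self).__name__} greets {name}"


g = Greeter()

func = Greeter.__dict__["greet"]   # the plain function stored on the class
bound = func.__get__(g, Greeter)   # what g.greet does via the descriptor protocol

print(isinstance(bound, types.MethodType))
print(bound.__self__ is g, bound.__func__ is func)

# Calling the bound method inserts g as the first argument.
via_bound = bound("Ada")
via_func = func(g, "Ada")
print(via_bound == via_func)
```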

#

you can trace the first one at https://github.com/python/cpython/blob/d73c12b88c2275fd44e27c91c24f3ac85419d2b8/Objects/funcobject.c#L962 (implementation of tp_descr_get for functions), which creates a method object (https://github.com/python/cpython/blob/d73c12b88c2275fd44e27c91c24f3ac85419d2b8/Objects/classobject.c#L108), which has a __call__ that ends up at https://github.com/python/cpython/blob/d73c12b88c2275fd44e27c91c24f3ac85419d2b8/Objects/classobject.c#L43

fallen slateBOT Sep 25, 2023, 10:34 PM

#

`Objects/funcobject.c` line 962
```c
func_descr_get(PyObject *func, PyObject *obj, PyObject *type)
```
`Objects/classobject.c` line 108
```c
PyMethod_New(PyObject *func, PyObject *self)
```
`Objects/classobject.c` line 43
```c
method_vectorcall(PyObject *method, PyObject *const *args,
```

feral island Sep 25, 2023, 10:35 PM

#

and there you can see some code like newargs[0] = self

#

for the second level, I think you'd have to look at the ways the CALL opcode is specialized, e.g. https://github.com/python/cpython/blob/d73c12b88c2275fd44e27c91c24f3ac85419d2b8/Python/bytecodes.c#L3374

fallen slateBOT Sep 25, 2023, 10:36 PM

#

Python/bytecodes.c line 3374

```c
inst(CALL_METHOD_DESCRIPTOR_O, (unused/1, unused/2, callable, self_or_null, args[oparg] -- res)) {
```

cyan raven Sep 25, 2023, 10:37 PM

#

feral island there's two levels of that. One is that FunctionType has a `__get__` implementat...

I wonder if I could implement something in pure Python that passes the self in using descriptor terminology.
like this for classmethod.

import functools
from types import MethodType

class ClassMethod:
    "Emulate PyClassMethod_Type() in Objects/funcobject.c"

    def __init__(self, f):
        self.f = f
        functools.update_wrapper(self, f)

    def __get__(self, obj, cls=None):
        if cls is None:
            cls = type(obj)
        if hasattr(type(self.f), '__get__'):
            # This code path was added in Python 3.9
            # and was deprecated in Python 3.11.
            return self.f.__get__(cls, cls)
        return MethodType(self.f, cls)

cyan raven Sep 25, 2023, 10:38 PM

#

feral island for the second level, I think you'd have to look at the ways the CALL opcode is ...

thank you

feral island Sep 25, 2023, 10:38 PM

#

cyan raven I wonder if I could implement something in pure Python that passes the self in u...

yes, pretty sure you can implement classmethod and friends in Python using __get__ pretty easily

merry bramble Sep 26, 2023, 6:08 AM

#

The descriptor HOWTO has a whole section giving pure-Python equivalents of property, classmethod, staticmethod and others: https://docs.python.org/3/howto/descriptor.html#pure-python-equivalents

Python documentation

Descriptor HowTo Guide

Author, Raymond Hettinger,, Contact,,. Contents: Descriptor HowTo Guide- Primer- Simple example: A descriptor that returns a constant, Dynamic lookups, Managed attributes, Cu...

lavish leaf Sep 26, 2023, 8:56 AM

#

hello

#

4 years ago i failed in senior secondary

#

can you tell me some valuable certifications that would make companies to overlook my gap and failure and still hire me

cyan raven Sep 26, 2023, 9:40 AM

#

lavish leaf 4 years ago i failed in senior secondary

#career-advice

cyan raven Sep 26, 2023, 9:41 AM

#

merry bramble The descriptor HOWTO has a whole section giving pure-Python equivalents of prope...

oh okay, thank you.

fervent pawn Sep 26, 2023, 3:38 PM

#

@fallen slate source reminder

fallen slateBOT Sep 26, 2023, 3:38 PM

#

Command: remind

Commands for managing your reminders.

Source Code

Go to GitHub

inland halo Sep 27, 2023, 2:37 PM

#

: guys is it necessary to build a team for online hackthons and competitions like machine learning projects on kaggle

cyan raven Sep 27, 2023, 3:46 PM

#

inland halo : guys is it necessary to build a team for online hackthons and competitions li...

#python-discussion

wanton aspen Sep 27, 2023, 9:16 PM

#

@feral island
I know its a really stupid question, but i got nervous a little bit, can you produce yourself?

feral island Sep 27, 2023, 9:20 PM

#

wanton aspen <@783088578363523104> I know its a really stupid question, but i got nervous a ...

No need to be nervous, questions is what this place is for. Not sure what "produce yourself" means though, do you mean "introduce"?

wanton aspen Sep 27, 2023, 9:22 PM

#

feral island No need to be nervous, questions is what this place is for. Not sure what "produ...

Im sorry i just learned english in youtube anyway, yeah i mean that.

feral island Sep 27, 2023, 9:26 PM

#

wanton aspen Im sorry i just learned english in youtube anyway, yeah i mean that.

I'm Jelle Zijlstra, I work on several open source projects related to the Python language, and I answer questions in a few channels on this server sometimes

wanton aspen Sep 27, 2023, 9:28 PM

#

feral island I'm Jelle Zijlstra, I work on several open source projects related to the Python...

Thats good, nice to meet you.

steep walrus Sep 28, 2023, 2:15 AM

#

feral island I'm Jelle Zijlstra, I work on several open source projects related to the Python...

That's good!
Are you free?

cyan raven Sep 28, 2023, 7:33 PM

#

is it common to have peps accepted but still unimplemented?
or are they implemented immediately once accepted?

feral island Sep 28, 2023, 7:37 PM

#

cyan raven is it common to have peps accepted but still unimplemented? or its being impleme...

There can be a time delay between acceptance and implementation. Usually when a PEP is accepted there is at least a prototype implementation

cyan raven Sep 28, 2023, 7:39 PM

#

feral island There can be a time delay between acceptance and implementation. Usually when a ...

I mean, I was wondering whether it's possible to have peps accepted and still unimplemented. (if I were about to go back and check out every single pep)

feral island Sep 28, 2023, 7:41 PM

#

cyan raven I mean, I was wondering whether it's possible to have peps accepted and still un...

It's theoretically possible but I think no PEP is currently in that state. For PEP 649 and 703 the SC has said they'd accept the PEP but I think there's no formal acceptance yet for either

cyan raven Sep 28, 2023, 7:42 PM

#

feral island It's theoretically possible but I think no PEP is currently in that state. For P...

okay, thank you.

#

yes just checked out the current state of pep 649:

static hinge Sep 28, 2023, 8:39 PM

#

I once saw a PR for a pep before it was officially submitted. The PR was done by guido btw

cyan raven Sep 28, 2023, 8:57 PM

#

what is the hash algorithm that Python is using? Like the maths formula.

peak spoke Sep 28, 2023, 9:02 PM

#

hash for what?

cyan raven Sep 28, 2023, 9:03 PM

#

peak spoke hash for what?

I'm talking about the built-in hash.

cyan raven Sep 28, 2023, 9:03 PM

#

peak spoke hash for what?

https://docs.python.org/3.5/library/functions.html#hash

peak spoke Sep 28, 2023, 9:03 PM

#

different types implement it differently

steel solstice Sep 28, 2023, 9:04 PM

#

default hash?

cyan raven Sep 28, 2023, 9:05 PM

#

steel solstice default hash?

SHA1?

steel solstice Sep 28, 2023, 9:05 PM

#

no lol

cyan raven Sep 28, 2023, 9:06 PM

#

steel solstice no lol

not sure what default hash means in this context.

        attr_tuple = tuple(getattr(self, attr) for attr in type(self).__slots__)
        return hash(attr_tuple)

#

hash(...)

grave jolt Sep 28, 2023, 9:11 PM

#

as Gobot said, there's no single algorithm. Everyone is free to implement their own hash

cyan raven Sep 28, 2023, 9:12 PM

#

grave jolt as Gobot said, there's no single algorithm. Everyone is free to implement their ...

hmm, Return the hash value of the object (if it has one).

steel solstice Sep 28, 2023, 9:17 PM

#

i cant find it but it includes the id and stuff

feral island Sep 28, 2023, 9:21 PM

#

object.__hash__ is mostly the same as id(), right?

cyan raven Sep 28, 2023, 9:22 PM

#

feral island `object.__hash__` is mostly the same as id(), right?

you mean something like this

a = 10
print(id(a))

feral island Sep 28, 2023, 9:23 PM

#

cyan raven you mean something like this ``` a = 10 print(id(a)) ```

no, that will use int.__hash__ which just returns the value (for small ints)

cyan raven Sep 28, 2023, 9:23 PM

#

https://github.com/python/cpython/blob/main/Python/bltinmodule.c#L1600

fallen slateBOT Sep 28, 2023, 9:23 PM

#

Python/bltinmodule.c line 1600

```c
static PyObject *
```

feral island Sep 28, 2023, 9:23 PM

#

!e ```
o = object()
print(id(o))
print(hash(o))
```

fallen slateBOT Sep 28, 2023, 9:23 PM

#

@feral island :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 140255253185424
002 | 8765953324089

feral island Sep 28, 2023, 9:24 PM

#

guess not!

flat gazelle Sep 28, 2023, 9:24 PM

#

I think it's just divided by 16

cyan raven Sep 28, 2023, 9:24 PM

#

well, they are different.

flat gazelle Sep 28, 2023, 9:24 PM

#

!e

o = object()
print(id(o))
print(hash(o) * 16)

fallen slateBOT Sep 28, 2023, 9:24 PM

#

@flat gazelle :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 140145685562256
002 | 140145685562256

cyan raven Sep 28, 2023, 9:25 PM

#

flat gazelle !e ```py o = object() print(id(o)) print(hash(o) * 16) ```

🙂

flat gazelle Sep 28, 2023, 9:25 PM

#

(doesn't always work since the hash can end up negative - not sure what the circumstance would be, but I did just do it)
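
Both the "divided by 16" observation and the occasional negative value are consistent with a rotate, not a divide. The sketch below assumes CPython hashes an object pointer by rotating the address right by 4 bits (that is how `_Py_HashPointer` appears to work; treat it as an assumption and an implementation detail). `pointer_hash` is a made-up helper name, and the two sample addresses are the ones from the eval outputs above.

```python
import struct

POINTER_BITS = struct.calcsize("P") * 8  # pointer width of this build


def pointer_hash(address, bits=POINTER_BITS):
    """Rotate `address` right by 4 bits and reinterpret it as a signed integer."""
    mask = (1 << bits) - 1
    rotated = ((address >> 4) | (address << (bits - 4))) & mask
    if rotated >= 1 << (bits - 1):
        rotated -= 1 << bits  # top bit set, so the signed value is negative
    return -2 if rotated == -1 else rotated  # -1 is reserved for errors


# Addresses aligned to 16 bytes have four zero low bits, so the rotate
# looks like a plain division by 16.
print(pointer_hash(140145685562256, 64))
print(pointer_hash(140255253185424, 64))

# If any of the low 4 bits are set they rotate into the top bits,
# which can make the result negative.
print(pointer_hash(0x8, 64))

o = object()
print(id(o), hash(o), pointer_hash(id(o)))
```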

cyan raven Sep 28, 2023, 9:26 PM

#

cyan raven https://github.com/python/cpython/blob/main/Python/bltinmodule.c#L1600

not sure if this is what im looking for: https://github.com/python/cpython/blob/main/Objects/object.c#L878

fallen slateBOT Sep 28, 2023, 9:26 PM

#

Objects/object.c line 878

```c
PyObject_Hash(PyObject *v)
```

flat gazelle Sep 28, 2023, 9:28 PM

#

There is only one (non security/digest-related) hash algo that is actually part of python rather than an implementation detail of CPython, and that is the numeric hash - https://docs.python.org/3/library/stdtypes.html#hashing-of-numeric-types

Python documentation

Built-in Types

The following sections describe the standard types that are built into the interpreter. The principal built-in types are numerics, sequences, mappings, classes, instances and exceptions. Some colle...
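
A quick sketch of what that documented rule buys you: numbers that compare equal hash equal across `int`, `float`, `Fraction` and `Decimal`, and the hash of a non-negative integer is its value reduced modulo `sys.hash_info.modulus`.

```python
import sys
from decimal import Decimal
from fractions import Fraction

# Equal numeric values hash the same regardless of type.
same_one = hash(1) == hash(1.0) == hash(Fraction(1, 1)) == hash(Decimal("1"))
same_half = hash(0.5) == hash(Fraction(1, 2)) == hash(Decimal("0.5"))
print(same_one, same_half)

# For non-negative ints the hash is the value modulo a fixed prime.
modulus = sys.hash_info.modulus
big = 10**30
print(hash(big) == big % modulus)
```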

raven lark Sep 28, 2023, 9:39 PM

#

!res

fallen slateBOT Sep 28, 2023, 9:39 PM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

static hinge Sep 28, 2023, 9:49 PM

#

Here's a common hash function used in Java. ```java
int hash = 7;
hash = 31 * hash + (int) id;
hash = 31 * hash + (name == null ? 0 : name.hashCode());
hash = 31 * hash + (email == null ? 0 : email.hashCode());
return hash;
```

trim merlin Sep 29, 2023, 1:19 PM

#

I'd like to propose implementing __instancecheck__ for types.GenericAlias and types.TypeAliasType. Has there already been a discussion about this before?

dusk comet Sep 29, 2023, 1:23 PM

#

it is impossible to check that some list is an instance of list[int]

static hinge Sep 29, 2023, 2:51 PM

#

It would be an O(n) operation

#

from typing import Any, TypeGuard
def is_int_list(value: Any) -> TypeGuard[list[int]]:
    return isinstance(value, list) and all(isinstance(item, int) for item in value)

dusk comet Sep 29, 2023, 2:52 PM

#

static hinge It would be an O(n) operation

no, it is not
it is literally impossible to check

static hinge Sep 29, 2023, 2:52 PM

#

of course it wouldn't stop you from adding some non-int to the list at some other point.

#

and it would probably break if the list is empty

dusk comet Sep 29, 2023, 2:53 PM

#

even adding ints to list[int] isn't always safe

static hinge Sep 29, 2023, 2:54 PM

#

What if you had ```py
class IntList(list[int]): pass
```

dusk comet Sep 29, 2023, 2:55 PM

#

then [0] is not an instance of IntList, so it is useless

static hinge Sep 29, 2023, 2:55 PM

#

what about list[int] in cls.__orig_bases__?

dusk comet Sep 29, 2023, 2:56 PM

#

dusk comet then [0] is not an instance of IntList, so it is useless

this same problem

trim merlin Sep 29, 2023, 3:37 PM

#

dusk comet even adding ints to list[int] isn't always safe

creating an object of list[int] returns a regular list object anyway, so that's not really a thing

#

but what's the problem with an O(n) impl in isinstance? why do you say it's literally impossible

feral island Sep 29, 2023, 3:38 PM

#

trim merlin creating an object of `list[int]` returns a regular list object anyway, so that'...

you can't know whether an empty list is a list[int] at runtime
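
A short sketch of the current runtime behaviour being discussed (Python 3.9+ for `list[int]`): instantiating the alias gives a plain `list` that carries no element type, and `isinstance` refuses the parameterized generic outright.

```python
alias = list[int]

made = alias()
print(type(made) is list)  # the alias does not create a distinct type
print(made == [])          # and the empty result is just an empty list

try:
    isinstance([], alias)
except TypeError as exc:
    message = str(exc)
    print("TypeError:", message)
```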

trim merlin Sep 29, 2023, 3:39 PM

#

feral island you can't know whether an empty list is a `list[int]` at runtime

think about it as runtime type validation. will protobuf accept an empty list for list[int]? it will.

isinstance() has always been about runtime type validation

#

this usecase is definitely not for static typing, even though it might help in type narrowing

misty oxide Sep 29, 2023, 3:47 PM

#

I'm doing bytecode analysis (3.10), and I'm trying to figure out the number of positional args vs kwargs in a CALL_FUNCTION_KW. Is there any way to determine this statically, or do I need to know the runtime value of the kwargs tuple? Will cpython ever generate this instruction when the kwargs tuple is not static and easily inferrable?

flat gazelle Sep 29, 2023, 3:48 PM

#

trim merlin think about it as runtime type validation. will protobuf accept an empty list fo...

Protobuf will copy the list, so it can do the check fine. In a regular program, some other piece of code could also refer to the list, and at any point after the check add a str to it

trim merlin Sep 29, 2023, 3:49 PM

#

flat gazelle Protobuf will copy the list, so it can do the check fine. In a regular program, ...

yeah, I'm not saying that the list object should be a list[str] object, I'm just saying that isinstance(mylist, list[str]) should work at that given point in time, for any list

#

this makes sense for new type aliases as well:

class Item: ...
type Items = list[Item]

...

isinstance(mylist, Items)

flat gazelle Sep 29, 2023, 3:57 PM

#

```py
def f(l, cb):
    if isinstance(l, list[int]):
        cb()
        for i in l:
            print(i + 2)
x = [1]
f(x, lambda: x.append('a'))
```
I do not think it is all that sensible to have an each-element check sort of default as a type check. If you do need an each-element check, you should just use an each-element check, but it is not always correct (consider taking it as a ctor argument and using the field), and I would argue it is not a sane default.

static hinge Sep 29, 2023, 4:13 PM

#

Maybe limit it to Sequence[int]

swift imp Sep 29, 2023, 4:23 PM

#

flat gazelle ```py def f(l, cb): if isinstance(l, list[int]): cb() for i ...

Maybe there can be a new builtin that does that sort of thing.

Sugar around isinstance(x, list) and all(isinstance(el, int) for el in x)

static hinge Sep 29, 2023, 4:25 PM

#

best to make it isinstance(x, Sequence) and (len(x) == 0 or all(isinstance(el, int) for el in x))

#

actually, all([]) => True
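
Put together as a sketch (the helper name `is_sequence_of` is made up; it is not a proposed builtin): the check is a point-in-time, O(n) scan, an empty sequence passes because `all()` of nothing is true, and a later mutation can invalidate the answer.

```python
from collections.abc import Sequence


def is_sequence_of(value, element_type):
    """Return True if value is a Sequence whose items are all element_type right now."""
    return isinstance(value, Sequence) and all(
        isinstance(el, element_type) for el in value
    )


numbers = [1, 2, 3]
checked_before = is_sequence_of(numbers, int)

numbers.append("a")  # anyone holding a reference can do this after the check
checked_after = is_sequence_of(numbers, int)

print(checked_before, checked_after)
print(is_sequence_of([], int))           # empty sequences always pass
print(is_sequence_of(iter([1]), int))    # a one-shot iterator is not a Sequence
```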

raven ridge Sep 29, 2023, 5:28 PM

#

trim merlin but what's the problem with an `O(n)` impl in isinstance? why do you say it's li...

People expect isinstance() to be O(1). Besides, even if you could check whether a given list is a list-of-int programmatically, you can't check whether a given iterable is an iterable-of-int programmatically, so this proposal would result in asymmetrical interfaces

trim merlin Sep 29, 2023, 5:37 PM

#

actually yeah, Iterable would be a problem as it could be exhausted. hm.

#

point taken, O(1) thing also is a fair assumption as i haven't seen isinstance implementations ever do a for loop

dusk comet Sep 29, 2023, 6:55 PM

#

isinstance is doing both loop and recursion (to iterate through all types in given type tuple, and it does recursion because tuples can be nested)

#

>>> isinstance(1, ((((), ()), str), str))
False
>>> isinstance(1, ())
False

trim merlin Sep 29, 2023, 7:23 PM

#

dusk comet isinstance is doing both loop and recursion (to iterate through all types in giv...

i meant to say __instancecheck__ implementations are generally O(1)

#

it's a reasonable assumption that isinstance(x, T) will be constant time

dusk comet Sep 29, 2023, 7:43 PM

#

how would you do isinstance(x, list[int]) ?
will it be equivalent to isinstance(x, list)?

urban sandal Sep 29, 2023, 7:55 PM

#

trim merlin I'd like to propose implementing `__instancecheck__` for `types.GenericAlias` an...

I'd rather __instancecheck__ not exist at all :) (this won't happen for pragmatic reasons and backwards compatibility, among a lot else)

I think rather than having this, it would have been better for a builtin to be added that can determine if things are structurally equivalent even if not nominally a subclass for runtime structural subtyping, but that ship has long sailed.

> this usecase is definitely not for static typing, even though it might help in type narrowing
It won't. If you look at pytype, much stronger inference than this can already be done should a type checker choose to.

The runtime use of this can't be any better than exhausting the iterator (as was already said by others) and with such a cost attached, people are free to do it themselves, but I agree with others that hiding a cost in there for iterables when most people won't need it at runtime isn't ideal.

magic zodiac Sep 29, 2023, 8:44 PM

#

Is it a good idea to join internships on LinkedIn from small tech companies provided for free with small projects to showcase? Mostly these small tech companies are Indian
E.g
Meriskill
Info aid tech
Code samurai
Bharat intern
Etc

feral cedar Sep 29, 2023, 8:48 PM

#

magic zodiac Is it a good idea to join internships on LinkedIn from small tech companies prov...

#career-advice?

magic zodiac Sep 29, 2023, 8:49 PM

#

They were already having a discussion, didn't want to interrupt there in the middle of it

misty oxide Sep 29, 2023, 9:07 PM

#

misty oxide I'm doing bytecode analysis (3.10), and I'm trying to figure out the number of k...

Does anybody know if bytecode could ever be generated by CPython for the kwargs tuple in CALL_FUNCTION_KW, where the kwargs tuple is not in co_consts?

#

The same question applies to later versions, and to CALL in 3.12+.

#

Is there any situation where the kwargs are not statically known, other than fn(**kwargs), which uses a different instruction?
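One way to probe this empirically is to compile a few call expressions and inspect the result. This is only a sketch: the helper names are hypothetical, and a handful of samples does not prove the property for every call the compiler can emit.

```py
import dis


def keyword_names_in_consts(source, names):
    """Compile a call expression and report whether the tuple of keyword
    names is stored in the code object's co_consts."""
    code = compile(source, "<example>", "eval")
    return tuple(names) in code.co_consts


def call_opnames(source):
    """Return the names of the call-related instructions in the compiled expression."""
    code = compile(source, "<example>", "eval")
    return [ins.opname for ins in dis.get_instructions(code) if ins.opname.startswith("CALL")]
```

For example, `keyword_names_in_consts("f(a=1, b=2)", ("a", "b"))` checks the static case, while `call_opnames("f(**kwargs)")` shows which instruction the running interpreter uses for the dynamic case.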

ripe tinsel Sep 29, 2023, 9:17 PM

#

Feature proposal: async import

I'm busy polishing code for production (desktop application) and I am using asynchronous functions to load some of the heavier libraries with minimal impact on startup time. Multithreading does not provide significant time advantages over async in my case, the main hurdle is the linear flow which has plenty of waiting time downstream. It occurs to me that this is probably a common problem, and a common solution might be useful for the greater community.

I propose an "async import" function that is a built-in function or part of the asyncio library. The syntax could be something like pd = async import pandas (equivalent to "import pandas as pd"), where "async" acts almost like a decorator, but defines and instantiates this function instead:

async def import_function(package): 
    import package as x
    return x

grave jolt Sep 29, 2023, 9:19 PM

#

well, not exactly this function

spark magnet Sep 29, 2023, 9:19 PM

#

when would you be able to use pandas after that?

grave jolt Sep 29, 2023, 9:19 PM

#

grave jolt well, not _exactly_ this function

(because it will block and act exactly as import package as x)

ripe tinsel Sep 29, 2023, 9:20 PM

#

When you return a value that was imported, it gets assigned to the namespace you chose.

grave jolt Sep 29, 2023, 9:21 PM

#

I mean, you can do this:
```py
pd = await asyncio.to_thread(__import__, "pandas")
```

ripe tinsel Sep 29, 2023, 9:21 PM

#

spark magnet when would you be able to use `pandas` after that?

I use pandas as an example, since it is often imported as pd. You can then call it as pd at any time

spark magnet Sep 29, 2023, 9:21 PM

#

ripe tinsel I use pandas as an example, since it is often imported as pd. You can then call ...

but the import might not be done yet, right?

feral island Sep 29, 2023, 9:22 PM

#

ripe tinsel I use pandas as an example, since it is often imported as pd. You can then call ...

You aren't really making clear how your proposal is different from just import pandas as pd

ripe tinsel Sep 29, 2023, 9:22 PM

#

grave jolt I mean, you can do this: ```py pd = await asyncio.to_thread(__import__, "pandas"...

Looks interesting, will read up on that. Thank you

grave jolt Sep 29, 2023, 9:22 PM

#

I suppose importing a package can be expensive. But is this expense from reading from the disk? Or is it from the CPU-bound work of just executing a lot of code?

#

In any case, I think the problem of UI taking too long to start is solved by running the UI in a separate thread

ripe tinsel Sep 29, 2023, 9:25 PM

#

feral island You aren't really making clear how your proposal is different from just `import ...

The software stops until the import statement returns a value. With a library like sentence_transformers this takes a second or two. With multiple libraries that take a second it adds up. Very few of these libraries are needed until the user actually does something.

My greatest pain in python is making the client wait for libraries to load. Async works really well for that, but @brittle mantle gave a great response so I will use that instead.

grave jolt Sep 29, 2023, 9:26 PM

#

that's not a great response if you haven't measured what is slow in the importing

#

Is it disk I/O or is it compiling and running Python code (which is CPU-bound)?

#

If it's CPU-bound, then you have no choice but to wait for the import to finish. If you use threading you can at least interleave this CPU-bound work with the work in your UI thread

#

I guess to_thread kinda does this

#

The main problem with adding a whole language feature for this is: async/await is not coupled to a particular loop implementation, be it asyncio or trio. So how do you decide what to use for the I/O?

cyan raven Sep 29, 2023, 9:33 PM

#

How does the dataclasses standard lib make sure that the fields are using type annotations?
Could someone link me to that part in the source code?

grave jolt Sep 29, 2023, 9:34 PM

#

cyan raven How does the `dataclasses` standard lib make sure that the fields are using type...

https://github.com/python/cpython/blob/main/Lib/dataclasses.py#L970

fallen slateBOT Sep 29, 2023, 9:35 PM

#

Lib/dataclasses.py line 970

```py
cls_annotations = inspect.get_annotations(cls)
```

cyan raven Sep 29, 2023, 9:38 PM

#

grave jolt https://github.com/python/cpython/blob/main/Lib/dataclasses.py#L970

thank you

ripe tinsel Sep 29, 2023, 9:47 PM

#

grave jolt Is it disk I/O or is it compiling and running Python code (which is CPU-bound)?

In the context of a GUI-based app, the underlying code is often written in C (I think matplotlib is based on Matlab). Matplotlib specifically appears to run in its own process that interacts with the GUI package through a backend. In my specific case I want to delay imports until matplotlib does its thing (which is why multithreading doesn't differ much from async implementations).

Some imports have to happen linearly, but when an import can be delayed then there is actually very little literature on that. I've looked at lazy import implementations, but async seemed to work better.

I also have a case where a function has an import inside it, but the return value is cached. This solved a different problem though, since the client has to wait for the data to be processed anyway if they input new data. However, I mention it because it was another workaround for the long import problem.
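One reading of that workaround, sketched under assumptions: the import is deferred into a function and the result is cached. `lazy_module` and `mean` are hypothetical names, and `statistics` stands in for a heavy dependency such as pandas.

```py
import functools
import importlib


@functools.cache
def lazy_module(name):
    """Import the module on first use; later calls return the cached module object."""
    return importlib.import_module(name)


def mean(values):
    # "statistics" stands in for a heavy dependency such as pandas.
    stats = lazy_module("statistics")
    return stats.mean(values)
```

The import cost is paid on the first call to `mean` rather than at startup. Python already caches imported modules in `sys.modules`, so the explicit cache mostly saves the repeated lookup.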

But thanks for your advice, the code you suggested was exactly what I was looking for.

deft pagoda Sep 29, 2023, 9:50 PM

#

I find this slightly annoying, possibly inconsistent that:

>>> tuple(range(3))
(0, 1, 2)

but

>>> from typing import NamedTuple
>>> class MyTuple(NamedTuple):
...     a: int
...     b: int
...     c: int
... 
>>> MyTuple(range(3))
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: MyTuple.__new__() missing 2 required positional arguments: 'b' and 'c'

I know this can be fixed with unpacking, e.g.,:

>>> MyTuple(*range(3))
MyTuple(a=0, b=1, c=2)

But it gets really ugly with generator expressions:

>>> MyTuple(i for i in range(3))
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
TypeError: MyTuple.__new__() missing 2 required positional arguments: 'b' and 'c'
>>> MyTuple(*(i for i in range(3)))
MyTuple(a=0, b=1, c=2)

Is there some given reason why NamedTuples can't be constructed from iterables?

feral island Sep 29, 2023, 9:52 PM

#

deft pagoda I find this slightly annoying, possibly inconsistent that: ```py >>> tuple(range...

It's a weird inconsistency, yes, but I don't think there's an alternative. If you interpret a single argument to the NamedTuple constructor as an iterable, you'll run into weird edge cases with single-element NamedTuples
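A sketch of the edge case, using a made-up single-field class `Batch`:

```py
from typing import NamedTuple


class Batch(NamedTuple):
    items: list


# Today this is unambiguous: the list is the value of the single field.
batch = Batch([1, 2, 3])
```

If a lone argument were instead treated as an iterable of field values, the same call would be trying to spread three values over one field.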

cyan raven Sep 29, 2023, 9:56 PM

#

deft pagoda I find this slightly annoying, possibly inconsistent that: ```py >>> tuple(range...

wondering what you'd get from this: https://github.com/brettcannon/record-type/tree/main

deft pagoda Sep 29, 2023, 9:57 PM

#

since they changed initialization from tuple([1, 2, 3]) to MyTuple(1, 2, 3) i guess there's nothing to be done about it

cyan raven Sep 29, 2023, 9:58 PM

#

deft pagoda since they changed initialization from `tuple([1, 2, 3])` to `MyTuple(1, 2, 3)` ...

nvm, you'd get the same thing.

deft pagoda Sep 29, 2023, 10:03 PM

#

maybe they should add a .from_iterable class method

cyan raven Sep 29, 2023, 10:04 PM

#

deft pagoda maybe they should add a `.from_iterable` class method

yes, there might be a solution since it's a really fresh idea.
You might open an issue on the GitHub page if you want.
I can do it as well, I'm personally interested.

feral island Sep 29, 2023, 10:05 PM

#

deft pagoda maybe they should add a `.from_iterable` class method

Why add that when * unpacking already works?

deft pagoda Sep 29, 2023, 10:06 PM

#

because it's uglier, especially in front of generator expressions

#

prefer:

MyTuple.from_iterable(i for i in range(3))

over

MyTuple(*(i for i in range(3)))

itertools.chain has something similar: chain.from_iterable
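For reference, classes created with typing.NamedTuple or collections.namedtuple already have a `_make` classmethod that accepts any iterable, which covers much of what a `.from_iterable` method would do. A short sketch, alongside the itertools precedent:

```py
from itertools import chain
from typing import NamedTuple


class MyTuple(NamedTuple):
    a: int
    b: int
    c: int


# _make is the existing classmethod that builds an instance from any iterable.
from_generator = MyTuple._make(i for i in range(3))

# The itertools precedent: chain(*iterables) vs chain.from_iterable(iterables).
flattened = list(chain.from_iterable([[0, 1], [2]]))
```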

ripe tinsel Sep 29, 2023, 10:32 PM

#

grave jolt I mean, you can do this: ```py pd = await asyncio.to_thread(__import__, "pandas"...

A syntax like "async import" would be a convenience, but this works so it is already a feature.

pd = asyncio.run(asyncio.to_thread(__import__, "pandas"))

grave jolt Sep 29, 2023, 10:33 PM

#

but that would be hardcoding asyncio as the loop implementation to use

feral island Sep 29, 2023, 10:33 PM

#

ripe tinsel A syntax like "async import" would be a convenience, but this works so it is alr...

That line doesn't really do anything over import pandas as pd. You need to actually make the code async so that it can do other work while the import is running
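A minimal sketch of what "making the code async" could look like, assuming asyncio as the event loop: start the import as a task, do other work, and await the result only when the module is needed. `decimal` is a stand-in for a slow import such as pandas, and the short sleep is a placeholder for real work.

```py
import asyncio
import importlib


async def main():
    # Start the import in a worker thread without waiting for it yet.
    # "decimal" is a stand-in for a slow import such as pandas.
    import_task = asyncio.create_task(asyncio.to_thread(importlib.import_module, "decimal"))

    # Other coroutines can run here while the import proceeds.
    await asyncio.sleep(0.01)  # placeholder for real work (network request, UI setup, ...)

    # Block only at the point where the module is actually needed.
    return await import_task


module = asyncio.run(main())
```

If the import is CPU-bound, the worker thread still competes with the event loop thread for the GIL, so the gain depends on how much of the import time is spent waiting on I/O.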

grave jolt Sep 29, 2023, 10:33 PM

#

I mean, this could be considered, but that's a major change. For very little gain IMO

#

And yes, this exact line does the same as import pandas as pd

ripe tinsel Sep 29, 2023, 10:38 PM

#

feral island That line doesn't really do anything over `import pandas as pd`. You need to act...

According to this website, .to_thread() creates a coroutine that executes in a separate thread from the main thread when awaited (this syntax gives me an error which is why I use asyncio.run() to execute the coroutine.)

https://superfastpython.com/asyncio-to_thread/

feral island Sep 29, 2023, 10:38 PM

#

ripe tinsel According to this website, .to_thread() creates a coroutine that executes in a s...

yes, but then you are blocking on the result

#

so the end result is basically the same: you are blocking waiting for the import to finish

#

the actual import happens in a separate thread, but I don't see how that helps you

grave jolt Sep 29, 2023, 10:43 PM

#

Yeah, like asyncio.run(asyncio.sleep(5)) is exactly the same as time.sleep(5)
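A quick way to check this is to time both calls. This is only a sketch with a shortened duration, and `timed` is a hypothetical helper:

```py
import asyncio
import time


def timed(func):
    """Return how many seconds func() takes to run."""
    start = time.perf_counter()
    func()
    return time.perf_counter() - start


blocking = timed(lambda: time.sleep(0.2))
via_asyncio = timed(lambda: asyncio.run(asyncio.sleep(0.2)))
```

Both measurements should come out at roughly 0.2 seconds, because `asyncio.run()` does not return until the coroutine it was given has finished.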

winged sphinx Sep 29, 2023, 10:44 PM

#

ripe tinsel A syntax like "async import" would be a convenience, but this works so it is alr...

I have a similar problem/workflow, with a 3-4 second import delay (total) after pruning and adding lazy imports where helpful (reviewed with -X importtime). This kills me on multiprocessing, where every process ends up with another 3-4 sec delay. Pandas is one of my worst cases. In my particular use case, there's a few points where this async solution actually would be helpful: where we're waiting on data (via async apis/requests), but before we need the rest of the stack (pandas, charting libraries, pyarrow, etc).

ripe tinsel Sep 29, 2023, 10:44 PM

#

feral island so the end result is basically the same: you are blocking waiting for the import...

In my experience, async only blocks if the await keyword is used. If you use asyncio.run() on an async function where there is no await keyword, my experience is that it behaves like multithreading provided you are not doing heavy calculations (but I can draw scatterplots and pie charts simultaneously with word clouds using async without await, and that feels pretty instantaneous to me)

feral island Sep 29, 2023, 10:45 PM

#

#internals-and-peps
