feral cedar Dec 1, 2022, 2:27 AM

#

oh. i've never seen anyone use generator iterator

raven ridge Dec 1, 2022, 2:41 AM

#

ah, that's not even the one I was thinking of - I was thinking of async def coroutines vs @types.coroutine generator coroutines

feral cedar Dec 1, 2022, 2:42 AM

#

i've never even heard of the latter lemon_eyes

raven ridge Dec 1, 2022, 2:42 AM

#

!d types.coroutine

fallen slateBOT Dec 1, 2022, 2:42 AM

#

types.coroutine


types.coroutine(gen_func)```
This function transforms a [generator](https://docs.python.org/3/glossary.html#term-generator) function into a [coroutine function](https://docs.python.org/3/glossary.html#term-coroutine-function) which returns a generator-based coroutine. The generator-based coroutine is still a [generator iterator](https://docs.python.org/3/glossary.html#term-generator-iterator), but is also considered to be a [coroutine](https://docs.python.org/3/glossary.html#term-coroutine) object and is [awaitable](https://docs.python.org/3/glossary.html#term-awaitable). However, it may not necessarily implement the `__await__()` method.

If *gen\_func* is a generator function, it will be modified in-place.

If *gen\_func* is not a generator function, it will be wrapped. If it returns an instance of [`collections.abc.Generator`](https://docs.python.org/3/library/collections.abc.html#collections.abc.Generator "collections.abc.Generator"), the instance will be wrapped in an *awaitable* proxy object. All other types of objects will be returned as is.

New in version 3.5.

feral cedar Dec 1, 2022, 2:43 AM

#

~~huh. is this how they did async before async and await?~~

#

oh wait, async and await were 3.4 right

raven ridge Dec 1, 2022, 2:45 AM

#

generators were the legacy way of doing coroutines. @types.coroutine is a way to bridge the gap between those legacy generator coroutines and the modern async def ones by adapting them to look and behave more like the modern ones

feral cedar Dec 1, 2022, 2:48 AM

#

i see. so you would just add that and you could use await on them

raven ridge Dec 1, 2022, 2:52 AM

#

yep

gray galleon Dec 1, 2022, 4:47 AM

#

raven ridge generators were the legacy way of doing coroutines. `@types.coroutine` is a way ...

i thought async coroutines are implemented using generators

raven ridge Dec 1, 2022, 4:50 AM

#

gray galleon i thought async coroutines are implemented using generators

no, though they use similar machinery under the hood.

#

https://peps.python.org/pep-0492/#differences-from-generators

PEP 492 – Coroutines with async and await syntax | peps.python.org

Python Enhancement Proposals (PEPs)

elder blade Dec 1, 2022, 9:32 AM

#

feral cedar i see. so you would just add that and you could use `await` on them

You also still need this for the bottom trap of every single async event loop. Remember: await becomes yield from - not yield - and doesn't actually yield anything until one of the coroutines (generators) hits a yield point.

Event loops needs this so that they can actually yield so usually you have some kind of wrapper function like: ```python
@types.coroutine
def trap(value):
return (yield value)

brave ore Dec 1, 2022, 2:18 PM

#

I second this. It will render a lot of the school manuals that many 10 year olds read completely useless, though.

rose schooner Dec 1, 2022, 2:36 PM

#

you can always use it as a variable name and ignore the style guidelines for it

#

what does annoy a little is whenever syntax highlighters highlight it

halcyon trail Dec 1, 2022, 3:13 PM

#

rose schooner you can always use it as a variable name and ignore the style guidelines for it

i know you can, I just genuinely think it's a bad idea

#

maybe now with mypy it's less of an issue since you'd probably get pretty bad mypy errors immediately when you make a mistake of that kind

#

in the past I remember genuinely wasting some time on it though

lavish glen Dec 1, 2022, 9:01 PM

#

Hello guys, i have a few questions:
is setup.py still used?
do you have to use setup.py to build a wheel?
is poetry viable? or can i still stick to pip for my packages?
is there a fully-comprehensive guide on how to use pip to distribute packages?
do you have to hand code the .toml?

I have a lot of confusion...

open question at:
https://discord.com/channels/267624335836053506/1047961312451371059

sacred yew Dec 1, 2022, 9:07 PM

#

lavish glen Hello guys, i have a few questions: is setup.py still used? do you have to use s...

#tools-and-devops

gray galleon Dec 2, 2022, 4:38 AM

#

!e ```py
import dis

dis.dis("f(1, a=1, b=2)")

fallen slateBOT Dec 2, 2022, 4:40 AM

#

@gray galleon :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 |   0           0 RESUME                   0
002 | 
003 |   1           2 PUSH_NULL
004 |               4 LOAD_NAME                0 (f)
005 |               6 LOAD_CONST               0 (1)
006 |               8 LOAD_CONST               0 (1)
007 |              10 LOAD_CONST               1 (2)
008 |              12 KW_NAMES                 2
009 |              14 PRECALL                  3
010 |              18 CALL                     3
011 |              28 RETURN_VALUE

gray galleon Dec 2, 2022, 4:40 AM

#

what does KW_NAMES and PRECALL do

#

!e ```py
import dis

dis.dis("f(1, a=1, b=2)")

fallen slateBOT Dec 2, 2022, 4:41 AM

#

@gray galleon :white_check_mark: Your 3.10 eval job has completed with return code 0.

001 |   1           0 LOAD_NAME                0 (f)
002 |               2 LOAD_CONST               0 (1)
003 |               4 LOAD_CONST               0 (1)
004 |               6 LOAD_CONST               1 (2)
005 |               8 LOAD_CONST               2 (('a', 'b'))
006 |              10 CALL_FUNCTION_KW         3
007 |              12 RETURN_VALUE

gray galleon Dec 2, 2022, 4:42 AM

#

somehow the keyword names can’t be seen in python 3.11

unkempt rock Dec 2, 2022, 5:08 AM

#

Is there a tool I can use to see the PVM code emitted from a python script

rich cradle Dec 2, 2022, 5:08 AM

#

pvm? the virtual machine?

#

the dis standard library module can show you the bytecode

pliant tusk Dec 2, 2022, 5:28 AM

#

gray galleon somehow the keyword names can’t be seen in python 3.11

im pretty sure dis just doesnt know where to look for them yet

#

they are still located in the co_consts tuple

#

https://github.com/python/cpython/blob/3.11/Lib/dis.py#L364 yea dis just checks if the op is LOAD_CONST

fallen slateBOT Dec 2, 2022, 5:34 AM

#

Lib/dis.py line 364

if op == LOAD_CONST:```

gray galleon Dec 2, 2022, 5:45 AM

#

gray galleon what does KW_NAMES and PRECALL do

.

pliant tusk Dec 2, 2022, 5:53 AM

#

gray galleon somehow the keyword names can’t be seen in python 3.11

just drafted a pull request to fix this

gray galleon Dec 2, 2022, 5:58 AM

#

what does KW_NAMES and PRECALL do
does KW_NAMES create internal pair data types that will be used as keyword args
if so then how about PRECALL

prime estuary Dec 2, 2022, 10:21 AM

#

gray galleon what does KW_NAMES and PRECALL do does KW_NAMES create internal pair data types ...

KW_NAMES indeed sets a variable to the tuple of names, which is then consumed by CALL_*. That way the location of the constants can be a separate oparg to the rest of the values. The reason PRECALL existed as part of the specialisation system. What it does is handle adjusting the arguments if a method is being loaded or if a class is being called, meaning CALL doesn't need to care about that, and can be specialised independently - there's currently 18 different call opcodes depending on what the callable is.

Actually, PRECALL is now gone in 3.12, it was judged as not being worth it.
https://github.com/python/cpython/pull/92925

gray galleon Dec 2, 2022, 10:24 AM

#

set what variable

#

just an internal variable not accessible by python?

rose schooner Dec 2, 2022, 10:32 AM

#

gray galleon set what variable

call_shape.kw_names ig

gray galleon Dec 2, 2022, 10:33 AM

#

dot is python’s way to make internal variables lol

#

like .0 is listcomp implicit argument

prime estuary Dec 2, 2022, 10:36 AM

#

It's a C local variable in the interpreter loop.

#

We know CALL is coming immediately after to use it.

#

Search for kwnames here: <https://github.com/python/cpython/blob/main/Python/bytecodes.c#L2876-L2881

fallen slateBOT Dec 2, 2022, 10:38 AM

#

Python/bytecodes.c lines 2876 to 2881

// stack effect: ( -- )
inst(KW_NAMES) {
    assert(kwnames == NULL);
    assert(oparg < PyTuple_GET_SIZE(consts));
    kwnames = GETITEM(consts, oparg);
}```

boreal umbra Dec 2, 2022, 10:34 PM

#

@unkempt rock I think dicts and sets share some of the same implementation, but I'm not entirely sure

#

hi wookie
do you get like pings when people send messages here?

sturdy timber Dec 2, 2022, 10:36 PM

#

I wasn't even typing, how did you know I was here 👀

boreal umbra Dec 2, 2022, 10:36 PM

#

did you accidentally start that typing event broadcast self-bot of yours?

sturdy timber Dec 2, 2022, 10:44 PM

#

I think it must be that if you have text in the message box for a channel whenever you go to that channel it says you're typing, that's a bit silly.

boreal umbra Dec 2, 2022, 11:23 PM

#

this channel has becomes internals-of-discord-client-and-peps

swift imp Dec 3, 2022, 12:26 AM

#

Lazy imports got rejected, I'm sad

boreal umbra Dec 3, 2022, 1:04 AM

#

why?

dusk comet Dec 3, 2022, 1:26 AM

#

!pep 690

fallen slateBOT Dec 3, 2022, 1:26 AM

#

**PEP 690 - Lazy Imports**

Link

Status

Draft

Python-Version

3.12

Created

29-Apr-2022

Type

Standards Track

dusk comet Dec 3, 2022, 1:27 AM

#

still a draft

boreal umbra Dec 3, 2022, 1:39 AM

#

where are pep decisions announced, anyway?

rich cradle Dec 3, 2022, 2:05 AM

#

the python-dev mailing list, i believe

native flame Dec 3, 2022, 3:28 AM

#

theres seems to have been some more support for hash(None) being a constant

gray galleon Dec 3, 2022, 4:17 AM

#

i didn’t know python 3.12 is already in development
like 3.11 were just released recently

raven ridge Dec 3, 2022, 4:19 AM

#

well, yeah - that's why 3.12 is under development.

#

any feature that didn't make it in before the 3.11 feature freeze would become part of 3.12

gray galleon Dec 3, 2022, 4:26 AM

#

so 3.12 is just a continuation of 3.11

rose schooner Dec 3, 2022, 4:27 AM

#

gray galleon i didn’t know python 3.12 is already in development like 3.11 were just released...

right as a new python major version is fully released, it branches and a new version continues the main branch

#

that's probably what they call a "feature freeze" or something

boreal umbra Dec 3, 2022, 4:45 AM

#

native flame theres seems to have been some more support for hash(None) being a constant

What is it currently?

#

!e print(hash(None))

fallen slateBOT Dec 3, 2022, 4:45 AM

#

@boreal umbra :white_check_mark: Your 3.11 eval job has completed with return code 0.

8790177024592

boreal umbra Dec 3, 2022, 4:45 AM

#

!e print(hash(None))

fallen slateBOT Dec 3, 2022, 4:45 AM

#

@boreal umbra :white_check_mark: Your 3.11 eval job has completed with return code 0.

8764759286608

boreal umbra Dec 3, 2022, 4:46 AM

#

Is it based on the id that None gets at interpreter start time?

native flame Dec 3, 2022, 4:48 AM

#

boreal umbra What is it currently?

its based on its id(), the default for all objects

raven ridge Dec 3, 2022, 5:11 AM

#

gray galleon so 3.12 is just a continuation of 3.11

sure - in the same way as the next version of any application is a continuation of the previous version. In the case of CPython there's a release schedule, so even though people always want to get new changes in, there's cutoff points once a year where they say that no new features will be added to the next version, so any new features start getting added to a branch for the version after that.

feral island Dec 3, 2022, 5:30 AM

#

boreal umbra where are pep decisions announced, anyway?

discuss.python.org nowadays

swift imp Dec 3, 2022, 11:34 AM

#

@boreal umbra https://discuss.python.org/t/pep-690-lazy-imports-again/19661/26?u=melendowski

Discussions on Python.org

PEP 690: Lazy Imports Again

Decision on PEP 690 - Lazy Imports The Python Steering Council has decided to reject PEP 690 on Lazy Imports. We agree with the widely accepted sentiment that faster Python startup time is desirable. Large command line tools in particular suffer as that is a human user experience. Lazy imports, as proposed, are one of many potential mechanism...

unkempt rock Dec 3, 2022, 11:53 AM

#

PEP 638 could make a way to simulate Perl's unless

b = False
unless! b: # same as `if not(b):`
  print("Good")

swift imp Dec 3, 2022, 12:53 PM

#

I don't want pep 638 to happen. I understand the point of macros in C to reduce verbosity but I really don't think they're needed in python

gray galleon Dec 3, 2022, 1:05 PM

#

cmon macros are cool

#

they also allow you to have dsls

umbral plume Dec 3, 2022, 1:19 PM

#

macros certainly sound cool, but i really don't think that they really suit python that well

#

one of the things that python is well known for is its really easy-to-read syntax - adding a new feature that allows developers to essentially create their own syntax is an easy way to reduce that readability greatly

grave jolt Dec 3, 2022, 1:22 PM

#

Rust macros are pretty cool. But it's a very different language

umbral plume Dec 3, 2022, 1:22 PM

#

like in the PEP, they give an example of creating a macro that simultaneously creates a dict of k:v pairs, as well as a dict consisting of the inverse v:k pairs: py bijection! color_to_code, code_to_color: "red" = 1 "blue" = 2 "green" = 3 already, i'm not a fan of macros enabling writing stuff like "red" = 1, seemingly assigning a literal to another literal

grave jolt Dec 3, 2022, 1:23 PM

#

Yeah you can just do that with a function

#

not sure why macros are needed for that

broken sluice Dec 3, 2022, 1:26 PM

#

main use of macros is for something like:

if DEBUG_MODE:
    assert <expensive-to-check-invariant>, ...

I don't want this to have any cost if DEBUG_MODE is false.

umbral plume Dec 3, 2022, 1:26 PM

#

the one compelling argument is saying that certain libraries like numba would benefit from being directly given the AST of a function, rather than its bytecode (which is apparently what happens right now when you use a numba JIT decorator), which is something macros would enable, but i don't know enough about those types of libraries to really say much on the matter

grave jolt Dec 3, 2022, 1:26 PM

#

__debug__ already exists

#

well, you can already get the AST of a function by fetching its code with inspect and parsing it with ast

grave jolt Dec 3, 2022, 1:27 PM

#

grave jolt `__debug__` already exists

...also, asserts don't run if you use the -O flag

flat gazelle Dec 3, 2022, 1:28 PM

#

There is an argument to be made that @dataclass et al make more sense as macros. But whether that's enough of a merit to pile on even more complexity, IDTS. match had the strong, compelling usecase of argument normalisation, and even then it only barely got in.

#

I don't think there is enough bad python code which would be significantly improved with macros

broken sluice Dec 3, 2022, 1:29 PM

#

presumably you want some of your asserts to run in all cases and some only in "debug". But __debug__ does seem to be a solution, I didn't know about that.

flat gazelle Dec 3, 2022, 1:29 PM

#

For libraries like lark and kivy, external files or Multiline strings are IMO fine

grave jolt Dec 3, 2022, 1:30 PM

#

aside, you don't benefit from removing a simpleif check very often

#

unless it's some kind of very tight loop

umbral plume Dec 3, 2022, 1:31 PM

#

the PEP also doesn't really delve that deep into what an implementation of a macro looks like, like it'd be nice to see what their idea of an actual implementation of that bijection! macro looks like, for example

broken sluice Dec 3, 2022, 2:56 PM

#

Actually, disabling debug seems to remove all of your asserts, not just the "expensive" ones, so macros are still marginally better.
Anyway, it's a fairly minor point I agree, the cost of evaluating if DEBUG_MODE isn't going to be particularly large.

radiant garden Dec 3, 2022, 3:40 PM

#

less than the typical function call at any rate

gray galleon Dec 3, 2022, 6:22 PM

#

i think macros’ main function is making domain specific languages with pythonic syntax
like so ```py
recipe:
ingredients:
"ingredient 1"
"ingredient 2"
...

steps:
...

unkempt rock Dec 3, 2022, 6:30 PM

#

gray galleon i think macros’ main function is making domain specific languages with pythonic ...

don't forget that macro calls have ! at the end, so it would be

recipe!:
  ingredients!:
    "ingredient 1"
    "ingredient 2"
    ...

  steps!:
    ...

quick snow Dec 3, 2022, 6:31 PM

#

gray galleon i think macros’ main function is making domain specific languages with pythonic ...

You can get pretty close with existing syntax:

class ApplePie(Recipe):
    with ingredients:
        "Ingredient 1"
        "ingredient 2"

    with steps:
        ...

#

(Alternatively: lists, or functions inside)

unkempt rock Dec 3, 2022, 6:31 PM

#

most of the time, you need to extend a class, or use a with statement

quick snow Dec 3, 2022, 6:33 PM

#

You can even do:

steps: (
    "Do thing",
    "Do other thing",
)

final flame Dec 3, 2022, 10:03 PM

#

quick snow You can get pretty close with existing syntax: ```py class ApplePie(Recipe): ...

Wait. What feature allows you to access those strings in the with block?

feral island Dec 3, 2022, 10:10 PM

#

final flame Wait. What feature allows you to access those strings in the with block?

__prepare__

#

oh wait, that still won't give you strings

deft pagoda Dec 3, 2022, 11:46 PM

#

quick snow You can get pretty close with existing syntax: ```py class ApplePie(Recipe): ...

i can do:

class ApplePie(Recipe):
    __ingredients__
    Milk
    Eggs

using prepare

radiant garden Dec 3, 2022, 11:48 PM

#

class ApplePie(Recipe):
    Ingredients
    - Milk
    - Eggs
    
    a: "Preheat oven to 200 degrees"
    b: "Mince the garlic in a bowl"

final flame Dec 3, 2022, 11:50 PM

#

Is any library actually using it like this?

radiant garden Dec 3, 2022, 11:50 PM

#

Not for any practical purpose I don't think

#

I've used it for easy_z3 but that's more of a poc of the tech

gray galleon Dec 4, 2022, 3:31 AM

#

radiant garden ```py class ApplePie(Recipe): Ingredients - Milk - Eggs a: ...

wait how

feral island Dec 4, 2022, 3:33 AM

#

metaclass __prepare__ that returns a weird mapping

gray galleon Dec 4, 2022, 3:36 AM

#

tell me more about that

feral island Dec 4, 2022, 3:39 AM

#

gray galleon tell me more about that

the __prepare__ method on the metaclass returns the mapping object in the class namespace

#

so the Ingredients line does something like ns = metaclass.__prepare__(); ns["Ingredients"]

#

so your __prepare__ method has to return an object with a __getitem__ that records all the times it was called

#

and it has to return some object that implements __neg__ so the - Milk line works

#

then the a: "Preheat oven" line, that's a variable annotation

#

I think that gets compiled to something like ns["__annotations__"]["a"] = "Preheat oven"

#

so your weird mapping can record that too

#

I actually wrote a real use for this once, it lets you write serializable algebraic datatypes like ```class Maybe(ADT):
Nothing(tag=0)
Just(value=Any, tag=1)

#

https://github.com/JelleZijlstra/taxonomy/blob/4e2f4fc3aad25bbd5e64f728a834c2213133d58f/taxonomy/adt.py#L36

fallen slateBOT Dec 4, 2022, 3:45 AM

#

taxonomy/adt.py line 36

class _ADTNamespace(MutableMapping[str, Any]):```

quick snow Dec 4, 2022, 8:25 AM

#

final flame Wait. What feature allows you to access those strings in the with block?

!pypi dont :P

fallen slateBOT Dec 4, 2022, 8:25 AM

#

dont v0.1.0

Context manager base class that allows customizing execution of the contents

grave jolt Dec 4, 2022, 12:19 PM

#

feral island I actually wrote a real use for this once, it lets you write serializable algebr...

Hey, I had made something similar, kinda
https://gist.github.com/decorator-factory/b2fd85ef8248c9230835461c1ec24597
but more cursed, including meta-metaclasses

swift imp Dec 4, 2022, 10:39 PM

#

So I like pep695 but don't like the new soft keyword of type why not just make it typedef? https://discuss.python.org/t/pep-695-type-parameter-syntax/21646?u=melendowski

Discussions on Python.org

PEP 695: Type Parameter Syntax

PEP 695 is posted. It proposes to add an improved syntax for specifying type parameters within a generic class, function, or type alias. It also introduces a new statement for declaring type aliases. This PEP has already gone through several cycles of discussions in the typing-sig forum and public (virtual) meet-ups in the Python typing commun...

boreal umbra Dec 5, 2022, 1:15 AM

#

swift imp So I like pep695 but don't like the new soft keyword of `type` why not just make...

creating a new (hard) keyword breaks any code that happens to use it as a name

quick snow Dec 5, 2022, 7:01 AM

#

boreal umbra creating a new (hard) keyword breaks any code that happens to use it as a name

It could be typedef and still be soft

radiant garden Dec 5, 2022, 10:12 AM

#

typedef doesn't give off big python vibes

gray galleon Dec 5, 2022, 10:28 AM

#

radiant garden typedef doesn't give off big python vibes

who cares about not giving python vibes

quick snow Dec 5, 2022, 10:32 AM

#

Why is

type ListOrSet[T] = list[T] | set[T]

needed, couldn't it be

ListOrSet[T] = list[T] | set[T]

?

#

I do think type foo = bar looks very unlike Python, and more like C, Java, ...
Why not

ListOrSet[T]: type = list[T] | set[T]

?

radiant garden Dec 5, 2022, 10:36 AM

#

indexing an undefined variable

#

your choices are to either have implicit T (current solution), or define ListOrSet first (ugly solution), or use some other syntactic structure (such as the one proposed)

radiant garden Dec 5, 2022, 10:36 AM

#

gray galleon who cares about not giving python vibes

core developers typically

#

fwiw I'm not all sold on the soft keyword yet

#

something like ListOrSet: TypeAlias[T] = list[T] | set[T] would also work although a tiny bit verbose

#

I don't think type aliases are exactly a first-class citizen

quick snow Dec 5, 2022, 10:42 AM

#

ListOrSet: type[T] = list[T] | set[T]

radiant garden Dec 5, 2022, 10:43 AM

#

type[T] has different denotations, though

#

also this is difficult to read by static analyzers as a type alias (as opposed to a non-type assignment)

#

ListOrSet: _[T] = list[T] | set[T] BABAXD

#

I get the vibe that the new soft keyword is trying to sneak in via the merits of the rest of the pep :>

flat gazelle Dec 5, 2022, 11:04 AM

#

honestly, I am strongly doubtful something only usable for typechecking is getting into the python core

unkempt rock Dec 5, 2022, 6:22 PM

#

I'm not sure if this is the right place to ask but this is a strange question, and google isn't much help. I've run into a situation where certain frames of the call stack are "missing" in the debugger and the traceback. Has anyone see this before? This is a minimal example:

from sqlmodel import Field, SQLModel


class A(SQLModel, table=True):
    id: int = Field(primary_key=True)


a = A(id=5)

breakpoint()
a.dict()

When you step into the dict method. It actually skips several frames and jumps into a different function call deep in the call stack (namely _calculate_keys()). You don't actually see the dict method. You can force an error by changing the library code, and use pdb.post_mortem to trick your way into the missing frame. Pdb will tell you you're in the dict method, but if you use the longlist command, it says EOF, it can't find any code. What's going on here? For more context, dict is a pydantic method, it's a normal python method and isn't a C extension or anything weird as far as I can tell. (SQLModel inherits BaseModel).

grave jolt Dec 5, 2022, 7:50 PM

#

unkempt rock I'm not sure if this is the right place to ask but this is a strange question, a...

pydantic is compiled with Cython

#

(IIRC)

unkempt rock Dec 5, 2022, 10:20 PM

#

grave jolt pydantic is compiled with Cython

Oh wow, I didn't see any trace of Cython in the pydantic repo but it turns out you're right. It said so in the installation docs which I didn't bother reading. Should've RTFM. Thanks.

dusk comet Dec 6, 2022, 2:40 PM

#

Why (*a,) is 2 times slower than tuple(a)? ```js
C:\Users\denba>py -m timeit -s "x = [*range(10**6)]" "(*x,)"
20 loops, best of 5: 16.7 msec per loop

C:\Users\denba>py -m timeit -s "x = [*range(10**6)]" "(*x,)"
20 loops, best of 5: 16.9 msec per loop

C:\Users\denba>py -m timeit -s "x = [*range(10**6)]" "tuple(x)"
50 loops, best of 5: 8.64 msec per loop

C:\Users\denba>py -m timeit -s "x = [*range(10**6)]" "tuple(x)"
50 loops, best of 5: 8.46 msec per loop

feral island Dec 6, 2022, 2:43 PM

#

dusk comet Why `(*a,)` is 2 times slower than `tuple(a)`? ```js C:\Users\denba>py -m timeit...

  0           0 RESUME                   0

  1           2 BUILD_LIST               0
              4 LOAD_NAME                0 (x)
              6 LIST_EXTEND              1
              8 LIST_TO_TUPLE
             10 RETURN_VALUE

#

seems like it copies stuff twice: first into a list, then into a tuple

dusk comet Dec 6, 2022, 2:45 PM

#

Now i see. [*a] and list(a) takes same amount of time: ```js
C:\Users\denba>py -m timeit -s "x = [*range(10**6)]" "[*x]"
50 loops, best of 5: 8.54 msec per loop

C:\Users\denba>py -m timeit -s "x = [*range(10**6)]" "list(x)"
50 loops, best of 5: 8.58 msec per loop

elder blade Dec 6, 2022, 3:05 PM

#

feral island ```In [264]: dis.dis("(*x,)") 0 0 RESUME 0 1 ...

Oh wow this is suprisingly horrible

elder blade Dec 6, 2022, 3:06 PM

#

dusk comet Now i see. `[*a]` and `list(a)` takes same amount of time: ```js C:\Users\denba>...

That technically means that [*x] is slower as the alternative list(x) does a global lookup and function all on top of actually converting it to a list

gray galleon Dec 6, 2022, 3:19 PM

#

why do all comparison operators come from 1 instruction?
why not BINARY_GREATER, BINARY_EQUAL et cetera

grave jolt Dec 6, 2022, 7:02 PM

#

elder blade Oh wow this is suprisingly horrible

Actually in CPython there's no other way to build a tuple

feral island Dec 6, 2022, 7:03 PM

#

grave jolt Actually in CPython there's no other way to build a tuple

there is also BUILD_TUPLE

grave jolt Dec 6, 2022, 7:03 PM

#

🤔

feral island Dec 6, 2022, 7:03 PM

#

but that can't work with unpacking

grave jolt Dec 6, 2022, 7:04 PM

#

grave jolt Actually in CPython there's no other way to build a tuple

because a list looks like this:

(header) (length) (ptr)
                    |
                    V
                   [ ptr0 ptr1 ptr2 ptr3 ptr4 ...]

And in a tuple, the items are embedded directly into the memory location, kinda like this

(header) (length) (ptr0) (ptr1) (ptr2) ...

grave jolt Dec 6, 2022, 7:04 PM

#

feral island but that can't work with unpacking

yeah I meant a variable-length tuple

#

or rather, a non-known-in-advance

pliant tusk Dec 6, 2022, 7:06 PM

#

i wonder if there would be any speed benefits from makeing the underlying array for lists a tuple, so that operations like tuple([*iter]) would not require the copying from the list to the tuple

feral island Dec 6, 2022, 7:07 PM

#

pliant tusk i wonder if there would be any speed benefits from makeing the underlying array ...

then what if the list is mutated

pliant tusk Dec 6, 2022, 7:07 PM

#

it would only be allowed if the list has a refcount that proves it isnt referred to

#

and you might be able to optimize stuff like t = tuple(t) where t is a list

#

or you could check the refcount of the referred to tuple on mutations and copy if needed

#

then code like py x = [1,2,3,4] y = tuple(x) # y would point to x->tup del x would only require one backing array

raven ridge Dec 6, 2022, 7:48 PM

#

pliant tusk it would only be allowed if the list has a refcount that proves it isnt referred...

Not just that it isn't referred to, but that it can't ever be referred to - right?

#

otherwise you have to worry about things like ```py
x = [1,2,3,4]
y = tuple(x) # y would point to x->tup
x[0] = 42

pliant tusk Dec 6, 2022, 7:51 PM

#

right so list.__setitem__ would have a check like if l->tup->ob_refcount > 1 {reallocate()}

raven ridge Dec 6, 2022, 7:51 PM

#

ah, I see

pliant tusk Dec 6, 2022, 7:52 PM

#

pliant tusk right so `list.__setitem__` would have a check like `if l->tup->ob_refcount > 1 ...

and this check should compile down to a double dereference so that should be fast even when a realloc isnt needed (ie when there isnt a copy of the list)

raven ridge Dec 6, 2022, 7:55 PM

#

so that would make all lists in every Python program 16 bytes larger, and every list mutation or deletion a bit slower, and in exchange would make it faster to construct a tuple from a list.

#

seems possible, now that I understand what you're proposing, but I'm not convinced it's a good tradeoff.

pliant tusk Dec 6, 2022, 8:02 PM

#

yea i don't know if it would be a good idea, just would be interesting

raven ridge Dec 6, 2022, 8:03 PM

#

note that it would be possible to optimize (*x,) even without doing that. A possible implementation of (*x,) would be to call x.__length_hint__() first and, if it exists, pre-allocate a tuple of the hinted size and start unpacking directly into it. If __length_hint__ doesn't exist, or if the hint was wrong and there are more items than the hint suggested, it could fall back to the current way.

pliant tusk Dec 6, 2022, 8:03 PM

#

does it not call __length_hint__ now inside of UNPACK?

#

i know the tuple constructor calls it

feral island Dec 6, 2022, 8:04 PM

#

raven ridge note that it would be possible to optimize `(*x,)` even without doing that. A po...

this might be worse for small iterables because of the extra cost of calling __length_hint__

#

__length_hint__ isn't a slot I believe so calling it requires going through the method dict

raven ridge Dec 6, 2022, 8:07 PM

#

indeed, I probably should have put "optimize" in some scare quotes there.

raven ridge Dec 6, 2022, 8:09 PM

#

pliant tusk does it not call `__length_hint__` now inside of `UNPACK`?

the LIST_EXTEND might, but my point is that it doesn't necessarily have to BUILD_LIST / LIST_EXTEND / LIST_TO_TUPLE - it could just start with a tuple if __length_hint__ exists and is correct.

#

not necessarily better, and I'm not sure this is a common enough operation to be worth any special optimization, but that's one route that could work.

#

another would be assuming that unpacking is much more common for small iterables than large ones - it could start with a tuple of size 10, unpack into that, and if it fills it then it could realloc.

#

at the C level, tuples can be resized, so the fact that it doesn't know the final size isn't even necessarily a problem.

feral island Dec 6, 2022, 8:12 PM

#

list_extend (which I think is what LIST_EXTEND ends up calling) does use __length_hint__

grave jolt Dec 6, 2022, 9:22 PM

#

why are None, True and False in CamelCase?

glacial folio Dec 6, 2022, 9:26 PM

#

You mean Pascal case. To deferentiate between built in variables and defined variables (plus other reasons prob)

raven ridge Dec 6, 2022, 9:33 PM

#

https://peps.python.org/pep-0285/ says that True and False were chosen rather than true and false for consistency with None

PEP 285 – Adding a bool type | peps.python.org

Python Enhancement Proposals (PEPs)

#

Other languages (C99, C++, Java) name the constants “false” and “true”, in all lowercase. For Python, I prefer to stick with the example set by the existing built-in constants, which all use CapitalizedWords: None, Ellipsis, NotImplemented (as well as all built-in exceptions). Python’s built-in namespace uses all lowercase for functions and types only.

grave jolt Dec 6, 2022, 9:34 PM

#

glacial folio You mean Pascal case. To deferentiate between built in variables and defined var...

At first True and False weren't keywords actually

dusk comet Dec 6, 2022, 9:34 PM

#

Just noticed that python is around 40% faster on WSL than on Windows
ubuntu 22.04, Windows 10, CPython 3.11 (two different installations, one in windows, one in wsl), same benchmark
speed differs on different benchmarks, it is in range 20-50%
Why?

raven ridge Dec 6, 2022, 9:34 PM

#

raven ridge > Other languages (C99, C++, Java) name the constants “false” and “true”, in all...

what an oddly authoritative answer. Not super consistent - why should str be lower and Exception be pascal? - but authoritative nonetheless.

dusk comet Dec 6, 2022, 9:35 PM

#

grave jolt At first `True` and `False` weren't keywords actually

True, False = False, True

glacial folio Dec 6, 2022, 9:35 PM

#

The more u know

dusk comet Dec 6, 2022, 9:35 PM

#

grave jolt Dec 6, 2022, 9:35 PM

#

yep

#

in fact NotImplemented is still not a keyword

#

!e

NotImplemented = 42
print(NotImplemented)

fallen slateBOT Dec 6, 2022, 9:36 PM

#

@grave jolt :white_check_mark: Your 3.11 eval job has completed with return code 0.

grave jolt Dec 6, 2022, 9:36 PM

#

Built-in exceptions are classes so not sure what that comment is about 🤷‍♂️

dusk comet Dec 6, 2022, 9:36 PM

#

Ellipsis too

raven ridge Dec 6, 2022, 9:36 PM

#

At first True and False didn't exist at all, until surprisingly recently. Later they existed, but weren't keywords. Now they're keywords.

glacial folio Dec 6, 2022, 9:39 PM

#

dusk comet Just noticed that python is around 40% faster on WSL than on Windows ubuntu 22.0...

Might be cmake? Since it doesn't ship by default with windows

autumn sonnet Dec 6, 2022, 10:40 PM

#

dusk comet

you broke the matrix

#

hi, i just want to check, is it only me or anyone else...

i feel very anxious to read code if there isn't an empty line every 4,5 lines

dusk comet Dec 6, 2022, 10:49 PM

#

im using black, it enforces empty lines before/after every function/class
but im also adding empty lines inside functions to separate logic and syntactic blocks of code

radiant garden Dec 6, 2022, 11:25 PM

#

autumn sonnet hi, i just want to check, is it only me or anyone else... i feel very anxious t...

Good visual cues help when reading code!

#

Which is why a single line with a big line of comments above it stands out as Very Important Do Not Touch

strong gate Dec 7, 2022, 4:44 AM

#

I noticed that itertools is missing a chunk_by (or chunked) implementation as an undo for the chain function. Is there a technical reason (beyond no one created a PEP) for this?

feral island Dec 7, 2022, 5:09 AM

#

strong gate I noticed that itertools is missing a `chunk_by` (or `chunked`) implementation a...

maybe https://docs.python.org/3.12/library/itertools.html#itertools.batched?

prime estuary Dec 7, 2022, 7:05 AM

#

raven ridge what an oddly authoritative answer. Not super consistent - why should `str` be l...

I think I might have an idea as to why this is the case. Originally in like Python 1.X builtin types were totally different to classes, so you couldn't inherit etc. int(), str(), list() etc were merely functions which did a conversion, but were eventually replaced by the actual classes when that became possible. If you go to the earliest copy of the docs (Python 1.4), you can see that they're described as functions, and the builtin types in their section don't get given a specific identifier:
https://docs.python.org/release/1.4/lib/node26.html#SECTION00330000000000000000

raven ridge Dec 7, 2022, 7:26 AM

#

prime estuary I think I might have an idea as to why this is the case. Originally in like Pyth...

I wondered if it was related to the class vs type distinction that used to exist, but Exception would have been a type as well, AFAICT

#

I like the theory that it might be related to whether or not a given type could be subclassed back in the long long ago

strong gate Dec 7, 2022, 8:05 AM

#

feral island maybe https://docs.python.org/3.12/library/itertools.html#itertools.batched?

I was checking for 3.11, didn't see that. Thanks

prime estuary Dec 7, 2022, 8:42 AM

#

raven ridge I wondered if it was related to the class vs type distinction that used to exist...

Well exceptions were originally string values - that's why the exc_info tuple exists, and why except Exception1, Exception2: is a syntax error... Oh wow. They weren't even a specific class apparently, just a string compared by identity - you'd be raising like ("IOError", "Invalid permissions", <traceback>). So then the rule makes sense, CamelCase would be used for constant names, including None.
https://docs.python.org/release/1.4/lib/node25.html#SECTION00320000000000000000

gray galleon Dec 7, 2022, 9:58 AM

#

gray galleon why do all comparison operators come from 1 instruction? why not `BINARY_GREATER...

.

rose schooner Dec 7, 2022, 11:08 AM

#

gray galleon .

of all the bad ideas you suggested this one may or may not actually make some sense

#

well first of all that reduces the work

#

they all just call the same thing (PyObject_RichCompare()) anyway

#

the mode of comparison is just passed through the oparg

gray galleon Dec 7, 2022, 12:54 PM

#

they are doing that for other ops too

#

as BINARY_OP

lean kayak Dec 7, 2022, 3:18 PM

#

Hello all - I wanted to get some clarification around a stylistic choice. For multiline arglists, I adopted a style long ago (that I thought was PEP-8 compliant, but now I can't seem to find where I got it...) for multiline arglists:

def some_function(
        first_argument: int,
        second_argument: int,
        third_argument: int,
    ) -> int:
    """Get the product of three integers."""
    return first_argument * second_argument * third_argument

I've seen the variant where the closing ) of the function declaration is on the same line as the final parameter, and I've seen it where it's on its own line but in line with the def keyword. I also know that double-indenting the multiline arglist is a common style. What I can't seem to find is whether it's atypical to indent the closing ) in line with the function body in these situations. To me, this is very readable, but I can definitely imagine counterarguments to it. Thoughts?

pliant tusk Dec 7, 2022, 3:28 PM

#

does anyone know if Read After Frees are considered critical python vulns? And they use some obviously weird code so the code structures probably wouldn't show up by accident

feral island Dec 7, 2022, 3:31 PM

#

pliant tusk does anyone know if `Read After Free`s are considered critical python vulns? And...

yes, a read after free is bad; it can lead to segfaults and security issues

pliant tusk Dec 7, 2022, 3:33 PM

#

yea ik its bad, im just trying to determine if i should report it using the security email, because its not the kind of code that can be hit by accident. (it requires freeing a buffer inside of a dunder)

feral island Dec 7, 2022, 3:35 PM

#

probably fine to just report as a bug? there's a bunch of similar issues like https://github.com/python/cpython/issues/87353 where there were segfaults when doing something weird that haven't been treated as security issues. But maybe it's better to be safe than sorry

grave jolt Dec 7, 2022, 3:36 PM

#

C moment 😔

feral island Dec 7, 2022, 3:36 PM

#

https://github.com/python/cpython/issues/97592 is another one in the same style

grave jolt Dec 7, 2022, 3:36 PM

#

lean kayak Hello all - I wanted to get some clarification around a stylistic choice. For mu...

I am more used to this style

def some_function(
    first_argument: int,
    second_argument: int,
    third_argument: int,
) -> int:
    """Get the product of three integers."""
    return first_argument * second_argument * third_argument

I guess it's a bit easier to scan because the closing line is on a different level. And there's a tiny bit more space
But I guess it's more of a personal preference (or a preference of your team)

#

https://lukasz.langa.pl/1d1a43c4-9c8a-4c5f-a366-7f22ce6a49fc/

lukasz.langa.pl

Why the sad face? - Łukasz Langa

When you first encounter Black, a few things about it might surprise you. One of the those things might be

#

my team lead uses double indents for arguments, with blocks etc. and it drives me crazy

#

PEP8 suggests some styles but doesn't force one

lean kayak Dec 7, 2022, 3:41 PM

#

Right, right - and yeah I used to do it the way you do. I don't know why I landed on the double-indent style. Probably some SA SO discussion on it a few years ago.

grave jolt Dec 7, 2022, 3:42 PM

#

SA?

lean kayak Dec 7, 2022, 3:43 PM

#

Er, meant SO -- Stack Overflow

grave jolt Dec 7, 2022, 3:46 PM

#

o

deep bramble Dec 7, 2022, 4:31 PM

#

oh how I wish this was in ABCs

lunar harbor Dec 7, 2022, 7:38 PM

#

lean kayak Hello all - I wanted to get some clarification around a stylistic choice. For mu...

It doesn't matter much. What's more important is to just be consistent

#

hmm, this isn't the channel I thought it was

grave jolt Dec 7, 2022, 9:29 PM

#

kinda inconsistent 😉

rose schooner Dec 7, 2022, 10:40 PM

#

gray galleon as `BINARY_OP`

yep

deep bramble Dec 8, 2022, 1:35 AM

#

what exactly is it in abstract classes that causes a TypeError on initialization with missing abstract methods? I wasn't able to find any code in the abc module for it, just the code that collects them into cls.__abstractmethods__, and the raised type error's traceback doesn't even include the path to where it was raised from

raven ridge Dec 8, 2022, 3:18 AM

#

deep bramble what exactly is it in abstract classes that causes a TypeError on initialization...

It's apparently built into object.__new__ - https://github.com/python/cpython/blob/4246fe977d850f8b78505c982f055d33d52ff339/Objects/typeobject.c#L4971-L4976

fallen slateBOT Dec 8, 2022, 3:18 AM

#

Objects/typeobject.c lines 4971 to 4976

PyErr_Format(PyExc_TypeError,
             "Can't instantiate abstract class %s "
             "without an implementation for abstract method%s '%U'",
             type->tp_name,
             method_count > 1 ? "s" : "",
             joined);```

dusk comet Dec 8, 2022, 3:48 AM

#

Thats very weird. It makes every object creation slower, even if they are not instances of ABC-related classes.

raven ridge Dec 8, 2022, 4:02 AM

#

dusk comet Thats very weird. It makes every object creation slower, even if they are not in...

by one easily predicted branch based on a single bit check

#

https://github.com/python/cpython/blob/4246fe977d850f8b78505c982f055d33d52ff339/Objects/typeobject.c#L4937

fallen slateBOT Dec 8, 2022, 4:04 AM

#

Objects/typeobject.c line 4937

if (type->tp_flags & Py_TPFLAGS_IS_ABSTRACT) {```

dusk comet Dec 8, 2022, 4:05 AM

#

anyway it is not related to object itself
why code related to some module is included into object.__new__?

#

also this: ```py

type.dict['abstractmethods']
<attribute 'abstractmethods' of 'type' objects>

raven ridge Dec 8, 2022, 4:09 AM

#

https://peps.python.org/pep-3119/ says that making object.__new__ check Py_TPFLAGS_IS_ABSTRACT is an optimization over checking whether __abstractmethods__ is non-empty

PEP 3119 – Introducing Abstract Base Classes | peps.python.org

Python Enhancement Proposals (PEPs)

dusk comet Dec 8, 2022, 4:11 AM

#

it is making creation of instances of possibly abstract classes faster, but it is slowing down all other calls to object.__new__

raven ridge Dec 8, 2022, 4:23 AM

#

hm - does it need to be done this way to support other metaclasses?

#

like, does it need to be possible to create an abstract class without using ABCMeta so that you can have an abstract class with a different metaclass?

dusk comet Dec 8, 2022, 4:28 AM

#

Py_TPFLAGS_IS_ABSTRACT and related things are implementation details, so if you somehow created abstract class without abc you screwed up

raven ridge Dec 8, 2022, 4:28 AM

#

the PEP seems to be structured to say that abstract classes are a first class feature of Python, and that the abc module is just a helper, not the only way of creating abstract classes

dusk comet Dec 8, 2022, 4:33 AM

#

there is no public API (Python nor C) that allows us to create abstract classes
even Py_TPFLAGS_IS_ABSTRACT is not documented in Py_TPFLAGS list

#

this definition from glossary is not even related to object creation, it can be implemented in pure python without messing with any type flags

raven ridge Dec 8, 2022, 4:39 AM

#

hm, indeed...

deep bramble Dec 8, 2022, 11:35 AM

#

raven ridge It's apparently built into `object.__new__` - https://github.com/python/cpython/...

oh wow, that's incredibly weird

grave jolt Dec 8, 2022, 12:03 PM

#

raven ridge It's apparently built into `object.__new__` - https://github.com/python/cpython/...

mildly cursed lemon_cut

grave jolt Dec 8, 2022, 12:10 PM

#

fallen slate `Objects/typeobject.c` lines 4971 to 4976 ```c PyErr_Format(PyExc_TypeError, ...

!e
Here's a fun way to crash Python

from abc import ABC

class X(ABC):
    pass

X.__abstractmethods__ = iter(int, 1)
print(X())

fallen slateBOT Dec 8, 2022, 12:10 PM

#

@grave jolt :warning: Your 3.11 eval job timed out or ran out of memory.

[No output]

grave jolt Dec 8, 2022, 12:11 PM

#

I think I meant to reply to @raven ridge

deep bramble Dec 8, 2022, 12:17 PM

#

you can indeed create your own abstract class without the abc module

deep bramble Dec 8, 2022, 12:18 PM

#

dusk comet `Py_TPFLAGS_IS_ABSTRACT` and related things are implementation details, so if yo...

Py_TPFLAGS_IS_ABSTRACT is actually set by Cpython based on the existence of __abstractmethods__, so no, you don't need to use the abc module

#

https://github.com/python/cpython/blob/cd67c1bb30eccd0c6fd1386405df225aed4c91a9/Objects/typeobject.c#L765-L812

unkempt rock Dec 8, 2022, 3:12 PM

#

hexed jungle Dec 8, 2022, 3:39 PM

#

there is an error when I try to install the noise library for perlin noise. I use python 3.10 and am using pip to install it. There are some threads online about this but I am too stupid to understand these, can someone help fix this.

mild sonnet Dec 8, 2022, 4:02 PM

#

Here’s my code

#

Can someone tell me how to get my bot online

feral cedar Dec 8, 2022, 4:05 PM

#

you leaked your bot token. you should reset it

grave jolt Dec 8, 2022, 4:09 PM

#

Hey @mild sonnet, I removed your message because it had your bot token. Please change it as soon as possible.

If you have a question about discord.py, see #❓｜how-to-get-help or #discord-bots

raven ridge Dec 8, 2022, 6:51 PM

#

deep bramble you can indeed create your own abstract class without the abc module

Though __abstractmethods__ is undocumented, so it's not clear that you're meant to...

rose schooner Dec 8, 2022, 10:24 PM

#

hexed jungle there is an error when I try to install the noise library for perlin noise. I us...

follow the link

tacit hawk Dec 8, 2022, 11:18 PM

#

should stdlib imports come before third-party imports which comes before user packages import? this doesn't seem to be common but I do that

feral island Dec 8, 2022, 11:19 PM

#

tacit hawk should stdlib imports come before third-party imports which comes before user pa...

that's a common convention yes

burnt bloom Dec 8, 2022, 11:31 PM

#

I want to add a readonly property to the base object class. The property will be the address in memory. I know id() can do that, but it would be cool for that alternative method. Since it must be written in C due to it not being able to use extentsions like C#, how would I go about adding such a property?

hexed jungle Dec 8, 2022, 11:47 PM

#

rose schooner follow the link

I will try but I don't think it will do much. It is an error with python itself

rose schooner Dec 8, 2022, 11:48 PM

#

hexed jungle I will try but I don't think it will do much. It is an error with python itself

well you have to download build tools (provided by the link) to make it work
it's not an error within python itself

pliant tusk Dec 9, 2022, 12:19 AM

#

burnt bloom I want to add a readonly property to the base object class. The property will be...

You could use fishhook to do that

#

!e ```py

from fishhook import hook
@hook.property(object)
def addr(self):
return id(self)

print(int.addr)```

fallen slateBOT Dec 9, 2022, 12:20 AM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

140174779776640

burnt bloom Dec 9, 2022, 1:20 AM

#

pliant tusk You could use fishhook to do that

So, would this allow me to in that script do for example if I have a list that is [1, 2, 3] and the var name is list1. I can do list1.addr and it will print the memory location?

pliant tusk Dec 9, 2022, 1:21 AM

#

burnt bloom So, would this allow me to in that script do for example if I have a list that i...

Yea, it adds a property named addr to every instance of object

burnt bloom Dec 9, 2022, 1:22 AM

#

pliant tusk Yea, it adds a property named addr to every instance of object

This is a description I found. What does this all mean? And you can have static classes? And what are heap classes. I know what the heap is. But what are heap classes?

pliant tusk Dec 9, 2022, 1:23 AM

#

Static classes are classes defined in C

#

Heap classes are mutable by default (like all python classes and some very specific c classes)

pliant tusk Dec 9, 2022, 1:24 AM

#

burnt bloom This is a description I found. What does this all mean? And you can have static ...

I wrote that description a while ago and it was mostly referring to the memory location of a given class (heap or static memory)

#

But since then python has added the functionality for C classes in the heap, but fishhook still works

burnt bloom Dec 9, 2022, 1:25 AM

#

pliant tusk Heap classes are mutable by default (like all python classes and some very speci...

Wdym. C classes aren’t mutable? Like I can’t change properties of their classes? And wait C doesn’t have classes?

burnt bloom Dec 9, 2022, 1:25 AM

#

pliant tusk I wrote that description a while ago and it was mostly referring to the memory l...

U MADE IT?

pliant tusk Dec 9, 2022, 1:25 AM

#

*python classes defined in C

#

Like int or str or object

pliant tusk Dec 9, 2022, 1:25 AM

#

burnt bloom U MADE IT?

Yea it's one of my more fully fledged personal projects

burnt bloom Dec 9, 2022, 1:26 AM

#

pliant tusk Yea it's one of my more fully fledged personal projects

HOLY CRAP

#

How long have u been programming for?

#

That’s a rlly advanced project.

pliant tusk Dec 9, 2022, 1:27 AM

#

I released the first version of fishhook in 2020, and I think I started learning python a year or 2 prior

burnt bloom Dec 9, 2022, 1:28 AM

#

pliant tusk I released the first version of fishhook in 2020, and I think I started learning...

Damn. So how exactly does it work?

#

And how did u learn all the low level stuff u needed to do it?

#

Did u read cpython internals and learn C?

pliant tusk Dec 9, 2022, 1:29 AM

#

burnt bloom Did u read cpython internals and learn C?

Yea I read the internals a lot

burnt bloom Dec 9, 2022, 1:30 AM

#

pliant tusk Yea I read the internals a lot

I see. Did you learn C before python?

rose schooner Dec 9, 2022, 1:30 AM

#

burnt bloom How long have u been programming for?

it takes at least a year or two of programming to get most concepts right
the rest is learned at any time

burnt bloom Dec 9, 2022, 1:30 AM

#

Or C as u went along?

burnt bloom Dec 9, 2022, 1:31 AM

#

rose schooner it takes at least a year or two of programming to get most concepts right the re...

Hm I see. That’s just so cool.

pliant tusk Dec 9, 2022, 1:31 AM

#

I was leaning Objective C when I learned python

#

So ig plain C I learned as I went

burnt bloom Dec 9, 2022, 1:31 AM

#

I see.

#

Did u read that book for internals?

pliant tusk Dec 9, 2022, 1:31 AM

#

rose schooner it takes at least a year or two of programming to get most concepts right the re...

Very true

pliant tusk Dec 9, 2022, 1:31 AM

#

burnt bloom Did u read that book for internals?

No I learned more from #esoteric-python

rose schooner Dec 9, 2022, 1:32 AM

#

pliant tusk No I learned more from <#470884583684964352>

same

pliant tusk Dec 9, 2022, 1:33 AM

#

There was another user who showed me early on how you could change the builtin int.add with ctypes (fully manually) and it got me interested in doing it in a more general way

#

That's how fishhook came about

#

I don't know if the other user is still here, they used to be Juan or something

rose schooner Dec 9, 2022, 1:34 AM

#

pliant tusk There was another user who showed me early on how you could change the builtin i...

#esoteric-python message ?

pliant tusk Dec 9, 2022, 1:36 AM

#

You are faster at searching than I am lmao

#

Yea they are now @twilit garnet. And yea that got me interested in the internals

rose schooner Dec 9, 2022, 1:36 AM

#

i just searched in:esoteric-python int ctypes

pliant tusk Dec 9, 2022, 1:37 AM

#

Ah I did int .value = ctypes was smarter

rose schooner Dec 9, 2022, 1:41 AM

#

pliant tusk Yea they are now <@245270749919576066>. And yea that got me interested in the in...

i got interested when some code object i was building segfaulted because i removed a POP_TOP
skorb pointed to the line that caused the error and i got to modifying the internals to show a detailed error #esoteric-python message

pliant tusk Dec 9, 2022, 1:42 AM

#

rose schooner i got interested when some code object i was building segfaulted because i remov...

Lol I show up like 5 messages after that one

rose schooner Dec 9, 2022, 1:44 AM

#

pliant tusk Lol I show up like 5 messages after that one

i think the bug with your load_addr() thingy also gave motivation to start learning the internals

pliant tusk Dec 9, 2022, 1:47 AM

#

Oh yea how it used LOAD_DEREF?

#

I think I remember you asking for a ton of details on how load_addr worked

burnt bloom Dec 9, 2022, 2:12 AM

#

pliant tusk No I learned more from <#470884583684964352>

Oh fr? What is that?

burnt bloom Dec 9, 2022, 2:12 AM

#

pliant tusk There was another user who showed me early on how you could change the builtin i...

What’s that?

#

Sry I had to go, I’m doing be and stuff 😅

#

Also, ur lib has so much memory stuff and unlocking and locking. How does that work? @pliant tusk

pliant tusk Dec 9, 2022, 2:16 AM

#

burnt bloom Also, ur lib has so much memory stuff and unlocking and locking. How does that w...

Those functions toggle some type flags to trick python into thinking a given type is mutable/unmutable

burnt bloom Dec 9, 2022, 2:17 AM

#

pliant tusk Those functions toggle some type flags to trick python into thinking a given typ...

How did u figure that out?!

pliant tusk Dec 9, 2022, 2:17 AM

#

!e ```py

from fishhook import lock

class A:pass

lock(A)

A.a = 1 # fails ``` for fun you can do this too

burnt bloom Dec 9, 2022, 2:17 AM

#

And know u had to do that?

fallen slateBOT Dec 9, 2022, 2:17 AM

#

@pliant tusk :x: Your 3.11 eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 7, in <module>
003 | TypeError: cannot set 'a' attribute of immutable type 'A'

pliant tusk Dec 9, 2022, 2:17 AM

#

burnt bloom How did u figure that out?!

Lots of reading typeobject.c

burnt bloom Dec 9, 2022, 2:17 AM

#

pliant tusk Lots of reading typeobject.c

What is that?

#

And how did u know to read rhag specifically?

#

That

pliant tusk Dec 9, 2022, 2:18 AM

#

The file that controls how types work

burnt bloom Dec 9, 2022, 2:18 AM

#

Damn. Isn’t it like 10k lines tho?

#

How did u find what u need?

pliant tusk Dec 9, 2022, 2:20 AM

#

I looked specifically at the implementation of setattr

#

And how it sets slot functions to the correct pointers

burnt bloom Dec 9, 2022, 2:21 AM

#

pliant tusk And how it sets slot functions to the correct pointers

Slot functions?

#

And what do the pointers ur referencing do?

pliant tusk Dec 9, 2022, 2:22 AM

#

Slot functions specifically control dunder functions

#

So stuff like __add__

#

So the first version of fishhook was written to calculate those slot pointers, but now I just abuse setattr to set them for me

burnt bloom Dec 9, 2022, 2:32 AM

#

I see.

#

So u setattrfor objects?

burnt bloom Dec 9, 2022, 2:33 AM

#

pliant tusk Slot functions specifically control dunder functions

I see.

#

Where did u learn all this low level stuff? @pliant tusk

pliant tusk Dec 9, 2022, 2:40 AM

#

burnt bloom Where did u learn all this low level stuff? <@274715613115711488>

Lots of trial and error

burnt bloom Dec 9, 2022, 2:43 AM

#

pliant tusk Lots of trial and error

How long did it take u to make the lib?

#

And was it a consistent two years of python before u made it?

pliant tusk Dec 9, 2022, 3:05 AM

#

burnt bloom How long did it take u to make the lib?

A few weeks probably

burnt bloom Dec 9, 2022, 3:34 AM

#

pliant tusk A few weeks probably

That’s IT?

#

By the time u started working on it, were u well versed in how u were gonan do it?

pliant tusk Dec 9, 2022, 4:10 AM

#

burnt bloom By the time u started working on it, were u well versed in how u were gonan do i...

I had written some small specific stuff so I understood how to do it. Fishhook was just a new strategy meant to be more general

burnt bloom Dec 9, 2022, 4:13 AM

#

pliant tusk I had written some small specific stuff so I understood how to do it. Fishhook w...

I see. Well its a fantastic lib. Did u learn a lot of python in Uni if u did that?

pliant tusk Dec 9, 2022, 4:15 AM

#

I was still in HS when I released that lib lol

burnt bloom Dec 9, 2022, 4:20 AM

#

pliant tusk I was still in HS when I released that lib lol

WHAT

#

U making me sad now 😭

muted glacier Dec 9, 2022, 4:25 AM

#

burnt bloom U making me sad now 😭

💀

rose schooner Dec 9, 2022, 7:09 AM

#

burnt bloom U making me sad now 😭

i'm... still in (j)hs

warm breach Dec 9, 2022, 1:15 PM

#

pliant tusk !e ```py from fishhook import lock class A:pass lock(A) A.a = 1 # fails ``` ...

!e but what about 👀

import ctypes
from ctypes import pythonapi
from fishhook import lock

def setattr(obj, name, value):
    get_dict = pythonapi._PyObject_GetDictPtr
    get_dict.restype = ctypes.POINTER(ctypes.py_object)
    get_dict.argtypes = (ctypes.py_object,)
    ptr = pythonapi._PyObject_GetDictPtr(obj)
    ptr.contents.value[name] = value

class A:
    pass

lock(A)

setattr(A, "x", "what 🤔")

print(A.x)

fallen slateBOT Dec 9, 2022, 1:15 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

what 🤔

burnt bloom Dec 9, 2022, 1:53 PM

#

rose schooner i'm... still in (j)hs

💀

#

How long have u been coding?

frigid bison Dec 9, 2022, 2:13 PM

#

burnt bloom 💀

I think a lot of people here still are

twilit garnet Dec 9, 2022, 3:33 PM

#

pliant tusk Yea they are now <@245270749919576066>. And yea that got me interested in the in...

hehehe hello!

burnt bloom Dec 9, 2022, 3:37 PM

#

frigid bison I think a lot of people here still are

Not as as advanced XD

neat delta Dec 9, 2022, 6:28 PM

#

!pep 701 just saw that f-strings to formal grammar is in a draft. hope it gets approved soon 🥳

fallen slateBOT Dec 9, 2022, 6:28 PM

#

**PEP 701 - Syntactic formalization of f-strings**

Link

Status

Draft

Python-Version

3.12

Created

15-Nov-2022

Type

Standards Track

pliant tusk Dec 9, 2022, 6:44 PM

#

warm breach !e but what about 👀 ```py import ctypes from ctypes import pythonapi from fish...

yea fair that would still work, just like that would work on other immutable classes. (although you need to call pythonapi.PyType_Modified(py_object(cls)) after altering that dictionary to avoid crashes due to corrupted cache

pliant tusk Dec 9, 2022, 6:45 PM

#

twilit garnet hehehe hello!

hey, what do you think of fishhook, since your example way back kind of inspired it

unkempt rock Dec 9, 2022, 8:53 PM

#

neat delta !pep 701 just saw that f-strings to formal grammar is in a draft. hope it gets a...

I am really looking for the way it can work like a more powerful version of JavaScript template literals without the Python f-string syntax restrictions:

// Node.js v18.9.0 console

> `${`this`}` // example with delimiter collision
'this'
> `${"\n"}` // example with backslash
'\n'
> `${
... 'this' // comment
... }` // example with comment
'this'

grave jolt Dec 9, 2022, 9:13 PM

#

twilit garnet hehehe hello!

heyyyyyyyyyy!

#

welcome back @twilit garnet

rose schooner Dec 9, 2022, 10:45 PM

#

burnt bloom How long have u been coding?

3 years

#

5th grade started with lua(u)

burnt bloom Dec 10, 2022, 5:03 AM

#

rose schooner 5th grade started with lua(u)

Consistent 3 years?

rose schooner Dec 10, 2022, 9:16 AM

#

burnt bloom Consistent 3 years?

wdym consistent?

twilit garnet Dec 10, 2022, 1:58 PM

#

grave jolt welcome back <@245270749919576066>

hello! i need to come back to this place fr, i'm just so starved of free time to do so with uni and stuff now haha. i'm getting old!

grave jolt Dec 10, 2022, 1:59 PM

#

old people gang

twilit garnet Dec 10, 2022, 2:00 PM

#

pliant tusk hey, what do you think of fishhook, since your example way back kind of inspired...

it looks really cool! i'll definitely need to look further into it, i'd love to play around with it hehehe

twilit garnet Dec 10, 2022, 2:01 PM

#

grave jolt old people gang

hey i joined this server when i was 15 and now i'm 20, what a fever dream

exotic lichen Dec 10, 2022, 2:32 PM

#

,I need a code that firstly has a list(return of a subprocess),than (inside while True) is called the same function to get the list,but if the list elemens are gone or added the program is terminated
how can I do that
I have a code but it doesnt work properly.I can also show the code

warm breach Dec 10, 2022, 3:38 PM

#

!e

import sys

str_ = "abc123def"
print(sys.getrefcount(str_))

int_ = 123500
print(sys.getrefcount(int_))

list_ = [1, 2, 3]
print(sys.getrefcount(list_))

fallen slateBOT Dec 10, 2022, 3:38 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 4
002 | 4
003 | 2

warm breach Dec 10, 2022, 3:38 PM

#

anyone know why some objects like new strs and ints start with a refcount of 4?

feral island Dec 10, 2022, 3:43 PM

#

warm breach anyone know why some objects like new strs and ints start with a refcount of 4?

I guess one is the code object's co_consts, one is the variable, and one is the getrefcount() function

#

but that leaves one

warm breach Dec 10, 2022, 3:43 PM

#

also even deling the string only removes 1 reference

feral island Dec 10, 2022, 3:44 PM

#

gc.get_referrers() only finds co_consts

warm breach Dec 10, 2022, 3:44 PM

#

!e

import sys
import ctypes

str_ = "abc123def"
str_id = id(str_)
print(sys.getrefcount(str_))

del str_

print(sys.getrefcount(ctypes.cast(str_id, ctypes.py_object).value))

fallen slateBOT Dec 10, 2022, 3:44 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 4
002 | 3

flat gazelle Dec 10, 2022, 3:44 PM

#

try a string with a space in it

#

could be string interning

warm breach Dec 10, 2022, 3:48 PM

#

flat gazelle try a string with a space in it

hm, seems to be the same

#

but if I make it a non constant it does go back to 2 references

#

like

str_ = "abc 123 def"
str_ *= 2

#

so yeah I think it's the co_consts, but why does that increase refcount by 2 pithink

warm breach Dec 10, 2022, 3:53 PM

#

feral island I guess one is the code object's co_consts, one is the variable, and one is the ...

so is there a __code__ or co_consts even for module level statements? Is it accessible in runtime python code?

feral island Dec 10, 2022, 3:54 PM

#

warm breach so is there a `__code__` or `co_consts` even for module level statements? Is it ...

yeah everything is executed from a code object. Not sure how to get to it for a module though

#

Out[352]: <code object <module> at 0x7fb083b9c400, file "/Library/Frameworks/Python.framework/Versions/3.11/lib/python3.11/dis.py", line 1>

#

this seems to do it

warm breach Dec 10, 2022, 3:56 PM

#

ah cool thanks

#

also yeah, if I define the string in a function, as expected it's 2 references + 1 from co_consts = 3

#

!e

import sys

def foo():
    return "abc 123 def"
print(sys.getrefcount(foo()))

s2 = "500 def 123"
print(sys.getrefcount(s2))

fallen slateBOT Dec 10, 2022, 4:00 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 3
002 | 4

warm breach Dec 10, 2022, 4:00 PM

#

but somehow module level variables have 1 more reference from somewhere

#

I would have guessed some globals() dictionary but that doesn't explain the reference staying after del

dusk comet Dec 10, 2022, 6:23 PM

#

Maybe parser/compiler holds references to all string literals to use the same object every time some particular string literal is used in code?

rich cradle Dec 10, 2022, 6:44 PM

#

that exists, although i don't know how much it's used

#

that's known as interning

spare oriole Dec 10, 2022, 6:48 PM

#

#

floral peak Dec 10, 2022, 6:49 PM

#

@spare oriole https://docs.python.org/3/library/collections.abc.html#collections.abc.MutableMapping

Python documentation

collections.abc — Abstract Base Classes for Containers

Source code: Lib/_collections_abc.py This module provides abstract base classes that can be used to test whether a class provides a particular interface; for example, whether it is hashable or whet...

burnt bloom Dec 10, 2022, 8:41 PM

#

rose schooner wdym consistent?

Like keep going?

rose schooner Dec 10, 2022, 11:23 PM

#

burnt bloom Like keep going?

i don't remember taking any breaks so probably?

burnt bloom Dec 10, 2022, 11:51 PM

#

rose schooner i don't remember taking any breaks so probably?

Damn.

burnt bloom Dec 10, 2022, 11:52 PM

#

rose schooner i don't remember taking any breaks so probably?

Rn I’m starting to learn C, and working on discord bots. Hopefully to make a lib which can add certain things to python which it lacks. What else shld i do?

spark magnet Dec 10, 2022, 11:55 PM

#

burnt bloom Rn I’m starting to learn C, and working on discord bots. Hopefully to make a lib...

you're asking what does Python lack that you can add to your library?

burnt bloom Dec 11, 2022, 1:38 AM

#

spark magnet you're asking what does Python lack that you can add to your library?

Like, what other low level projects can I do.

warm breach Dec 11, 2022, 2:40 AM

#

https://github.com/python/cpython/blob/3.11/Include/cpython/tupleobject.h#L5-L11

fallen slateBOT Dec 11, 2022, 2:40 AM

#

Include/cpython/tupleobject.h lines 5 to 11

typedef struct {
    PyObject_VAR_HEAD
    /* ob_item contains space for 'ob_size' elements.
       Items must normally not be NULL, except during construction when
       the tuple is not yet visible outside the function that builds it. */
    PyObject *ob_item[1];
} PyTupleObject;```

warm breach Dec 11, 2022, 2:40 AM

#

is there a way to define a ctypes.Structure for a PyTupleObject?

#

since ob_item and the length of the tuple is only known at runtime

pliant tusk Dec 11, 2022, 3:17 AM

#

warm breach is there a way to define a `ctypes.Structure` for a `PyTupleObject`?

!e ```py
from ctypes import *

class PyTupleObject(Structure):
fields = [
('ob_refcount', c_ssize_t),
('ob_base', py_object),
('ob_size', c_ssize_t),
('_ob_items', py_object*0)
]

@property
def ob_items(self):
    items_addr = addressof(self._ob_items)
    return (py_object * self.ob_size).from_address(items_addr)

c = (0,1,2)
c_S = PyTupleObject.from_address(id(c))
c_S.ob_items[0] = c
print(c)

fallen slateBOT Dec 11, 2022, 3:17 AM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

((...), 1, 2)

warm breach Dec 11, 2022, 3:18 AM

#

pliant tusk !e ```py from ctypes import * class PyTupleObject(Structure): _fields_ = [ ...

what does the *0 do 👀

pliant tusk Dec 11, 2022, 3:19 AM

#

it just makes that slot a 0-length py_object array

warm breach Dec 11, 2022, 3:19 AM

#

ah hm

pliant tusk Dec 11, 2022, 3:19 AM

#

basically a placeholder for the later addressof

warm breach Dec 11, 2022, 3:20 AM

#

I was going to dynamically define the structure at runtime when size is known

#

not sure if that or address calculation is better

pliant tusk Dec 11, 2022, 3:20 AM

#

you should use the address calculation

#

(at least in my opinion)

#

because it means that if you do something like type(c_S).from_address(id(other_tuple)) it will still work

warm breach Dec 11, 2022, 3:23 AM

#

also do you know what's going on with pythonapi.PyTuple_GetItem, it seems to return an address not matching the actual objects

#

!e

from ctypes import *

class PyTupleObject(Structure):
    _fields_ = [
        ('ob_refcount', c_ssize_t),
        ('ob_base', py_object),
        ('ob_size', c_ssize_t),
        ('_ob_items', py_object*0)
    ]

    @property
    def ob_items(self):
        items_addr = addressof(self._ob_items)
        return (py_object * self.ob_size).from_address(items_addr)

tup = (0,1,2)
obj = PyTupleObject.from_address(id(tup))

print(id(tup[0]))
print(pythonapi.PyTuple_GetItem(obj, 0))

fallen slateBOT Dec 11, 2022, 3:24 AM

#

@warm breach :x: Your 3.11 eval job has completed with return code 139 (SIGSEGV).

139821935707976

warm breach Dec 11, 2022, 3:24 AM

#

👀 wut

pliant tusk Dec 11, 2022, 3:25 AM

#

!e you need to make sure to set the argtypes and return types properly, like so ```py
from ctypes import *

pythonapi.PyTuple_GetItem.restype = py_object
pythonapi.PyTuple_GetItem.argtypes = (py_object, c_ssize_t)

class PyTupleObject(Structure):
fields = [
('ob_refcount', c_ssize_t),
('ob_base', py_object),
('ob_size', c_ssize_t),
('_ob_items', py_object*0)
]

@property
def ob_items(self):
    items_addr = addressof(self._ob_items)
    return (py_object * self.ob_size).from_address(items_addr)

tup = (0,1,2)
obj = PyTupleObject.from_address(id(tup))

print(id(tup[0]))
print(pythonapi.PyTuple_GetItem(tup, 0))

fallen slateBOT Dec 11, 2022, 3:26 AM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 140197582378824
002 | 0

warm breach Dec 11, 2022, 3:27 AM

#

ah I see, nice

#

why isn't pythonapi typed 😔

#

also is that possible to work with the structure? instead of the actual tuple reference

pliant tusk Dec 11, 2022, 3:28 AM

#

because it is generated from the pythondll so it does not have type info

pliant tusk Dec 11, 2022, 3:28 AM

#

warm breach also is that possible to work with the structure? instead of the actual tuple re...

yes

#

!e ```py
from ctypes import *

class PyTupleObject(Structure):
fields = [
('ob_refcount', c_ssize_t),
('ob_base', py_object),
('ob_size', c_ssize_t),
('_ob_items', py_object*0)
]

@property
def ob_items(self):
    items_addr = addressof(self._ob_items)
    return (py_object * self.ob_size).from_address(items_addr)

pythonapi.PyTuple_GetItem.restype = py_object
pythonapi.PyTuple_GetItem.argtypes = (POINTER(PyTupleObject), c_ssize_t)

tup = (0,1,2)
obj = PyTupleObject.from_address(id(tup))

print(id(tup[0]))
print(pythonapi.PyTuple_GetItem(obj, 0))```

fallen slateBOT Dec 11, 2022, 3:29 AM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 139972054882120
002 | 0

warm breach Dec 11, 2022, 3:30 AM

#

so argtypes can be anything and it'll try to cast to that?

pliant tusk Dec 11, 2022, 3:30 AM

#

*just need to remember to use POINTER(struct) if you are passing it to something that expects a pyobject (which is just a pointer to a given structure)

pliant tusk Dec 11, 2022, 3:30 AM

#

warm breach so argtypes can be anything and it'll try to cast to that?

ish? you need to make sure it matches the abi of the function implementation

warm breach Dec 11, 2022, 3:32 AM

#

like if I do restype = ctypes.c_long that would be the pointer before casting to pyobject?

pliant tusk Dec 11, 2022, 3:32 AM

#

yea, but i would use ctypes.c_void_p

warm breach Dec 11, 2022, 3:32 AM

#

actually I guess that is same as not defining restype?

#

is there a predefined "default" restype or does it do something special if its not defined

pliant tusk Dec 11, 2022, 3:33 AM

#

yea i think it is c_int

#

argtypes does something special if it is not defined, it lets ctype handle the func like a varargs function.

#

so like printf (if you wanted to call libc.printf manually), you couldnt set argtypes if you want it to retain its varargs functionality

#

@warm breach you can also define a new interface using PYFUNCTYPE or CFUNCTYPE and then make copies of the functions from pythonapi if you want the same function with different interfaces

py_size = CFUNCTYPE(py_object, py_object, c_ssize_t)
struct_size = CFUNCTYPE(py_object, POINTER(PyTupleObject), c_ssize_t)

getitem_with_pyobj = py_size(pythonapi.PyTuple_GetItem)
getitem_with_struct = struct_size(pythonapi.PyTuple_GetItem)

warm breach Dec 11, 2022, 3:40 AM

#

👀 yeah that's useful thanks

pliant tusk Dec 11, 2022, 3:40 AM

#

warm breach 👀 yeah that's useful thanks

no prob, feel free to ping me if you have any more ctypes questions

warm breach Dec 11, 2022, 3:41 AM

#

also do you know how I can call PyTuple_GetSize with an python int of the tuple id

#

https://github.com/python/cpython/blob/3.11/Include/cpython/tupleobject.h#L22-L25

fallen slateBOT Dec 11, 2022, 3:41 AM

#

Include/cpython/tupleobject.h lines 22 to 25

static inline Py_ssize_t PyTuple_GET_SIZE(PyObject *op) {
    PyTupleObject *tuple = _PyTuple_CAST(op);
    return Py_SIZE(tuple);
}```

warm breach Dec 11, 2022, 3:41 AM

#

is it POINTER(py_object) for argtypes?

#

and then I tried casting the id into a pyobject, but maybe I am misunderstanding what py_object.from_address() does because supplying an id of a python object causes a segmentation fault pithink

pliant tusk Dec 11, 2022, 3:44 AM

#

warm breach and then I tried casting the id into a pyobject, but maybe I am misunderstanding...

so a py_object is a bit of a misnomer, its actually a pointer to a py_object. and from_address basically looks for that pointer at the address you pass it (id) so it is parsing the refcount as a pointer (and then crashing)

warm breach Dec 11, 2022, 3:44 AM

#

ah..

pliant tusk Dec 11, 2022, 3:44 AM

#

_ctypes.PyObj_FromPtr will do what you want

warm breach Dec 11, 2022, 3:44 AM

#

so.. the id int sort of is the py_object?

pliant tusk Dec 11, 2022, 3:45 AM

#

if you wanted to do py_object.from_address(arg) then arg would have to be the address of a pointer to your object

#

that is why you can access obj.ob_base with py_object.from_address like so py_object.from_address(id(obj) + 8)

#

for 1 that would give back int

warm breach Dec 11, 2022, 3:47 AM

#

I was previously doing

ctypes.cast(id, ctypes.py_object).value

to get a python object from id

#

not sure if that or _ctypes.PyObj_FromPtr is less cursed

pliant tusk Dec 11, 2022, 3:48 AM

#

yea that also works, as ctypes.cast goes straight from the address passed in

#

either works

warm breach Dec 11, 2022, 3:52 AM

#

pliant tusk _ctypes.PyObj_FromPtr will do what you want

also is there anywhere stuff like PyObj_FromPtr is documented? even just a list of them

pliant tusk Dec 11, 2022, 3:52 AM

#

i have no idea, I found all of this stuff by reading the source code

warm breach Dec 11, 2022, 4:19 AM

#

pliant tusk i have no idea, I found all of this stuff by reading the source code

!e do you know what's wrong with the types here 😔

import _ctypes
import ctypes
from ctypes import py_object

py_size = ctypes.CFUNCTYPE(ctypes.c_ssize_t, py_object)

PyTuple_Size = py_size(ctypes.pythonapi.PyTuple_Size)

tup = (1, 2, 3)
tup_id = id(tup)

py_obj = py_object(_ctypes.PyObj_FromPtr(tup_id))
print(py_obj)

size = PyTuple_Size(py_obj)
print(size)

fallen slateBOT Dec 11, 2022, 4:19 AM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | py_object((1, 2, 3))
002 | Exception ignored on calling ctypes callback function: <_FuncPtr object at 0x7fb67e394120>
003 | Traceback (most recent call last):
004 |   File "<string>", line 15, in <module>
005 | ctypes.ArgumentError: argument 1: <class 'TypeError'>: Don't know how to convert parameter 1
006 | 140421788063592

warm breach Dec 11, 2022, 4:20 AM

#

!e If I use the direct assignment instead of CFUNCTYPE it works fine with the same types

import _ctypes
import ctypes
from ctypes import py_object

PyTuple_Size = ctypes.pythonapi.PyTuple_Size
PyTuple_Size.argtypes = (py_object,)
PyTuple_Size.restype = ctypes.c_ssize_t

tup = (1, 2, 3)
tup_id = id(tup)

py_obj = py_object(_ctypes.PyObj_FromPtr(tup_id))
print(py_obj)

size = PyTuple_Size(py_obj)
print(size)

fallen slateBOT Dec 11, 2022, 4:20 AM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | py_object((1, 2, 3))
002 | 3

pliant tusk Dec 11, 2022, 4:22 AM

#

actually I was wrong about copying the function, you do it like this ```py
ctypes.pythonapi['PyTuple_Size']

#

CFUNCTYPE(...)(callable) is used to wrap python callables

warm breach Dec 11, 2022, 4:23 AM

#

ah okay

#

and then I just do the normal argtypes / restype assignment?

pliant tusk Dec 11, 2022, 4:24 AM

#

yea

warm breach Dec 11, 2022, 11:25 AM

#

anyone know where the struct of PyLongObject is defined?

#

doesn't seem to be in either
https://github.com/python/cpython/blob/3.11/Include/longobject.h
https://github.com/python/cpython/blob/3.11/Objects/longobject.c

blissful prawn Dec 11, 2022, 11:31 AM

#

Hi, my name is Mari-eve

rose schooner Dec 11, 2022, 11:41 AM

#

warm breach doesn't seem to be in either <https://github.com/python/cpython/blob/3.11/Includ...

Include/cpython/longintrepr.h

warm breach Dec 11, 2022, 11:43 AM

#

rose schooner `Include/cpython/longintrepr.h`

ah nice 👍

#

my cursed shenanigans can continue

rose schooner Dec 11, 2022, 11:44 AM

#

warm breach ah nice 👍

there's a multitude of places to search for some of the structs, particularly those of int, dict, and type(sys._getframe())

warm breach Dec 11, 2022, 11:44 AM

#

😔

#

also the cpp defs messes up my ide's find definitions

rose schooner Dec 11, 2022, 11:46 AM

#

warm breach doesn't seem to be in either <https://github.com/python/cpython/blob/3.11/Includ...

btw remember that most internal structs are contained in Include/cpython/

warm breach Dec 11, 2022, 11:52 AM

#

rose schooner btw remember that most internal structs are contained in `Include/cpython/`

yeah I think I just never opened that file since I thought it was something to do with reprs 🥴

gray galleon Dec 11, 2022, 1:28 PM

#

warm breach !e If I use the direct assignment instead of CFUNCTYPE it works fine with the sa...

wow

warm breach Dec 11, 2022, 1:28 PM

#

rose schooner btw remember that most internal structs are contained in `Include/cpython/`

scale of 0-10 how cursed would this be as a library

dusk comet Dec 11, 2022, 1:37 PM

#

v1 = view(view)
with v1.unsafe():
    v2 = view(view.unsafe.__exit__)
    with v2.unsafe():
        v2.value = None
    v1.value = None
assert False

frigid bison Dec 11, 2022, 1:53 PM

#

frigid bison Dec 11, 2022, 1:53 PM

#

dusk comet ```py v1 = view(view) with v1.unsafe(): v2 = view(view.unsafe.__exit__) ...

What is this sorcery

warm breach Dec 11, 2022, 2:07 PM

#

dusk comet ```py v1 = view(view) with v1.unsafe(): v2 = view(view.unsafe.__exit__) ...

well, value is a special method I have for IntView, the basic objects don't have it

#

but there is this:

#

view(None).move_to(view(5))

print(5)

#

warm breach Dec 11, 2022, 4:44 PM

#

warm breach ```py view(None).move_to(view(5)) print(5) ```

now with a <<= overload for move_from

tup = (1, 2, 3, 4)
print(type(tup), tup)

with view(tup).unsafe() as v:
    v[0] = "👀"
    v[4] = "🤔"
    v.size += 1
    print(type(tup), tup)

    v <<= 5
    print(type(tup), tup)

    v <<= ["what", "is", "this"]
    print(type(tup), tup)

<class 'tuple'> (1, 2, 3, 4)
<class 'tuple'> ('👀', 2, 3, 4, '🤔')
<class 'int'> 5
<class 'list'> ['what', 'is', 'this']

pliant tusk Dec 11, 2022, 5:05 PM

#

warm breach now with a `<<=` overload for `move_from` ```py tup = (1, 2, 3, 4) print(type(t...

oh extending a tuple at runtime like that is not safe 😬

grave jolt Dec 11, 2022, 5:11 PM

#

well, kinda says on the tin

#

unsafe

warm breach Dec 11, 2022, 5:19 PM

#

pliant tusk oh extending a tuple at runtime like that is not safe 😬

yeah I suppose it's just writing into unowned memory? or whatever is beyond the tuple struct pithink

#

is it possible to expand / resize a PyObject struct in place? I'm assuming no

quick snow Dec 11, 2022, 5:23 PM

#

You could create a new ~~tuple~~ object, and then rewire all references.

pliant tusk Dec 11, 2022, 5:25 PM

#

warm breach is it possible to expand / resize a PyObject struct in place? I'm assuming no

not safely or consistently

warm breach Dec 11, 2022, 5:27 PM

#

quick snow You could create a new ~~tuple~~ object, and then rewire all references.

is that actually possible?

#

re-directing references

quick snow Dec 11, 2022, 5:28 PM

#

warm breach re-directing references

Yes, I did that for a project once (although we limited our types of references): https://github.com/L3viathan/batchable

TL;DR: You can call gc.get_referrers() and then change those (in more or less complicated ways, depending on mutability etc.)

#

The interesting part is just the Proxy class and its replace() method

pliant tusk Dec 11, 2022, 5:29 PM

#

!e ```py
import gc

def replace_tuple(self, new):
for container in gc.get_referrers(self):
if isinstance(container, dict):
for k, v in container.items():
if v is self:
container[k] = new
elif isinstance(container, list):
for i, v in enumerate(container):
if v is self:
container[i] = new
elif isinstance(container, tuple):
for i, v in enumerate(container):
if v is self:
temp = list(container)
temp[i] = new
replace_tuple(container, tuple(temp))
elif isinstance(container, set):
container.remove(self)
container.add(new)

x = (1,2,3)
replace_tuple(x, (0,1,2,3,4))
print(x)```

fallen slateBOT Dec 11, 2022, 5:30 PM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

(0, 1, 2, 3, 4)

warm breach Dec 11, 2022, 5:40 PM

#

pliant tusk !e ```py import gc def replace_tuple(self, new): for container in gc.get_re...

though this doesn't find constants that are still the same object (id) right?

#

!e

import gc

def replace_tuple(self, new):
    for container in gc.get_referrers(self):
        if isinstance(container, dict):
            for k, v in container.items():
                if v is self:
                    container[k] = new
        elif isinstance(container, list):
            for i, v in enumerate(container):
                if v is self:
                    container[i] = new
        elif isinstance(container, tuple):
            for i, v in enumerate(container):
                if v is self:
                    temp = list(container)
                    temp[i] = new
                    replace_tuple(container, tuple(temp))
        elif isinstance(container, set):
            container.remove(self)
            container.add(new)

x = (1, 2, 3)
print(id(x))

def fn():
    t = (1, 2, 3)
    print(id(t))
    return t

replace_tuple(x, ("replaced", "tuple"))

print(x)
print(fn())

fallen slateBOT Dec 11, 2022, 5:41 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 139823796873536
002 | ('replaced', 'tuple')
003 | 139823796873536
004 | (1, 2, 3)

pliant tusk Dec 11, 2022, 5:46 PM

#

correct, it only loops through naive containers.

dusk comet Dec 11, 2022, 7:20 PM

#

quick snow You could create a new ~~tuple~~ object, and then rewire all references.

wont work for interned ints, for example
their references are stored in array and to do that you should find and change that array

quick snow Dec 11, 2022, 7:24 PM

#

dusk comet wont work for interned ints, for example their references are stored in array an...

You can look at the referrers of an interned int, and replace them. Where would it matter that I don't find that array? Yes — they would not get garbage-collected, that's true. But if I try to replace all references to 111 to point to 99 instead, that (mostly) works.

pliant tusk Dec 11, 2022, 8:50 PM

#

quick snow You can look at the referrers of an interned int, and replace them. Where would ...

Afaik you cannot get referrers for ints because they are not tracked by gc

quick snow Dec 11, 2022, 8:50 PM

#

pliant tusk Afaik you cannot get referrers for ints because they are not tracked by gc

!e

import gc
for x in gc.get_referrers(111):
    print(type(x), id(x))

fallen slateBOT Dec 11, 2022, 8:50 PM

#

@quick snow :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | <class 'list'> 140222762929152
002 | <class 'tuple'> 140222762985792

pliant tusk Dec 11, 2022, 8:52 PM

#

Huh neat, idk why I thought that didn't work

dusk comet Dec 11, 2022, 9:16 PM

#

>>> import sys, gc
>>> sys.getrefcount(111) - 10**9
15
>>> [*map(type, gc.get_referrers(111))]
[<class 'list'>, <class 'tuple'>, <class 'tuple'>, <class 'list'>, <class 'tuple'>, <class 'dict'>, <class 'dict'>, <class 'dict'>, <class 'list'>]
>>> len([*map(type, gc.get_referrers(111))])
9
``` `gc.get_referrers` is not returning all referrers, only some of them

pliant tusk Dec 11, 2022, 9:16 PM

#

^ah that is why I thought it didn't work

rose schooner Dec 11, 2022, 10:10 PM

#

quick snow You could create a new ~~tuple~~ object, and then rewire all references.

what about set refcnt to 1 (then adjusted for the ctypes call) and call ctypes.pythonapi._PyTuple_Resize?

feral island Dec 11, 2022, 10:20 PM

#

rose schooner what about set refcnt to 1 (then adjusted for the `ctypes` call) and call `ctype...

that returns you a new pointer and invalidates the old one, right?

#

so if your refcount isn't actually 1 all the other references are now pointing to garbage

sand bear Dec 12, 2022, 3:17 AM

#

ducky_devil

warm breach Dec 12, 2022, 6:57 AM

#

rose schooner what about set refcnt to 1 (then adjusted for the `ctypes` call) and call `ctype...

!e seems fine if you hold the only reference to the tuple

from ctypes import *

IncRef = pythonapi["Py_IncRef"]
IncRef.argtypes = (py_object,)

DecRef = pythonapi["Py_DecRef"]
DecRef.argtypes = (py_object,)

Resize = pythonapi["_PyTuple_Resize"]
Resize.argtypes = (POINTER(py_object), c_ssize_t)

SetItem = pythonapi["PyTuple_SetItem"]
SetItem.argtypes = (py_object, c_ssize_t, py_object)

def get_tup():
  t = py_object(("cat", "dog"))
  print(t.value)
  print(id(t.value))
  DecRef(t)

  for item in ["snake", "py", "rs"]:
    size = len(t.value)
    IncRef(item)
    DecRef(t)
    Resize(byref(t), size+1)
    SetItem(t, size, item)
    IncRef(t)

    print(t.value)
    print(id(t.value))
  
  IncRef(t)
  return t.value

print(get_tup())

fallen slateBOT Dec 12, 2022, 6:57 AM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | ('cat', 'dog')
002 | 140502162303552
003 | ('cat', 'dog', 'snake')
004 | 140502162303552
005 | ('cat', 'dog', 'snake', 'py')
006 | 140502162315904
007 | ('cat', 'dog', 'snake', 'py', 'rs')
008 | 140502162315904
009 | ('cat', 'dog', 'snake', 'py', 'rs')

warm breach Dec 12, 2022, 7:01 AM

#

^ though I'm not sure why the id only changes sometimes

#

iirc the PyTuple_Resize always creates a new tuple with small tuples?

dusk comet Dec 12, 2022, 7:33 AM

#

Overallocating?

warm breach Dec 12, 2022, 8:57 AM

#

feral island but that leaves one

I finally figured it out, the extra reference is from the one sent to sys.getrefcount, so it'll always be 1 more than the actual refcount before call 😔

grave jolt Dec 12, 2022, 10:02 AM

#

warm breach ^ though I'm not sure why the id only changes sometimes

The memory allocator itself often over-allocates (or rather allocates in chunks), so sometimes realloc will just extend the region and keep the same pointer.

#

That's my hypothesis

warm breach Dec 12, 2022, 10:03 AM

#

grave jolt The memory allocator itself often over-allocates (or rather allocates in chunks)...

actually speaking of which, do you know how to interpret the result from sys.getsizeof(<some tuple>)?

#

say we have

x = (1, 2, 3)

and my understanding of the struct is:

class PyTupleObject(Structure):
    ob_refcnt: c_ssize_t  # 8 bytes
    ob_type: POINTER(c_void_p)  # 8 bytes
    ob_size: c_ssize_t  # 8 bytes
    ob_item: c_ssize_t * N  # 8 bytes * N

#

so 8 + 8 + 8 + 8 * 3 = 40 bytes?

#

but sys.getsizeof(x) gives 64

#

I thought it was alignment but isn't 40 already 8 byte aligned?

grave jolt Dec 12, 2022, 10:08 AM

#

Hmmmm

warm breach Dec 12, 2022, 10:08 AM

#

() -> 40
(1,) -> 48
(1, 2) -> 56
(1, 2, 3) -> 64

#

so adding elements adds 8 bytes as expected

#

but it starts at 40 bytes...?

grave jolt Dec 12, 2022, 10:11 AM

#

Ahh I think I get it

#

Every object also contains its size IIRC

#

Hence another 8-byte field

#

Wait, no

warm breach Dec 12, 2022, 10:23 AM

#

I mean, considering 0 size tuple it should just be 3x8=24 from the struct.

#

so there's another 16 bytes from somewhere

#

!e also int, which works pretty much the same way except with a uint32 array, starts at 24 as expected when its empty (0)

from sys import getsizeof

x = 0
print(getsizeof(x))

fallen slateBOT Dec 12, 2022, 10:29 AM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

prime estuary Dec 12, 2022, 10:40 AM

#

I think it could be the GC data, since they are GC tracked.

warm breach Dec 12, 2022, 11:18 AM

#

prime estuary I think it could be the GC data, since they are GC tracked.

!e that doesn't explain the empty one though, it's specifically not tracked

import gc

t = ()
print(gc.is_tracked(t))

fallen slateBOT Dec 12, 2022, 11:18 AM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

False

prime estuary Dec 12, 2022, 11:22 AM

#

It'd be allocated with the capability to, since that's like a type flag.

rose schooner Dec 12, 2022, 11:22 AM

#

warm breach say we have ```py x = (1, 2, 3) ``` and my understanding of the struct is: ```py...

so (1, 2, 3).__sizeof__() returns 48

warm breach Dec 12, 2022, 11:22 AM

#

is that documented anywhere? pithink

#

eh

rose schooner Dec 12, 2022, 11:23 AM

#

then https://github.com/python/cpython/blob/3.11/Python/sysmodule.c#L1705

fallen slateBOT Dec 12, 2022, 11:23 AM

#

Python/sysmodule.c line 1705

return (size_t)size + _PyType_PreHeaderSize(Py_TYPE(o));```

rose schooner Dec 12, 2022, 11:23 AM

#

it's added to that

warm breach Dec 12, 2022, 11:23 AM

#

...

#

fml I questioned everything except the getsizeof implementation

#

what even is a PyType_PreHeaderSize

rose schooner Dec 12, 2022, 11:24 AM

#

https://github.com/python/cpython/blob/main/Include/internal/pycore_object.h#L278-L283

fallen slateBOT Dec 12, 2022, 11:24 AM

#

Include/internal/pycore_object.h lines 278 to 283

static inline size_t
_PyType_PreHeaderSize(PyTypeObject *tp)
{
    return _PyType_IS_GC(tp) * sizeof(PyGC_Head) +
        _PyType_HasFeature(tp, Py_TPFLAGS_PREHEADER) * 2 * sizeof(PyObject *);
}```

rose schooner Dec 12, 2022, 11:25 AM

#

tuple matches the requirement for _PyType_IS_GC(tp)

#

so add the size of this (equivalent to 2 pointers of size, so 16 bytes in 64-bit) https://github.com/python/cpython/blob/main/Include/internal/pycore_gc.h#L12-L20

fallen slateBOT Dec 12, 2022, 11:27 AM

#

Include/internal/pycore_gc.h lines 12 to 20

typedef struct {
    // Pointer to next object in the list.
    // 0 means the object is not tracked
    uintptr_t _gc_next;

    // Pointer to previous object in the list.
    // Lowest two bits are used for flags documented later.
    uintptr_t _gc_prev;
} PyGC_Head;```

rose schooner Dec 12, 2022, 11:27 AM

#

and there you have it

#

(1, 2, 3).__sizeof__() + 16 == 64

warm breach Dec 12, 2022, 11:30 AM

#

curious

#

is that struct actually preallocated together in memory as the tuple struct?

rose schooner Dec 12, 2022, 11:35 AM

#

warm breach is that struct actually preallocated together in memory as the tuple struct?

wdym?

warm breach Dec 12, 2022, 11:35 AM

#

PyGC_Head

#

is that allocated together with the object

#

or does it have nothing to do with the original struct and is maintained independent by the GC

rose schooner Dec 12, 2022, 11:36 AM

#

warm breach is that allocated together with the object

‫if you had Py_TRACE_REFS defined at compilation it may be

rose schooner Dec 12, 2022, 11:36 AM

#

warm breach or does it have nothing to do with the original struct and is maintained indepen...

otherwise it's just this

warm breach Dec 12, 2022, 11:42 AM

#

I think I'll just call the dunder __sizeof__ for struct size then

#

should be fairly safe...?

rose schooner Dec 12, 2022, 11:44 AM

#

warm breach should be fairly safe...?

i guess

feral island Dec 12, 2022, 1:43 PM

#

warm breach is that allocated together with the object

pretty sure it's really in front of the tuple itself in memory

feral island Dec 12, 2022, 1:44 PM

#

warm breach I think I'll just call the dunder `__sizeof__` for struct size then

what's your use case? for GC-tracked types you are going to need that GC header

warm breach Dec 12, 2022, 1:51 PM

#

feral island what's your use case? for GC-tracked types you are going to need that GC header

just trying to get a memory copy of the entire object at some address, and wondering how many bytes I have to copy for the entire object

#

very cursed but this essentially copies memory from an object (500) into another object (9015)

import view
x = 9015
v = view(x)

with v.unsafe():
    v <<= 500

print(x)
print(9015)

500
500

#

I'm actually not sure what happens to the GC header if it is copied to another object, and the original object is garbage collected... pithink

#

does the GC try to free the new memory it got copied to as well...?

feral island Dec 12, 2022, 2:00 PM

#

oh yeah, copying the GC header would make this worse

warm breach Dec 12, 2022, 2:00 PM

#

in any case currently I'm memmoveing from the object address + size from __sizeof__, so if the GC header is before the object address struct I guess it never gets moved?

feral island Dec 12, 2022, 2:04 PM

#

yes, you'll probably get trouble if you copy from a GC-tracked type into a non-GC-tracked type

#

since the GC will treat whatever memory is in front of the object as the header

warm breach Dec 12, 2022, 2:06 PM

#

feral island yes, you'll probably get trouble if you copy from a GC-tracked type into a non-G...

yeah that seems to be the combination that will cause a segfault

#

!e

from ctypes import memmove

obj = 5
src = ("dog", "cat")

memmove(id(obj), id(src), src.__sizeof__())
print(obj)

fallen slateBOT Dec 12, 2022, 2:06 PM

#

@warm breach :x: Your 3.11 eval job has completed with return code 139 (SIGSEGV).

('dog', 'cat')

warm breach Dec 12, 2022, 2:07 PM

#

!e whereas the reverse seems fine

from ctypes import memmove

obj = ("dog", "cat")
src = 5

memmove(id(obj), id(src), src.__sizeof__())
print(obj)

fallen slateBOT Dec 12, 2022, 2:07 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

feral island Dec 12, 2022, 2:07 PM

#

that's probably also UB, won't it free() the wrong pointer?

warm breach Dec 12, 2022, 2:09 PM

#

which free, on the object that gets overwritten?

feral island Dec 12, 2022, 2:11 PM

#

yes

#

or maybe it's fine if it's within a Python-maintained free list?

warm breach Dec 12, 2022, 2:13 PM

#

hm, I think? still not fully understanding the segfault for when a non gc tracked type gets overwritten with memory from a tracked type (no header)

#

the original GC and freeing of the source object should be unaffected right?

feral island Dec 12, 2022, 2:14 PM

#

warm breach hm, I think? still not fully understanding the segfault for when a non gc tracke...

the GC header is a doubly-linked list pointing to other GC-tracked objects. when the object is GCed, the interpreter will need to follow the links in the DLL to remove the GCed objects from the DLL

#

it decides whether to do that based on whether the type opts in to GC

#

but in your case, there is a GC-tracked object (according to its type) that doesn't actually have a GC header in front of it, just whatever random memory happens to be there

#

so 💥

warm breach Dec 12, 2022, 2:16 PM

#

ah that make perfect sense thanks

#

I thought lack of header just means the GC won't touch it

feral island Dec 12, 2022, 2:17 PM

#

for the reverse case, when the object is GCed, Python will (probably) ultimately call free() on the pointer. If the object is GC-tracked, it should actually do free(pointer - sizeof(GC header)), but in this case it won't because it thinks there's no GC header

#

what will happen if you free() the wrong pointer? no idea, probably depends on your malloc implementation

fair spade Dec 12, 2022, 2:29 PM

#

Hello

#

I am new here

#

Nd wanting to learn python for data analytics

#

I just want the complete roadmap and perfect problem practicing websites

#

Can anyone help me pls

warm breach Dec 12, 2022, 2:32 PM

#

feral island but in your case, there is a GC-tracked object (according to its type) that does...

!e so copying the header kind of seems to work 🥴

from ctypes import memmove, pythonapi, py_object

Py_IncRef = pythonapi["Py_IncRef"]
Py_IncRef.argtypes = (py_object,)

obj = 500
src = (1, 2)

Py_IncRef(py_object(src))
memmove(id(obj) - 16, id(src) - 16, src.__sizeof__() + 16)

print(500)

fallen slateBOT Dec 12, 2022, 2:32 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

(1, 2)

feral island Dec 12, 2022, 2:33 PM

#

warm breach !e so copying the header kind of seems to work 🥴 ```py from ctypes import memm...

I don't think so, that will mess up the doubly linked list

warm breach Dec 12, 2022, 2:33 PM

#

ah right pithink

feral island Dec 12, 2022, 2:33 PM

#

and write into memory that you don't own

warm breach Dec 12, 2022, 2:33 PM

#

also yeah that 😔

feral island Dec 12, 2022, 2:33 PM

#

possibly change the value of 499?

#

actually probably not, it's outside the range of ints that are allocated in the small ints array

warm breach Dec 12, 2022, 2:34 PM

#

ah are the small ints contiguous in memory?

#

!e

from ctypes import memmove, pythonapi, py_object

Py_IncRef = pythonapi["Py_IncRef"]
Py_IncRef.argtypes = (py_object,)

obj = 10
src = (1, 2)

Py_IncRef(py_object(src))
memmove(id(obj) - 16, id(src) - 16, src.__sizeof__() + 16)

print(8)
print(9)

fallen slateBOT Dec 12, 2022, 2:36 PM

#

@warm breach :x: Your 3.11 eval job has completed with return code 139 (SIGSEGV).

001 | 8
002 | 0
003 | free(): invalid pointer

warm breach Dec 12, 2022, 2:36 PM

#

👀 interesting

feral island Dec 12, 2022, 2:37 PM

#

it's -3 to 252 I think

dusk comet Dec 12, 2022, 3:37 PM

#

to 255 at least

#

to make bytes/bytearray indexing fast

#

and from -5 iirc

#

>>> x = -5
>>> y = -5
>>> x is y
True
>>> x = -6
>>> y = -6
>>> x is y
False

gray galleon Dec 12, 2022, 3:50 PM

#

warm breach !e ```py from ctypes import memmove obj = 5 src = ("dog", "cat") memmove(id(ob...

someone trying to make segfaults in python (real)

gray galleon Dec 12, 2022, 3:51 PM

#

dusk comet ```py >>> x = -5 >>> y = -5 >>> x is y True >>> x = -6 >>> y = -6 >>> x is y Fal...

what

sand bear Dec 12, 2022, 6:05 PM

#

🍪

pliant tusk Dec 12, 2022, 7:21 PM

#

warm breach ah are the small ints contiguous in memory?

remember that some types __sizeof__ return a value that is not representative of contiguous memory

#

!e py print('lists have constant size due to inner pointer:', list.__basicsize__) print('__sizeof__ adds in length of pointed to array', [1,2,3].__sizeof__())

fallen slateBOT Dec 12, 2022, 7:22 PM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | lists have constant size due to inner pointer: 40
002 | __sizeof__ adds in length of pointed to array 72

warm breach Dec 12, 2022, 7:22 PM

#

👀

pliant tusk Dec 12, 2022, 7:24 PM

#

also, if you want a way to call __sizeof__ on any object, use this ```py
def sizeof(obj):
return type(obj).sizeof(obj)

warm breach Dec 12, 2022, 7:25 PM

#

I never really understood how that worked

#

how sometimes some dunders are not accessible in the instance

#

and only by class call

pliant tusk Dec 12, 2022, 7:25 PM

#

if they are defined on an instance and on the type (like in classes)

#

!e py print(list.__sizeof__())

fallen slateBOT Dec 12, 2022, 7:25 PM

#

@pliant tusk :x: Your 3.11 eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 1, in <module>
003 | TypeError: unbound method list.__sizeof__() needs an argument

pliant tusk Dec 12, 2022, 7:26 PM

#

!e py print(type(list).__sizeof__(list))

fallen slateBOT Dec 12, 2022, 7:26 PM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

warm breach Dec 12, 2022, 7:29 PM

#

but even object instance have it as instance method no?

#

!e

print(object().__sizeof__())

fallen slateBOT Dec 12, 2022, 7:29 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

pliant tusk Dec 12, 2022, 7:29 PM

#

yea, types don't, because of how they work

warm breach Dec 12, 2022, 7:29 PM

#

ah hm

#

haven't tried to make a type struct yet

#

wonder what happens if you move one in memory 👀

pliant tusk Dec 12, 2022, 7:57 PM

#

wonky stuff happens

#

types have a lot of parts that are located at dynamic offsets

swift imp Dec 12, 2022, 9:37 PM

#

The pep692 discussion has me a little annoyed. Like yeah let's just keep annotations more and more verbose by requiring the use of Unpack instead of introducing a new syntax

rose schooner Dec 12, 2022, 10:09 PM

#

dusk comet to 255 at least

isn't it -5 to 256 (inclusive)?

#

so range(-5, 257)

peak spoke Dec 12, 2022, 10:31 PM

#

swift imp The pep692 discussion has me a little annoyed. Like yeah let's just keep annotat...

I've noticed that people who seemingly have never used typing beyond anything basic tend to comment on the typing pep discussions as against them

flat gazelle Dec 12, 2022, 10:37 PM

#

Huh, have the core devs moved on from "no syntax specifically for type hints"

feral island Dec 12, 2022, 10:42 PM

#

PEP 646 had some syntax specifically for type hints

#

and PEP 692 might still get it too

flat gazelle Dec 12, 2022, 10:47 PM

#

Huh, good to know

#

Probably for the best TBH

warm breach Dec 13, 2022, 4:39 AM

#

are there any plans for the return variable type hint to be exposed to a called function?

#

like here func() can somehow know that its caller has annotated its return type as int

x: int = func()

pliant tusk Dec 13, 2022, 4:43 AM

#

No but you can annotate a func with a given return type

#

def func() -> int:return 0

deft pagoda Dec 13, 2022, 7:31 AM

#

warm breach like here `func()` can somehow know that its caller has annotated its return typ...

you can look at outer scope of a function with some shenanigans

grave jolt Dec 13, 2022, 8:31 AM

#

warm breach are there any plans for the return variable type hint to be exposed to a called ...

What would it do with it though?

warm breach Dec 13, 2022, 8:36 AM

#

grave jolt What would it do with it though?

I guess you could have a library where that influences runtime-behavior

#

like

x: str = get_input()
y: int = get_input()  # parses to an int instead
z: list[int] = get_input()  # splits input to a list of ints

#

though I guess that would make it impossible to use without an assignment

#

~~though perhaps when used without an assignment, you receive the type hint of the callable it was called in~~(kind of confusing though, maybe just return None if no type hints available)

#

from typing import get_assignment

def fn():
    print(get_assignment())

abc: list[int] = fn()
>> ('abc', list[int])

xyz = fn()
>> ('xyz', None)

fn()
>> (None, None)

grave jolt Dec 13, 2022, 8:53 AM

#

Well, you can just do abc = fn(list[int])

warm breach Dec 13, 2022, 8:55 AM

#

but not as magical right firThump

#

also I suppose you may be able to get the type hint of a caller as a function as well

from typing import get_assignment

def outer(x, i: int):
    ...

def fn():
    print(get_assignment())

outer(fn())
>> ('x', None)

outer(i=fn())
>> ('i', int)

gray galleon Dec 13, 2022, 12:23 PM

#

warm breach like ```py x: str = get_input() y: int = get_input() # parses to an int instead...

x = get_input(type=str)
y = get_input(type=int)
z = get_input(type=list[int])

dusk comet Dec 13, 2022, 12:41 PM

#

Is it possible to get local annotations without re-parsing source code of function?

rose schooner Dec 13, 2022, 12:44 PM

#

warm breach also I suppose you may be able to get the type hint of a caller as a function as...

that's gonna be really hard to do

#

unless there's a thing in inspect that can do it well

warm breach Dec 13, 2022, 1:08 PM

#

rose schooner that's gonna be really hard to do

yeah it's pretty cursed 😔

#

also what to even annotate the return type as

deft pagoda Dec 13, 2022, 1:15 PM

#

warm breach yeah it's pretty cursed 😔

import inspect
def f():
    parent_frame = inspect.currentframe().f_back
    line = inspect.getframeinfo(parent_frame).code_context[0]
    print(line)

x: int = f()  # x: int = f()

don't think it works in repl

#

but i suppose you can parse this line for annotations using ast and etc

gray galleon Dec 13, 2022, 3:53 PM

#

when will python break gil
or it never will?

pseudo cradle Dec 13, 2022, 9:21 PM

#

Do you think having {} be an empty set gain any traction? IF {3,5,2} is a set, then {} should be an empty set

frigid bison Dec 13, 2022, 9:23 PM

#

A dictionary is a way more common use case of {} so it makes sense that it's an empty doct

quick snow Dec 13, 2022, 9:23 PM

#

And if {1: 2} is a dict, then {} should be an empty dict. Dicts are much more commonly used than sets, and in any case, even if this was better, it's a gigantic breaking change that is never gonna happen.

pseudo cradle Dec 13, 2022, 9:23 PM

#

Fair enough

#

Hmm

#

What if it was an object that counted as both an empty set and an empty dict, then when added to or an operation was applied, converts into one of those?

feral island Dec 13, 2022, 9:26 PM

#

pseudo cradle What if it was an object that counted as both an empty set and an empty dict, th...

that would be difficult to implement and difficult to understand for users

pseudo cradle Dec 13, 2022, 9:27 PM

#

It would be great fun to implement though! But if it would be difficult to understand or if there was overlapping functions I can see why it would cause issues.

frigid bison Dec 13, 2022, 9:34 PM

#

In #esoteric-python they would definitely like it

pseudo cradle Dec 13, 2022, 9:35 PM

#

Maybe I'll make an implementation and play around with it. Who knows, may be coming to a module near you 😛

feral cedar Dec 13, 2022, 9:51 PM

#

maybe in a statically typed language with type inference that would be fun

flat gazelle Dec 13, 2022, 9:52 PM

#

haskell does do polymorphic literals, and I don't hate it, but it does lead to some odd happenings. I do think the approach python takes with the numeric tower makes the most sense and leads to the fewest edge cases.

quick snow Dec 13, 2022, 9:52 PM

#

I tried it briefly, it gets a bit annoying because it's not easy to do self.__class__ = dict

native flame Dec 14, 2022, 3:41 AM

#

obviously its too late to change anything now but i think {} and {:} for empty sets and dicts wouldnt have been terrible

dusk comet Dec 14, 2022, 6:04 AM

#

{}[::]
Full slice of empty set is empty dict

quick snow Dec 14, 2022, 7:30 AM

#

{*()} ← empty set literal

gray galleon Dec 14, 2022, 8:27 AM

#

is there any performance benefits of using tuples even when the size is not known ahead of time
afaik the main incentive for creating tuples is that their size are known at compile time

flat gazelle Dec 14, 2022, 8:29 AM

#

gray galleon is there any performance benefits of using tuples even when the size is not know...

I wouldn't worry the performance of tuples vs alternate data structures, it is very unlikely it would matter. The main reason to use tuples is to get hashes by their contents.

gray galleon Dec 14, 2022, 8:30 AM

#

flat gazelle I wouldn't worry the performance of tuples vs alternate data structures, it is v...

their main use is for multiple return values?

#

what library use tuples to index a dict or set?

flat gazelle Dec 14, 2022, 8:32 AM

#

ah fair, I meant more main use for tuples where you couldn't use a list for the same thing

grave jolt Dec 14, 2022, 10:16 AM

#

gray galleon their main use is for multiple return values?

Well, for that you could just use a list

#

The only real difference from a list is the hashability

grave jolt Dec 14, 2022, 10:25 AM

#

gray galleon what library use tuples to index a dict or set?

functools.lru_cache does something similar, although not quite

#

The Python impl does use the hash of a tuple though

#

Also, a set of tuples is a common pattern to implement a game board or other set of points. Like a Game of Life

gray galleon Dec 14, 2022, 10:33 AM

#

whats the difference between LOAD_ATTR and LOAD_METHOD
they seem to do the same thing

dusk comet Dec 14, 2022, 10:58 AM

#

flat gazelle I wouldn't worry the performance of tuples vs alternate data structures, it is v...

Tuple hash is not cached, it is calculated on every hash(tup) call
Frozenset's hashes, for example, are cached

dusk comet Dec 14, 2022, 11:03 AM

#

gray galleon whats the difference between `LOAD_ATTR` and `LOAD_METHOD` they seem to do the s...

It works exactly as LOAD_ATTR, but it can do better thing if your attr is a python function
In that case it can not create new bound_method object, but put object and function itself on stack.
PRECALL(or CALL, idk) can look on two top items on stack: if they are not null - they are object and function (and you can call function directly with known first argument and all other passed arguments), otherwise - it is result of accessing attribute (and it can be called as usual).

#

When you are doing a.b(), it likely to be a call to a function (builtin or python), so it is possible to not create bound_method object and call function directly.
a.b(c) in this case works like A.b(a, c) and not like this: bound_method(A.b, a)(c)

quick snow Dec 14, 2022, 11:26 AM

#

Huh, TIL

rose schooner Dec 14, 2022, 11:40 AM

#

quick snow `{*()}` ← empty set literal

just use set() unless you're code golfing and wanna save some characters between set and some identifier before it
it's faster

quick snow Dec 14, 2022, 11:46 AM

#

rose schooner just use `set()` unless you're code golfing and wanna save some characters betwe...

(I know, I was just trolling. {*()} isn't even usually shorter than set().)

rose schooner Dec 14, 2022, 11:49 AM

#

quick snow (I know, I was just trolling. `{*()}` isn't even usually shorter than `set()`.)

what if it was

rose schooner Dec 14, 2022, 11:50 AM

#

dusk comet It works exactly as LOAD_ATTR, but it can do better thing if your attr is a pyth...

‫PRECALL does that in 3.11 but in 3.12 PRECALL is gone and CALL handles that too

warm breach Dec 14, 2022, 4:09 PM

#

gray galleon is there any performance benefits of using tuples even when the size is not know...

uh, I don't think tuple sizes are known at compile time

#

unless you mean tuple literals with other intrinsic literals which get inlined

rose schooner Dec 14, 2022, 10:15 PM

#

gray galleon is there any performance benefits of using tuples even when the size is not know...

well tuple sizes are not really "known" at compile time

#

unless you mean the tuples created by the BUILD_TUPLE or LOAD_CONST opcodes

gray galleon Dec 14, 2022, 11:53 PM

#

rose schooner unless you mean the tuples created by the `BUILD_TUPLE` or `LOAD_CONST` opcodes

yes i mean BUILD_TUPLE and interned tuple

true glacier Dec 15, 2022, 3:04 AM

#

i've inherited code and they were still using setup.py and distutils, but after researching a bit of the state of python packaging im unsure how to take the code to be more up to date

#

am i suppose to still use setup.py but have a pyproject.toml in addition to it?

#

or am i suppose to use setup.cfg or some combination of all 3?

elder blade Dec 15, 2022, 9:32 AM

#

pseudo cradle Do you think having `{}` be an empty set gain any traction? IF `{3,5,2}` is a s...

Would be cool if you could at least do {,} - the same as (,) for tuples

rose schooner Dec 15, 2022, 9:41 AM

#

elder blade Would be cool if you could at least do `{,}` - the same as `(,)` for tuples

tuples can do () already

rose schooner Dec 15, 2022, 9:53 AM

#

pseudo cradle Do you think having `{}` be an empty set gain any traction? IF `{3,5,2}` is a s...

huge backwards incompatibility

grave jolt Dec 15, 2022, 10:16 AM

#

rose schooner just use `set()` unless you're code golfing and wanna save some characters betwe...

how is it faster though? pithink

#

set involves a lookup of a global

#

oh wow, it is faster

#

that's... interesting

prime estuary Dec 15, 2022, 11:13 AM

#

{*()} isn’t explicitly optimised, so it ends up building an empty set, loading the empty tuple, updating the set with the contents (building an iterator probably), then finally giving you the result.

rose schooner Dec 15, 2022, 11:19 AM

#

grave jolt `set` involves a lookup of a global

it uses the new 3.11 opcode specializations and a function designed exactly for looking up globals or builtins

grave jolt Dec 15, 2022, 11:43 AM

#

rose schooner it uses the new 3.11 opcode specializations and a function designed exactly for ...

It's actually faster on 3.9 as well

rose schooner Dec 15, 2022, 11:48 AM

#

grave jolt It's actually faster on 3.9 as well

well that's a little surprising

quick snow Dec 15, 2022, 1:36 PM

#

rose schooner well that's a little surprising

Is it? {*()} isn't evaluated during compilation so it creates a set, loads the tuple, and updates the set with the tuple.
set() is a global lookup, but those got faster before 3.11 (in 3.10, though, I think, and not 3.9?).

rose schooner Dec 15, 2022, 1:37 PM

#

quick snow Is it? `{*()}` isn't evaluated during compilation so it creates a set, loads the...

having it faster in 3.9 is the surprising part

warm breach Dec 15, 2022, 1:54 PM

#

rose schooner having it faster in 3.9 is the surprising part

set() needs to do

LOAD_NAME                0 (set)
CALL_FUNCTION            0

while {*()} is doing

BUILD_SET                0
LOAD_CONST               0 (())
SET_UPDATE               1

#

I guess the extra LOAD_CONST and SET_UPDATE is enough to offset LOAD_NAME being slow?

rose schooner Dec 15, 2022, 1:54 PM

#

warm breach I guess the extra `LOAD_CONST` and `SET_UPDATE` is enough to offset `LOAD_NAME` ...

probably

rose schooner Dec 15, 2022, 1:55 PM

#

warm breach `set()` needs to do ```py LOAD_NAME 0 (set) CALL_FUNCTION ...

BUILD_SET 0 can do the work all by itself

warm breach Dec 15, 2022, 1:55 PM

#

rose schooner `BUILD_SET 0` can do the work all by itself

yeah might as well inline it

#

probably will be faster 🥴

#

!e

import dis
import timeit
from contextlib import suppress

def inline(code: str):
    lines = [ln for ln in code.strip('\n').splitlines() if ln]
    code = ""
    var_locals, names, consts = {}, {}, {None: 0}
    for line in lines:
        s = line.split(maxsplit=2)
        n_code = dis.opmap[s[0]]
        mem = int(s[1]) if len(s) > 1 else 0
        cache = "0000" * dis._inline_cache_entries[n_code]
        code += f"{n_code:02x}{mem:02x}{cache}"
        if len(s) < 3:
            continue
        data_field = map(str.strip, s[2][1:-1].split(','))
        if n_code in dis.haslocal:
            for value in data_field:
                var_locals[value] = 0
        elif n_code in dis.hasconst:
            for value in data_field:
                with suppress(NameError):
                    value = eval(value)
                consts[value] = 0
        elif n_code in dis.hasname:
            for value in data_field:
                if value != "NULL":
                    names[value] = 0

    return (lambda: 0).__code__.replace(
        co_code=bytes.fromhex(code),
        co_consts=tuple(consts.keys()),
        co_names=tuple(names.keys()),
        co_varnames=tuple(var_locals.keys()),
        co_nlocals=len(var_locals),
    )

fn2 = inline("""
RESUME                   0

LOAD_GLOBAL              1 (NULL, range)
LOAD_CONST               1 (300000)
PRECALL                  1
CALL                     1
GET_ITER
FOR_ITER                 4
STORE_FAST               0 (_)

BUILD_SET                0
STORE_FAST               1 (x)
JUMP_BACKWARD            5

LOAD_CONST               0 (None)
RETURN_VALUE
""")

def fn():
    for _ in range(300_000):
        x = set()

print("set():")
timeit.main(['-s', "from __main__ import fn", "fn()"])
print("inlined BUILD_SET:")
timeit.main(['-s', "from __main__ import fn2", "eval(fn2)"])

fallen slateBOT Dec 15, 2022, 1:56 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | set():
002 | 10 loops, best of 5: 32.1 msec per loop
003 | inlined BUILD_SET:
004 | 10 loops, best of 5: 27.4 msec per loop

elder blade Dec 15, 2022, 10:07 PM

#

Can anyone remind me how Python grow's containers? What algorithm is used to determine the new size of it?

rose schooner Dec 15, 2022, 10:08 PM

#

elder blade Can anyone remind me how Python grow's containers? What algorithm is used to det...

what type of container?

#

there's list, set, and dict and they each have different growing patterns

elder blade Dec 15, 2022, 10:10 PM

#

Oh they do? How does a set grow?

rose schooner Dec 15, 2022, 10:22 PM

#

elder blade Oh they do? How does a set grow?

you add to it

feral cedar Dec 15, 2022, 10:24 PM

#

elder blade Can anyone remind me how Python grow's containers? What algorithm is used to det...

it just multiplies the current capacity by a constant once a certain amount of elements are stored

rose schooner Dec 15, 2022, 10:25 PM

#

feral cedar it just multiplies the current capacity by a constant once a certain amount of e...

along with a few other operations

elder blade Dec 15, 2022, 10:25 PM

#

feral cedar it just multiplies the current capacity by a constant once a certain amount of e...

Is that constant different for lists, sets, and dictionaries?

grave jolt Dec 15, 2022, 10:25 PM

#

IIRC dicts and sets can only double?

feral cedar Dec 15, 2022, 10:25 PM

#

yeah. I think for lists it's 9/8 and for dicts it's 3

grave jolt Dec 15, 2022, 10:25 PM

#

oh

#

I'm wrong then

rose schooner Dec 15, 2022, 10:26 PM

#

elder blade Is that constant different for lists, sets, and dictionaries?

actually lists don't even multiply https://github.com/python/cpython/blob/main/Objects/listobject.c#L70

fallen slateBOT Dec 15, 2022, 10:26 PM

#

Objects/listobject.c line 70

new_allocated = ((size_t)newsize + (newsize >> 3) + 6) & ~(size_t)3;```

grave jolt Dec 15, 2022, 10:26 PM

#

Well... this is actually newsize + newsize/8 plus some change

#

kinda funky

feral island Dec 15, 2022, 10:27 PM

#

for sets it's either 4x or 2x

#

depending on whether size > 50k

rose schooner Dec 15, 2022, 10:27 PM

#

idk what this is https://github.com/python/cpython/blob/main/Objects/setobject.c#L242-L247

fallen slateBOT Dec 15, 2022, 10:27 PM

#

Objects/setobject.c lines 242 to 247

/* Find the smallest table size > minused. */
/* XXX speed-up with intrinsics */
size_t newsize = PySet_MINSIZE;
while (newsize <= (size_t)minused) {
    newsize <<= 1; // The largest possible value is PY_SSIZE_T_MAX + 1.
}```

grave jolt Dec 15, 2022, 10:27 PM

#

how can a hash table grow not in powers of 2?

grave jolt Dec 15, 2022, 10:27 PM

#

fallen slate `Objects/setobject.c` lines 242 to 247 ```c /* Find the smallest table size > mi...

left bitshift by 1, so multiplication by 2

#

actually idk why they're bitshifting...

#

kinda confusing tbh

feral island Dec 15, 2022, 10:28 PM

#

fallen slate `Objects/setobject.c` lines 242 to 247 ```c /* Find the smallest table size > mi...

seems like it rounds up to the nearest power of 2

rose schooner Dec 15, 2022, 10:29 PM

#

fallen slate `Objects/setobject.c` lines 242 to 247 ```c /* Find the smallest table size > mi...

if __builtin_clzll() was supported on all compilers it can support that's probably what they'd use

#

then there's https://github.com/python/cpython/blob/main/Objects/dictobject.c#L592-L629 for dicts
which i assume also grows according to powers of 2

GitHub

cpython/dictobject.c at main · python/cpython

The Python programming language. Contribute to python/cpython development by creating an account on GitHub.

pseudo cradle Dec 15, 2022, 11:14 PM

#

Yeah it has to grow by a power of something

pliant tusk Dec 16, 2022, 1:30 AM

#

!e ```py
set(map((l:=iter([0])).setstate, l))

fallen slateBOT Dec 16, 2022, 1:31 AM

#

@pliant tusk :warning: Your 3.11 eval job timed out or ran out of memory.

[No output]

warm breach Dec 16, 2022, 11:30 AM

#

@pliant tusk do you know if there's a way to refer to the own class's pointer in an instance method? Currently I'm doing:


class PyTupleObject(ctypes.Structure):
    _fields_ = ...


PyTupleObject.GetItem = pythonapi["PyTuple_GetItem"]
PyTupleObject.GetItem.argtypes = (ctypes.POINTER(PyTupleObject), Py_ssize_t)
PyTupleObject.GetItem.restype = ctypes.py_object

#

I would like to do something like this but I guess I wouldn't have a way to refer to the class itself in the class definition?

class PyTupleObject(ctypes.Structure):
    _fields_ = ...

    GetItem = pythonapi["PyTuple_GetItem"]
    GetItem.argtypes = '?'

flat gazelle Dec 16, 2022, 11:53 AM

#

the class body runs before the class is actually created, so you may be outta luck, outside of some lazy evaluatation kinda stuff.

pliant tusk Dec 16, 2022, 12:44 PM

#

warm breach I would like to do something like this but I guess I wouldn't have a way to refe...

!e Yea you cannot refer to a class inside its own definition. You could create a class property ```py
from ctypes import *
class bind(property):
def init(self, func, restype=c_int, argtypes=[]):
self.func = func
self.restype = restype
self.argtypes = argtypes
def set_name(self, owner, name):
self.name = name
def get(self, owner_self, owner_cls):
self.func.restype = self.restype if self.restype is not ... else POINTER(owner_cls)
self.func.argtypes = [c if c is not ... else POINTER(owner_cls) for c in self.argtypes]
setattr(owner_cls, self.name, self.func)
return self.func

class PyTupleObject(Structure):
fields = [
('ob_refcount', c_ssize_t),
('ob_type', py_object),
('ob_size', c_ssize_t),
('_ob_items', py_object*0)
]

@property
def ob_items(self):
      return (py_object * self.ob_size).from_address(addressof(self._ob_items))

GetItem = bind(pythonapi['PyTuple_GetItem'], restype=py_object, argtypes=[..., c_ssize_t])

t = ('a', 'b', 'c')
t_s = PyTupleObject.from_address(id(t))
print(PyTupleObject.GetItem(t_s, 0))

fallen slateBOT Dec 16, 2022, 12:44 PM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

warm breach Dec 16, 2022, 12:45 PM

#

👀

#

wait is there a way to bind that as an instance method?

pliant tusk Dec 16, 2022, 12:46 PM

#

yea you could just use a standard property

#

or skip the overwriting part and do some more stuff in __get__

warm breach Dec 16, 2022, 12:49 PM

#

hm...

#

is there a difference between this and casting the address of the struct into a py_object and calling the pythonapi with that

pliant tusk Dec 16, 2022, 12:50 PM

#

the only difference is how the call takes place from python

warm breach Dec 16, 2022, 12:50 PM

#

I think the py_object cast creates a reference?

pliant tusk Dec 16, 2022, 12:51 PM

#

py_object(obj) creates a reference, py_object.from_address does not

warm breach Dec 16, 2022, 12:52 PM

#

wait how does py_object.from_address work again

#

I remember you said it wasn't the id or addressof(struct) right?

pliant tusk Dec 16, 2022, 12:53 PM

#

warm breach wait how does `py_object.from_address` work again

it needs the address of a pointer to a given object

#

or you can do cast(id(obj), py_object) which also works and does not make a new reference

#

(when you use it with .value it will add a reference)

warm breach Dec 16, 2022, 1:24 PM

#

pliant tusk or skip the overwriting part and do some more stuff in `__get__`

!e I guess this isn't too terrible? pithink

from functools import partial
from ctypes import *

class bind(property):
    def __init__(self, func, restype=c_int, argtypes=[]):
        self.func = func
        self.restype = restype
        self.argtypes = argtypes
        self.func_set = False

    def __set_name__(self, owner, name):
        self.name = name

    def __get__(self, owner_self, owner_cls):
        if not self.func_set:
            self.func.restype = self.restype if self.restype is not ... else POINTER(owner_cls)
            self.func.argtypes = [c if c is not ... else POINTER(owner_cls) for c in self.argtypes]
            self.func_set = True
        if owner_self is None:
            return self.func
        return partial(self.func, owner_self)

class PyTupleObject(Structure):
    _fields_ = [
        ('ob_refcount', c_ssize_t),
        ('ob_type', py_object),
        ('ob_size', c_ssize_t),
        ('_ob_items', py_object * 0)
    ]

    @property
    def ob_items(self):
        return (py_object * self.ob_size).from_address(addressof(self._ob_items))

    GetItem = bind(pythonapi['PyTuple_GetItem'], restype=py_object, argtypes=[..., c_ssize_t])

t = ('a', 'b', 'c')
ts = PyTupleObject.from_address(id(t))
print(PyTupleObject.GetItem(ts, 0))
print(ts.GetItem(0))

t2 = (1, 2)
ts2 = PyTupleObject.from_address(id(t2))
print(PyTupleObject.GetItem(ts2, 1))
print(ts2.GetItem(1))

fallen slateBOT Dec 16, 2022, 1:24 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

warm breach Dec 16, 2022, 1:25 PM

#

I guess caching is not possible if accepting use both as class and instance?

#

not sure how I feel about it returning a functools.partial type though

pliant tusk Dec 16, 2022, 1:27 PM

#

It's not that bad

warm breach Dec 16, 2022, 1:28 PM

#

Is that what the descriptor is doing for normal objects anyways?

#

(except in C?)

feral island Dec 16, 2022, 1:31 PM

#

warm breach Is that what the descriptor is doing for normal objects anyways?

kind of? [] on a tuple will look in the tuple type's slots table

warm breach Dec 16, 2022, 1:33 PM

#

feral island kind of? `[]` on a tuple will look in the tuple type's slots table

but don't classes like that also support __getitem__ on class?

#

or does the bytecode already make that __class_getitem__

swift imp Dec 16, 2022, 1:33 PM

#

warm breach but don't classes like that also support `__getitem__` on class?

Abuse __class_getitem__

#

Beat me to it

feral island Dec 16, 2022, 1:38 PM

#

warm breach but don't classes like that also support `__getitem__` on class?

I think the bytecode calls PyObject_GetItem and that tries __class_getitem__ if there's no slot for getitem

#

yes https://github.com/python/cpython/blob/main/Objects/abstract.c#L147

fallen slateBOT Dec 16, 2022, 1:39 PM

#

Objects/abstract.c line 147

PyObject *```

warm breach Dec 16, 2022, 1:40 PM

#

feral island I think the bytecode calls `PyObject_GetItem` and that tries `__class_getitem__`...

ah interesting, never knew that method existed

#

since object class doesn't have __getitem__

feral island Dec 16, 2022, 1:54 PM

#

another fun fact here is that there are two slots for getitem, one for mappings (mp_subscript) and one for sequences (sq_item). I wonder when the sequence one is actually used though, because anything that accepts slices must use the mp_subscript slot. Seems like at least direct calls to PySequence_GetItem from C code will use sq_item.

warm breach Dec 16, 2022, 2:02 PM

#

https://github.com/python/cpython/blob/3.11/Lib/functools.py#L997-L1007

fallen slateBOT Dec 16, 2022, 2:02 PM

#

Lib/functools.py lines 997 to 1007

with self.lock:
    # check if another thread filled cache while we awaited lock
    val = cache.get(self.attrname, _NOT_FOUND)
    if val is _NOT_FOUND:
        val = self.func(instance)
        try:
            cache[self.attrname] = val
        except TypeError:
            msg = (
                f"The '__dict__' attribute on {type(instance).__name__!r} instance "
                f"does not support item assignment for caching {self.attrname!r} property."```

warm breach Dec 16, 2022, 2:02 PM

#

is this the standard way for caching instance attributes?

#

also is the thread lock there really important given the GIL?

feral island Dec 16, 2022, 2:03 PM

#

warm breach is this the standard way for caching instance attributes?

yes, it's the stdlib way

#

the lock was a mistake, let me find the issue

#

https://github.com/python/cpython/issues/87634

GitHub

functools.cached_property incorrectly locks the entire descriptor o...

BPO 43468 Nosy @tim-one, @rhettinger, @ncoghlan, @pitrou, @carljm, @jab, @serhiy-storchaka, @ztane, @graingert, @youtux PRs #27609 Note: these values reflect the state of the issue at the time it w...

warm breach Dec 16, 2022, 2:04 PM

#

ah... interesting

#

what would happen if the lock was just completely removed?

feral island Dec 16, 2022, 2:06 PM

#

user code that relies on the locking might break

#

does such code exist? we're not sure, but backward compatibility is important

warm breach Dec 16, 2022, 2:07 PM

#

isn't it still a race condition on which thread acquires that lock before setting the attribute?

#

whereas before it was a race condition which thread acquires the GIL for setting the dict

#

is there a difference in those 2 things?

feral island Dec 16, 2022, 2:08 PM

#

the body of the property could be doing something that requires only a single thread to be able to access it

#

or maybe if computing the property is really expensive, you never want two threads to do it at once

warm breach Dec 16, 2022, 2:09 PM

#

ah I see, like during the computation before the cache, hm

warm breach Dec 16, 2022, 7:02 PM

#

pliant tusk It's not that bad

realized there was no possible way to type hint the duality of the class / instance descriptor. So now I have this, bind decorator types the pythonapi funcptr using the type hints

@struct
class PyTupleObject(PyVarObject[_Tuple]):
    _ob_item_0: Py_ssize_t * 0

    @bind_api(pythonapi["PyTuple_GetItem"])
    def GetItem(self, index: int) -> object:
        """Return the item at the given index."""

    @bind_api(pythonapi["PyTuple_SetItem"])
    def SetItem(self, index: int, value: object) -> None:
        """Set a value to a given index."""

#

Even mypy is happy now 👍

from einspect.structs import PyTupleObject

instance = PyTupleObject.from_object(("abc", "def"))
reveal_type(instance.GetItem)

ret = instance.SetItem(1, 650)
reveal_type(ret)

note: Revealed type is "def (index: builtins.int) -> builtins.object"
note: Revealed type is "None"
Success: no issues found in 1 source file

#

type-safe unsafe python catok

pliant tusk Dec 16, 2022, 7:04 PM

#

Nice

warm breach Dec 16, 2022, 7:16 PM

#

can anything bad happen if you temporarily decrease an object's refcount to 1 👀

from einspect.structs import PyTupleObject

x = ("a", "b")
print(x)
ls = [x, x, x]

tup = PyTupleObject.from_object(x)
tup.ob_refcnt -= 5
tup.SetItem(0, 900)
tup.SetItem(1, "hi")
tup.ob_refcnt += 5

print(tup.into_object().value)
print(ls)

('a', 'b')
(900, 'hi')
[(900, 'hi'), (900, 'hi'), (900, 'hi')]

#

it shouldn't trigger gc.. right?

feral island Dec 16, 2022, 7:19 PM

#

warm breach can anything bad happen if you temporarily decrease an object's refcount to 1 👀...

if something else holds a ref to it and decrefs it, you'll lose the object

warm breach Dec 16, 2022, 7:27 PM

#

feral island if something else holds a ref to it and decrefs it, you'll lose the object

yeah that's pretty bad, would calling PyTuple_SET_ITEM directly be slightly safer 🥴

#

actually is there even a difference in calling that and just directly modifying the pointer array at ob_item

#

seems like it just does that anyways?https://github.com/python/cpython/blob/3.11/Include/cpython/tupleobject.h#L33-L37

fallen slateBOT Dec 16, 2022, 7:28 PM

#

Include/cpython/tupleobject.h lines 33 to 37

static inline void
PyTuple_SET_ITEM(PyObject *op, Py_ssize_t index, PyObject *value) {
    PyTupleObject *tuple = _PyTuple_CAST(op);
    tuple->ob_item[index] = value;
}```

feral island Dec 16, 2022, 7:31 PM

#

warm breach yeah that's pretty bad, would calling `PyTuple_SET_ITEM` directly be slightly sa...

I don't think you can call static inline directly

pliant tusk Dec 16, 2022, 7:43 PM

#

Yea you cannot

#

(Mind you, it's very easy to reimplement with the ctypes stucture that @warm breach already has defined)

warm breach Dec 16, 2022, 7:47 PM

#

pliant tusk Yea you cannot

this library seems to re-export it
https://github.com/brandtbucher/pycapi/blob/master/pycapi.c#L7543-L7553

fallen slateBOT Dec 16, 2022, 7:47 PM

#

pycapi.c lines 7543 to 7553

static PyObject *
capi_PyTuple_SET_ITEM(PyObject *Py_UNUSED(self), PyObject *args)
{
    PyObject *arg0;
    Py_ssize_t arg1;
    PyObject *arg2;
    if (!PyArg_ParseTuple(args, "OnO:PyTuple_SET_ITEM", &arg0, &arg1, &arg2)) {
        return NULL;
    }
    PyTuple_SET_ITEM(arg0, arg1, arg2);
    if (PyErr_Occurred()) {```

warm breach Dec 16, 2022, 7:47 PM

#

and does seem to sort of work

from pycapi import PyTuple_SET_ITEM

t = (1, 2, 3)
ls = [t, t]

PyTuple_SET_ITEM(t, 1, "what")

print(ls)

[(1, 'what', 3), (1, 'what', 3)]

#

I don't think it's really any safer than anything else though

pliant tusk Dec 16, 2022, 7:48 PM

#

Yea you can export it with additional C code, static inline means that there isn't a function pointer associated with the function after compilation, the assembly is just interposed into the functions where it is used

warm breach Dec 16, 2022, 7:59 PM

#

also what exactly is an "immortal interned" string 👀

#

I haven't really come across any that have that state

#

so it's deprecated? hm
https://github.com/python/cpython/blob/3.11/Objects/unicodeobject.c#L15589

fallen slateBOT Dec 16, 2022, 8:03 PM

#

Objects/unicodeobject.c line 15589

"PyUnicode_InternImmortal() is deprecated; "```

meager relic Dec 16, 2022, 8:35 PM

#

import library vs from library import * - why would someone use from .. * and is there a reason to / use case where it's best practice?

warm breach Dec 16, 2022, 8:50 PM

#

meager relic `import library` vs `from library import *` - why would someone use `from .. *` ...

there isn't really, but it's up to your judgement

#

star imports make it confusing which attribute is overriden and where names come from

#

python static inference is already not great so it's as confusing as it gets without any help from the ide either

quick snow Dec 16, 2022, 10:22 PM

#

meager relic `import library` vs `from library import *` - why would someone use `from .. *` ...

The only legitimate reason I can think of is an __init__.py that contains only imports; something like

from .models import *
from .tasks import *
from .utils import do_thing

tacit hawk Dec 16, 2022, 10:43 PM

#

Is __del__ granted to be called except at the interpreter exit?

quick snow Dec 16, 2022, 10:45 PM

#

tacit hawk Is `__del__` granted to be called except at the interpreter exit?

When the reference count reaches zero, yes.

spark magnet Dec 16, 2022, 11:43 PM

#

tacit hawk Is `__del__` granted to be called except at the interpreter exit?

you shouldn't rely on __del__. it might not be called, and running code there can cause problems.

pliant tusk Dec 17, 2022, 12:04 AM

#

I thought __del__ would always run, it just wasn't deterministic when

raven ridge Dec 17, 2022, 12:32 AM

#

!d object.__del__

fallen slateBOT Dec 17, 2022, 12:32 AM

#

object.\_\_del\_\_


object.__del__(self)```
Called when the instance is about to be destroyed. This is also called a finalizer or (improperly) a destructor. If a base class has a [`__del__()`](https://docs.python.org/3/reference/datamodel.html#object.__del__ "object.__del__") method, the derived class’s [`__del__()`](https://docs.python.org/3/reference/datamodel.html#object.__del__ "object.__del__") method, if any, must explicitly call it to ensure proper deletion of the base class part of the instance.

It is possible (though not recommended!) for the [`__del__()`](https://docs.python.org/3/reference/datamodel.html#object.__del__ "object.__del__") method to postpone destruction of the instance by creating a new reference to it. This is called object *resurrection*. It is implementation-dependent whether [`__del__()`](https://docs.python.org/3/reference/datamodel.html#object.__del__ "object.__del__") is called a second time when a resurrected object is about to be destroyed; the current [CPython](https://docs.python.org/3/glossary.html#term-CPython) implementation only calls it once.

raven ridge Dec 17, 2022, 12:34 AM

#

Mm. Just after that, it says

It is not guaranteed that __del__() methods are called for objects that still exist when the interpreter exits.

#

It's not deterministic when it runs or which thread it runs in, it's not guaranteed to be called when the interpreter is shutting down, and if it is called when the interpreter is shutting down it might not be able to find global variables or modules that it needs.

#

Also, it won't get run if someone pauses the cycle collecting GC with gc.disable()

grave jolt Dec 17, 2022, 12:45 AM

#

raven ridge Also, it won't get run if someone pauses the cycle collecting GC with `gc.disabl...

only for cycles though

#

radiant garden Dec 17, 2022, 12:59 AM

#

yeah, refcount destructors don't need any gc

warm breach Dec 17, 2022, 2:36 AM

#

tacit hawk Is `__del__` granted to be called except at the interpreter exit?

I think you can pretty much use weakref.finalize to replace it for running tasks on GC

#

which is what stdlib implementations like tempfile uses for cleanup, haven't really seen __del__ used in stdlib

quasi oriole Dec 17, 2022, 6:10 AM

#

hey how can i solve this error

#

https://textdoc.co/uSrgPF5ypc8HAf1b

Textdoc - Create, Edit, Share and Save Text Files

A secure web app that allows you to create, edit, share and save text files to your device or to Google Drive as an editable Doc

#

its run fine in vs code but in jupyter...

rugged harbor Dec 17, 2022, 6:12 AM

#

hey, please post in #1035199133436354600 @quasi oriole

quasi oriole Dec 17, 2022, 6:12 AM

#

ok

gray galleon Dec 17, 2022, 9:15 AM

#

does python have callcc?

dapper lily Dec 17, 2022, 9:18 AM

#

IIUC yield or async/await is essentially a form of call/cc

gray galleon Dec 17, 2022, 9:23 AM

#

🤔

dusk comet Dec 17, 2022, 11:27 AM

#

raven ridge It's not deterministic when it runs or which thread it runs in, it's not guarant...

I never understood how shutting down works. Why some globals can be missing? Why it is deleting globals in weird order (underscored first)?
Why not just call __del__ on every object (even if refcount is not zero) and then just free all memory?

dusk comet Dec 17, 2022, 11:29 AM

#

dapper lily IIUC yield or async/await is essentially a form of call/cc

What is call/cc?
EDIT: https://en.wikipedia.org/wiki/Call-with-current-continuation

warm breach Dec 17, 2022, 3:39 PM

#

dusk comet I never understood how shutting down works. Why some globals can be missing? Why...

it does normally, but since you can keep an object alive by keeping a reference in __del__ you can postpone the GC of that reference

#

but at interpreter shutdown it needs to go at some order to shut things down, you can't have an object alive forever

#

ideally your weakref.finalize or __del__ should have run well before interpreter shutdown

iron glade Dec 17, 2022, 6:11 PM

#

Hi everyone, I need some work on python to get hands-on practice. Thanks

raven ridge Dec 17, 2022, 6:56 PM

#

dusk comet I never understood how shutting down works. Why some globals can be missing? Why...

I never understood how shutting down works. Why some globals can be missing?
At shutdown, the interpreter destroys every module that was imported. One step in destroying a module object is clearing its globals dict. If clearing that globals dict causes an object to be garbage collected (because a global variable in that dict owned the last live reference to an object), then that object's __del__ will run. If that object's __del__ tries to use a global variable from a module whose globals dict has already been cleared, it will get a NameError, because the variable it's trying to access no longer exists as a global variable for that module.

Why it is deleting globals in weird order (underscored first)?
It has to pick an order to delete things in. This documented order tries to make it possible for you to work around those NameError, by guaranteeing that certain globals will be cleared before others - so if your global is one of that's cleared in the first pass (something whose name starts with _) then its __del__ can safely refer to globals that have not yet been cleared (things whose names don't start with _).

Why not just call __del__ on every object (even if refcount is not zero) and then just free all memory?
Memory is the least interesting resource we can talk about here. When the process is about to end, there's (almost) no reason to free memory at all. Freeing memory one allocation at a time is slow, and the OS kernel will reclaim all the memory allocated to the process when the process dies, anyway. Other resources like opened files, sockets, message queues, etc are the real reason why it's worthwhile for the interpreter to try to clean things up when shutting down. And if you call __del__ on every object in a random order, you'll still have the same problem: things will try to use an object after its __del__ has run. And instead of getting a NameError you might get an OSError, for instance, for trying to write to a closed file.

warm breach Dec 17, 2022, 7:32 PM

#

raven ridge > I never understood how shutting down works. Why some globals can be missing? A...

^ and of course there's also the possibility the interpreter never gets to shutdown

dusk comet Dec 17, 2022, 7:45 PM

#

Memory is the least interesting resource we can talk about here
But it must be cleared if i embedded interpreter in my app, used it once and then finalized it. If memory is not cleared, i will get a memory leak. In other cases i agree, there is no reason to free memory.

And instead of getting a NameError you might get an OSError, for instance
Yeah, this is tricky. Wrapping those errors in try-except or if gc.is_finalized(x) can't solve all problems. I cant come up with better idea.

I think, relying on __del__ of object with several references is bad. __del__ is good for one-use or one-reference objects (like file descriptors). In other cases it is better to use .close() or something like that manually.

Are there any use cases of __del__ that is used to finalize at shutting down time? (there are no such cases in stdlib, but maybe there are some in other libs)

What's the difference between __del__ and weakref.finalize? Weakrefs make reference graph more sparse and they are not using global names, but are there any benefits of using weakref.finalize?

warm breach Dec 17, 2022, 7:48 PM

#

dusk comet > Memory is the least interesting resource we can talk about here But it must be...

weakref.finalize probably offers more specific behavior on all implementations

raven ridge Dec 17, 2022, 7:48 PM

#

dusk comet > Memory is the least interesting resource we can talk about here But it must be...

But it must be cleared if i embedded interpreter in my app, used it once and then finalized it. If memory is not cleared, i will get a memory leak.
I said "almost" 🙂

Yeah, embedding the interpreter is one case where it's worthwhile to free memory (since the process might not be dying, and someone might initialize a new interpreter). And running under a tool that's checking for memory leaks is another, since it's a way for the interpreter to indicate to the tool that it hadn't lost track of some memory.

warm breach Dec 17, 2022, 7:49 PM

#

objects being GC'd when their refcount hits 0 is a cpython implementation detail. Along with __del__ getting called immediately after GC

raven ridge Dec 17, 2022, 7:50 PM

#

I think, relying on __del__ of object with several references is bad. __del__ is good for one-use or one-reference objects (like file descriptors).
An object rarely if ever knows how many references to it there will be. And things with only 1 reference are quite rare.

warm breach Dec 17, 2022, 7:51 PM

#

I suppose there's objects created in ctypes.py_object having 1 reference exactly

raven ridge Dec 17, 2022, 7:52 PM

#

it's not that it can't happen, just that it's quite a special case, not the norm. And not something that you generally have control over unless you're writing C code (or ctypes)

raven ridge Dec 17, 2022, 7:53 PM

#

warm breach `weakref.finalize` probably offers more specific behavior on all implementations

Specifically, weakref.finalize finalizers are called from atexit, which happens before the interpreter starts destroying module globals. They're guaranteed to fire at a time before this tricky stuff about module globals being destroyed applies. And they're guaranteed to fire, while __del__ is explicitly not.

warm breach Dec 17, 2022, 7:59 PM

#

!e though, assuming interpreter is still alive when gc collects 😔

import ctypes
import weakref

class Foo:
    def __init__(self):
        self._finalize = weakref.finalize(self, self.cleanup)
        
    @classmethod
    def cleanup(cls):
        print("Important cleanup tasks")

    def __del__(self):
        print("Important cleanup tasks")

f = Foo()
ctypes.py_object.from_address(-1).value
print(f)

fallen slateBOT Dec 17, 2022, 7:59 PM

#

@warm breach :warning: Your 3.11 eval job has completed with return code 139 (SIGSEGV).

[No output]

raven ridge Dec 17, 2022, 8:09 PM

#

They also don't run if your machine is unplugged and the battery dies

#

more news at 11

dusk comet Dec 17, 2022, 8:16 PM

#

they also don't run after os._exit() obviously 😄

>>> import os
>>> x = type('',(),dict(__del__=lambda*a:print('IMPORTANT CLEANUP')))()
>>> del x
IMPORTANT CLEANUP
>>> x = type('',(),dict(__del__=lambda*a:print('IMPORTANT CLEANUP')))()
>>> os._exit(1)
*nothing*

#

my repl is absolutely broken

dusk comet Dec 18, 2022, 1:58 AM

#

pliant tusk !e ```py set(map((l:=iter([0])).__setstate__, l)) ``` would something like this ...

same with {*iter(int,1)}

rose schooner Dec 18, 2022, 1:59 AM

#

dusk comet same with `{*iter(int,1)}`

same with all the other unpacks

rose schooner Dec 18, 2022, 2:01 AM

#

pliant tusk !e ```py set(map((l:=iter([0])).__setstate__, l)) ``` would something like this ...

happens for <builtin iterable>(<infinite iterator not implemented in python>) or an unpack of the infinite iterator

gray galleon Dec 18, 2022, 2:02 AM

#

can this crash python ```py
class A:
a = None
while True:
class B(A):
pass

class C(A):
pass

B.a = C
C.a = B

dusk comet Dec 18, 2022, 2:03 AM

#

MemoryError probably

gray galleon Dec 18, 2022, 2:04 AM

#

!e ```py
class A:
a = None

while True:
class B(A):
pass

class C(A):
pass

B.a = C
C.a = B

fallen slateBOT Dec 18, 2022, 2:04 AM

#

@gray galleon :warning: Your 3.11 eval job timed out or ran out of memory.

[No output]

dusk comet Dec 18, 2022, 2:06 AM

#

no, it is not even using a lot of memory
it is creating new classes and old are GC'd

gray galleon Dec 18, 2022, 2:07 AM

#

aren’t those classes circularly referenced

dusk comet Dec 18, 2022, 2:07 AM

#

class A:
  a = 1

while 1:
  class A(A): ... # memory leaking

A.a # very very slow

dusk comet Dec 18, 2022, 2:09 AM

#

gray galleon aren’t those classes circularly referenced

when new classes are created, old are no longer referenced by anything (except each other), so they can be GC'd

gray galleon Dec 18, 2022, 2:10 AM

#

dusk comet when new classes are created, old are no longer referenced by anything (except e...

aren’t those circular classes have non zero reference count

dusk comet Dec 18, 2022, 2:11 AM

#

they are referencing each other, yes
but GC can figure out that nothing is referencing these two classes (except these classes itself), so they can be safely GC'd

#

It is very similar to this example:```py

a, b = [], []
a.append(b)
b.append(a)
a, b
([[<Recursion on list with id=2168809581952>]], [[<Recursion on list with id=2168809582784>]])
del a, b

rose schooner Dec 18, 2022, 2:12 AM

#

dusk comet It is very similar to this example:```py >>> a, b = [], [] >>> a.append(b) >>> b...

huh? ```py

refcounts in comments

a = [b := []] # a: 1, b: 2
b.append(a) # a: 2, b: 2
del b # a: 1, b: 1
del a # a: 0, b: 0

gray galleon Dec 18, 2022, 2:14 AM

#

dusk comet they are referencing each other, yes but GC can figure out that nothing is refer...

can the gc detect self referenced object (like the class A(A): example)

dusk comet Dec 18, 2022, 2:15 AM

#

rose schooner huh? ```py # refcounts in comments a = [b := []] # a: 1, b: 2 b.append(a) # a: 2...

no

# refcounts in comments
a = [b := []]  # a: 1, b: 2
b.append(a)    # a: 2, b: 2
del b          # a: 2, b: 1  a is still 2 because it is referenced in namespace and in b
del a          # a: 1, b: 1  reference cycle
# at some point after that GC will run and collect this cycle:
               # a: 0, b: 0

dusk comet Dec 18, 2022, 2:16 AM

#

gray galleon can the gc detect self referenced object (like the `class A(A):` example)

yes, it can detect even more complicated isolated structures

rose schooner Dec 18, 2022, 2:16 AM

#

nvm yeah

#

‫b only Py_DECREF's when it's getting deleted

rose schooner Dec 18, 2022, 2:18 AM

#

dusk comet yes, it can detect even more complicated isolated structures

like this? ```py
class A:
def init(self):
self.self = self

A()

dusk comet Dec 18, 2022, 2:20 AM

#

it can detect everything

#

https://devguide.python.org/internals/garbage-collector/

#

so in python you can get memory leak if you:

have memory leak in C
have no actual memory leak (you are still holding reference somewhere, but dont know about it/dont use it)
3*) messed with internals and broken something (regular #esoteric-python stuff)

gray galleon Dec 18, 2022, 2:29 AM

#

acc = ()
while True:
  acc = acc, acc
```this actually crashes

raven ridge Dec 18, 2022, 2:29 AM

#

sure. That uses infinite memory.

#

CPython has two different garbage collection methods. Objects are destroyed instantly if their reference count ever drops to 0, and they're destroyed if the cycle-collecting GC runs and detects that those objects are part of a cycle, and that cycle has no references into it from outside the cycle.

gray galleon Dec 18, 2022, 2:31 AM

#

dusk comet it can detect everything

so it works as well as traditional tracing gcs

raven ridge Dec 18, 2022, 2:32 AM

#

it is a proper GC, yes.

boreal umbra Dec 18, 2022, 2:32 AM

#

raven ridge CPython has two different garbage collection methods. Objects are destroyed inst...

an object belongs to a cycle if it can't be discovered in a tree traversal from the scope's symbol table, right?

feral island Dec 18, 2022, 2:33 AM

#

boreal umbra an object belongs to a cycle if it can't be discovered in a tree traversal from ...

if it can be discovered in a traversal from its own children

raven ridge Dec 18, 2022, 2:33 AM

#

https://devguide.python.org/internals/garbage-collector/#identifying-reference-cycles describes the algorithm

boreal umbra Dec 18, 2022, 2:36 AM

#

feral island if it can be discovered in a traversal from its own children

right. but that doesn't tell you if that cycle is disconnected from the rest of the reference graph. so if an object has a non-zero reference count, but there's some graph traversal whereby it can't be visited (I'm not totally sure what that is--I'll have to read the link godly just dropped), it must be part of a disconnected cycle.

#

(I hope I'm not coming off as confrontational. I'm just trying to clarify what I meant, which is hard when I don't fully understand what I'm talking about 😛 )

feral island Dec 18, 2022, 2:38 AM

#

boreal umbra right. but that doesn't tell you if that cycle is disconnected from the rest of ...

oh yes, that does make sense. though I think there are other GC roots than the scope's symbol table

rose schooner Dec 18, 2022, 2:39 AM

#

so i tested gc.collect() using a C extension c static PyObject * a_c_test_refcnt(PyObject *self, PyObject *o) { PyObject *s = PyTuple_GET_ITEM(o, 0); PyObject *non = PyTuple_GET_ITEM(o, 1); Py_INCREF(non); Py_DECREF(o); PyGC_Collect(); printf("%zd\n", Py_REFCNT(s)); return non; } (takes a tuple in the form of (obj, None) because i have no idea why all the basic python structures/objects, including Py_None, point to garbage addresses)

#

seems to work ```py

from a_c import test_refcnt as t
class X:
... def init(s):s.s=s
...
t((X(),None))
-2459565876494606883

raven ridge Dec 18, 2022, 2:40 AM

#

feral island oh yes, that does make sense. though I think there are other GC roots than the s...

it doesn't start from roots at all. It just iterates over a linked list of all GC-aware objects.

feral island Dec 18, 2022, 2:40 AM

#

I think you are reading garbage there since you're incorrectly DECREFing the tuple

rose schooner Dec 18, 2022, 2:40 AM

#

feral island I think you are reading garbage there since you're incorrectly DECREFing the tup...

wdym?

#

‫the only reference gets passed to the function right

raven ridge Dec 18, 2022, 2:41 AM

#

Py_DECREF(o); is invalid. You're decrementing the reference count of a tuple that you don't own a reference to.

#

the reference gets passed to the function right
a borrowed reference is.

rose schooner Dec 18, 2022, 2:42 AM

#

where else does it go anyway?

feral island Dec 18, 2022, 2:42 AM

#

raven ridge it doesn't start from roots at all. It just iterates over a linked list of all G...

oh clever, it knows which things are reachable because they have references from outside the GC

raven ridge Dec 18, 2022, 2:42 AM

#

yep.

raven ridge Dec 18, 2022, 2:43 AM

#

rose schooner where else does it go anyway?

when your function is called, Py_REFCNT(o) should show 1 - there is a single reference to o, owned by the caller of a_c_test_refcnt

rose schooner Dec 18, 2022, 2:43 AM

#

raven ridge when your function is called, `Py_REFCNT(o)` should show 1 - there is a single r...

‫so the caller in this case is the program

#

and it creates a temporary tuple that it passes to CALL

feral island Dec 18, 2022, 2:44 AM

#

rose schooner ‫so the caller in this case is the program

yes. after the call returns, it DECREFs the tuple

raven ridge Dec 18, 2022, 2:44 AM

#

the caller is PyObject_CallObject or something like that

#

but yes - the thing that creates the temporary tuple is the thing that's also supposed to destroy it (unless you increase its reference count, saving a reference to it for yourself)

rose schooner Dec 18, 2022, 2:45 AM

#

feral island yes. after the call returns, it DECREFs the tuple

hm

#

‫?it should've aborted then because of negative reference count

raven ridge Dec 18, 2022, 2:46 AM

#

when you decrement its reference count, a) it drops a reference that you didn't own (likely destroying that object), and b) when the calling frame drops the reference that it did own, it either decrements it to -1, or it decrements some entirely unrelated object if the pointer was reused.

#

or it segfaults if a new allocation reused that same pointer, though that's unlikely to happen in your example.

#

I think python -Xdev would catch your negative reference count

rose schooner Dec 18, 2022, 2:47 AM

#

raven ridge I think `python -Xdev` would catch your negative reference count

it is -X dev

feral island Dec 18, 2022, 2:47 AM

#

rose schooner ‫?it should've aborted then because of negative reference count

or maybe it just writes into some now-unused memory and it's harmless

rose schooner Dec 18, 2022, 2:47 AM

#

rose schooner it is `-X dev`

it exits fine

raven ridge Dec 18, 2022, 2:47 AM

#

rose schooner it is `-X dev`

the space isn't needed

#internals-and-peps

refcounts in comments