feral island Feb 10, 2023, 10:31 PM

#

honestly the whole thing about not using f-strings in logging mostly feels like premature optimization to me

halcyon trail Feb 10, 2023, 10:32 PM

#

I mean, yeah, because we're currently in a situation where there's a trade-off between readability and optimization

#

I also just default to using f-strings for logging

deep nova Feb 10, 2023, 10:32 PM

#

THIIIIIIIIIIIIS

halcyon trail Feb 10, 2023, 10:32 PM

#

it's only premature optimization if you have to do extra work though or make compromises

#

if "the way" is both ergonomic and fast then it's not premature anything

grave jolt Feb 10, 2023, 10:33 PM

#

deep nova THIIIIIIIIIIIIS

I mean, I understand Agda using the literal letter λ for anonymous functions. But that's agda (and λ is actually good-looking and concise unlike lambda)

#

even haskell uses \x -> x

halcyon trail Feb 10, 2023, 10:33 PM

#

writing logger.info { f"Hello {user}" } seems pretty nice to me

halcyon trail Feb 10, 2023, 10:34 PM

#

grave jolt even haskell uses `\x -> x`

I like the approach of swift and kotlin where, I imagine, they sat down on the very first day and said "okay, lambdas get the absolute best syntax in the language. Now that's done, let's look at everything else"

grave jolt Feb 10, 2023, 10:36 PM

#

halcyon trail writing `logger.info { f"Hello {user}" }` seems pretty nice to me

Hey, let's do it in the Enterprise Python Style, Extra Clean and Readable ™️
```py
@logger.info
def log_hello_user_greeting() -> str:
    """
    Log a greeting phrase mentioning the user's name, but
    only if the logging verbosity is set to :logging.INFO or higher.
    """
    user_to_be_greeted = user
    greeting = "Hello"
    return f"{greeting} {user_to_be_greeted}"
```

warm breach Feb 10, 2023, 10:38 PM

#

😔 https://github.com/python/cpython/blob/main/Objects/descrobject.c#L1268-L1269

fallen slateBOT Feb 10, 2023, 10:38 PM

#

Objects/descrobject.c lines 1268 to 1269

```c
/* This has no reason to be in this file except that adding new files is a
   bit of a pain */
```

feral island Feb 10, 2023, 10:38 PM

#

that one makes a lot more sense than reversed in enumobject.c honestly

#

descrobject.c is for all the internal descriptors

warm breach Feb 10, 2023, 10:39 PM

#

can you make a wrapper object in python?

#

or is it only for slot methods?

#

int.__add__ is a "<slot wrapper ..." though

#

this thing is supposed to be a "<method wrapper ..."?

feral island Feb 10, 2023, 10:41 PM

#

__add__ is a slot

pliant tusk Feb 10, 2023, 10:42 PM

#

!e print(type(1 .__add__))

fallen slateBOT Feb 10, 2023, 10:42 PM

#

@pliant tusk :white_check_mark: Your 3.11 eval job has completed with return code 0.

<class 'method-wrapper'>
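
A small sketch of the distinction being discussed, assuming CPython (the type names are implementation details, and `Money` is a hypothetical class made up for the illustration). Looking `__add__` up on the type gives the unbound "slot wrapper", looking it up on an instance gives the bound "method-wrapper", and a `__add__` written in Python is just a plain function:

```python
import types

# Looked up on the type: the unbound "slot wrapper" descriptor.
unbound = int.__add__
# Looked up on an instance: the descriptor binds it, giving a "method-wrapper".
bound = (1).__add__

print(type(unbound).__name__)
print(type(bound).__name__)


class Money:
    """Hypothetical class, used only for this illustration."""

    def __init__(self, amount):
        self.amount = amount

    def __add__(self, other):
        return Money(self.amount + other.amount)


# A dunder method defined in Python is an ordinary function, not a wrapper.
python_level = Money.__add__
print(type(python_level).__name__)
```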

grave jolt Feb 10, 2023, 10:43 PM

#

grave jolt Hey, let's do it in the Enterprise Python Style, Extra Clean and Readable ™️ ```...

grave jolt Feb 10, 2023, 10:44 PM

#

grave jolt Hey, let's do it in the Enterprise Python Style, Extra Clean and Readable ™️ ```...

Improved

📎 HelloUserGreeterLogger.py

feral island Feb 10, 2023, 10:44 PM

#

grave jolt Improved

you hardcoded "Hello". Rejected.

grave jolt Feb 10, 2023, 10:44 PM

#

fuck, I pinned something

#

fixed

grave jolt Feb 10, 2023, 10:45 PM

#

feral island you hardcoded "Hello". Rejected.

Wdym hardcoded?! I can subclass this and implement my own greeting

#

that's just good design, provide sensible defaults!

feral cedar Feb 10, 2023, 10:47 PM

#

smfh. open to extension, closed to modification

halcyon trail Feb 10, 2023, 10:47 PM

#

if we have learned anything about software engineering in the last 30 years

#

it's that inheritance is always the best

#

always

grave jolt Feb 10, 2023, 10:48 PM

#

yeah

#

every human inherited stuff from their mom and dad

halcyon trail Feb 10, 2023, 10:48 PM

#

it solves all problems, once and for all

feral cedar Feb 10, 2023, 10:48 PM

#

grave jolt every human inherited stuff from their mom and dad

damn. multiple inheritance was always the true way to do things

grave jolt Feb 10, 2023, 10:48 PM

#

feral cedar smfh. open to extension, closed to modification

yeah, it's open to extension!

#

You won't be able to modify this code because we need to book a meeting to make a change in a docstring. As that's potentially a breaking change for our documentation readers.

halcyon trail Feb 10, 2023, 10:49 PM

#

I can't accept any enterprise code that doesn't include a factory

#

rejected

grave jolt Feb 10, 2023, 10:49 PM

#

Isn't HelloUserGreeterLogger a log message factory?

halcyon trail Feb 10, 2023, 10:50 PM

#

how are people supposed to create HelloUserGreeterLogger

#

hmm? hmmm?

grave jolt Feb 10, 2023, 10:50 PM

#

There's obviously a metaclass mechanism

halcyon trail Feb 10, 2023, 10:50 PM

#

do you want people to soil themselves with touching a concrete constructor

grave jolt Feb 10, 2023, 10:50 PM

#

in LoggerProtocol

halcyon trail Feb 10, 2023, 10:50 PM

#

😛

#

in a reddit thread recently people were levelling charges like this unironically at logging and it made me sad

#

"stinks of Java" 😦

grave jolt Feb 10, 2023, 10:56 PM

#

halcyon trail hmm? hmmm?

```py
# noqa
from __future__ import annotations

import asyncio
from typing import TypeVar

from logrossmeister.utils import MetaLoggerProtocolFactoryProtocolRepositoryProtocolFactory

LogUserReturnTypeT = TypeVar("LogUserReturnTypeT", bound=None)

LOG_USER_SLEEPING_TIME_CONSTANT_SECONDS = 0.217

async def log_user(user_to_be_greeted: UserProtocol | LogUserReturnTypeT) -> LogUserReturnTypeT:
    if user_to_be_greeted is None:
        return user_to_be_greeted
    meta_logger_protocol_factory_protocol_repository =\
        await MetaLoggerProtocolFactoryProtocolRepositoryProtocolFactory.get()
    meta_logger_protocol_factory =\
        await meta_logger_protocol_factory_protocol_repository.get()
    meta_logger = await meta_logger_protocol_factory.get(HelloUserGreeterLogger)
    async with meta_logger.lock():
        await meta_logger.args.clear()
        await meta_logger.args.append_("user_to_be_greeted")
        meta_logger.args.user_to_be_greeted = user_to_be_greeted
        loggable = await meta_logger.create_loggable(mongodb=True, async_=True, django=True)
        await loggable.log()
        await meta_logger.args.clear()
        await asyncio.sleep(LOG_USER_SLEEPING_TIME_CONSTANT_SECONDS)
```

halcyon trail Feb 10, 2023, 10:57 PM

#

yes

grave jolt Feb 10, 2023, 10:57 PM

#

now as an exercise, write a test suite for this function

warm breach Feb 10, 2023, 10:57 PM

#

grave jolt ```py # noqa from __future__ import annotations from logrossmeister.utils import...

no class? 😔

halcyon trail Feb 10, 2023, 10:57 PM

#

I feel you growing powerful. Now strike me down

#

and your journey to the dark side shall be complete

grave jolt Feb 10, 2023, 10:57 PM

#

warm breach no class? 😔

Yeah, we're doing functional programming now.

#

fixed*

grave jolt Feb 10, 2023, 11:14 PM

#

(btw I'm mildly sorry for shitposting in this serious channel)

halcyon trail Feb 10, 2023, 11:23 PM

#

grave jolt (btw I'm mildly sorry for shitposting in this serious channel)

enterprise means never having to say you're sorry

#

also what's with your new icon thingie. is that a Rust reference

grave jolt Feb 10, 2023, 11:24 PM

#

it's... complicated

halcyon trail Feb 10, 2023, 11:24 PM

#

weird I thought I asked about your icon not a relationship status. discord are you ok

raven ridge Feb 10, 2023, 11:25 PM

#

we were talking about the syntactic macros PEP the other day - it seems that this would be a reasonable use for macros in Python, actually. People want:
a) To be able to sprinkle logging code in their application without slowing it down
b) To be able to use f-strings for forming their log messages
c) To write their log statements in a way that's succinct and readable

If logging was macro-based, we'd be able to accomplish all 3, by wrapping log call arguments in an object that formats lazily automatically, so that writing `info!(logger, f"Guess what: {expensive_call()}")` gets translated automagically to
```py
logger.info(LazyLoggingFormatter(lambda: f"Guess what: {expensive_call()}"))
```

Or hell, we could just do it with a lazy f-string macro in the first place:
```py
import! lazyformat as lf
logger.info(lf!"Guess what: {expensive_call()}")
```

#

@warm breach You were asking for examples of places where the syntactic macros proposal might be useful, and I think this is a pretty reasonable one.
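
For reference, the wrapper half of this idea already works without macros. A minimal sketch, assuming a hypothetical `LazyFormat` helper (not part of the standard library): `logging` converts a non-string message with `str()` only when a record is actually formatted, so a `__str__` method is enough to defer the f-string.

```python
import io
import logging


class LazyFormat:
    """Hypothetical helper: builds the message only when str() is called."""

    def __init__(self, make_message):
        self._make_message = make_message

    def __str__(self):
        return self._make_message()


calls = []


def expensive_call():
    # Stand-in for real work; records that it ran.
    calls.append("called")
    return 42


stream = io.StringIO()
logger = logging.getLogger("lazy_format_demo")
logger.addHandler(logging.StreamHandler(stream))
logger.propagate = False
logger.setLevel(logging.INFO)

# DEBUG is disabled, so the record is never formatted and the lambda never runs.
logger.debug(LazyFormat(lambda: f"Guess what: {expensive_call()}"))
calls_after_debug = list(calls)

# INFO is enabled, so the handler formats the record and the lambda runs once.
logger.info(LazyFormat(lambda: f"Guess what: {expensive_call()}"))
output = stream.getvalue()
```

What a macro would add is only the removal of the `LazyFormat(lambda: ...)` noise at each call site.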

grave jolt Feb 10, 2023, 11:26 PM

#

or you could just pass in a lambda 🙂

halcyon trail Feb 10, 2023, 11:26 PM

#

I agree that since lambdas have been botched too badly to be used for this, maybe macros could do instead

#

but does it justify macros, prob not (but other people will decide that)

raven ridge Feb 10, 2023, 11:26 PM

#

grave jolt or you could just pass in a lambda 🙂

it's not as easy as just passing in a lambda - you need to pass an object with a __format__ method...

halcyon trail Feb 10, 2023, 11:27 PM

#

you pass in a lambda that returns a string when evaluated

#

i'm not sure why you need a format method

#

maybe if you want to keep structured data around, I suppose?

raven ridge Feb 10, 2023, 11:27 PM

#

halcyon trail you pass in a lambda that returns a string when evaluated

pass it in to what?

halcyon trail Feb 10, 2023, 11:27 PM

#

to the function

#

logger.info(lambda: f"hello {expensive_call()}")

raven ridge Feb 10, 2023, 11:28 PM

#

!e ```py
import logging
logging.error(lambda: "hello")
```

fallen slateBOT Feb 10, 2023, 11:28 PM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

ERROR:root:<function <lambda> at 0x7fe166d52b60>

raven ridge Feb 10, 2023, 11:28 PM

#

that already does something. The change you're proposing would be backwards incompatible.

halcyon trail Feb 10, 2023, 11:29 PM

#

I wasn't seriously proposing it because lambdas in python are so ugly

#

but if you're proposing a new macro, then you can just as well propose new logger functions as well

raven ridge Feb 10, 2023, 11:29 PM

#

I guess maybe it could be done if you subclassed Logger...

halcyon trail Feb 10, 2023, 11:29 PM

#

or a new logger type

#

logger = logging.getLazyLogger(__name__)

#

etc

raven ridge Feb 10, 2023, 11:29 PM

#

yeah. that could work.
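
A rough sketch of what such a logger could look like, written as a thin wrapper instead of a `Logger` subclass. `LazyLogger` and `get_lazy_logger` are hypothetical names; nothing like `logging.getLazyLogger` exists in the standard library. It relies only on the public `isEnabledFor()` and `log()` methods, and it does not try to preserve caller information such as `%(funcName)s`.

```python
import io
import logging


class LazyLogger:
    """Hypothetical wrapper whose methods take zero-argument callables."""

    def __init__(self, logger):
        self._logger = logger

    def log(self, level, make_message):
        # Only call the callable when the record would actually be emitted.
        if self._logger.isEnabledFor(level):
            self._logger.log(level, make_message())

    def debug(self, make_message):
        self.log(logging.DEBUG, make_message)

    def info(self, make_message):
        self.log(logging.INFO, make_message)


def get_lazy_logger(name):
    """Hypothetical counterpart to logging.getLogger()."""
    return LazyLogger(logging.getLogger(name))


stream = io.StringIO()
plain = logging.getLogger("lazy_logger_demo")
plain.addHandler(logging.StreamHandler(stream))
plain.propagate = False
plain.setLevel(logging.INFO)

evaluated = []
lazy = get_lazy_logger("lazy_logger_demo")
lazy.debug(lambda: evaluated.append("debug") or "never built")
lazy.info(lambda: evaluated.append("info") or "hello from a lambda")
emitted = stream.getvalue()
```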

halcyon trail Feb 10, 2023, 11:30 PM

#

I kinda just think none of these things are actually worth the price of admission though

raven ridge Feb 10, 2023, 11:30 PM

#

less nice than the macro solution, I think, for being error prone and less succinct, but...

halcyon trail Feb 10, 2023, 11:30 PM

#

(for python, and in its current state)

raven ridge Feb 10, 2023, 11:30 PM

#

well, possibly. I don't think there's been any real movement on that syntactic macros PEP in a long time. I'm not sure why it came up again the other day - maybe I'm wrong and it came up here because people were discussing it elsewhere?

halcyon trail Feb 10, 2023, 11:30 PM

#

if python adds macros then the universe will probably end in a Greenspun's tenth rule explosion though

raven ridge Feb 10, 2023, 11:32 PM

#

I'm not really sure that it's worth the cost to add macros to Python, but I think this is an interesting example of a place where they'd allow us to do something that's quite ugly without them. Automagically wrapping some code up in a function to delay evaluation is something that macros could do, where the alternative is extra code pushed into every call site.

halcyon trail Feb 10, 2023, 11:32 PM

#

I am less down on macros since the last time we discussed this, insofar as I think they work well in Rust.
macros in a dynamically typed, non-lisp just fundamentally makes me sad because if I was willing to sacrifice static typing I could already have had so much nicer macros

warm breach Feb 10, 2023, 11:32 PM

#

raven ridge we were talking about the syntactic macros PEP the other day - it seems that thi...

would this even work with macros?

halcyon trail Feb 10, 2023, 11:32 PM

#

raven ridge I'm not really sure that it's worth the cost to add macros to Python, but I thin...

Well, you can basically extend this to all the things folks would like to do with lambdas, that they don't in python because they're just too ugly

warm breach Feb 10, 2023, 11:32 PM

#

you would get the ast of a string

halcyon trail Feb 10, 2023, 11:33 PM

#

you could probably also define the macro to define a local function and pass it in

#

you'd probably want to do that in fact so you never have any artificial "one line" restriction

#

so "macros as a hack around poor lambdas" is I suppose a legitimate selling point

raven ridge Feb 10, 2023, 11:34 PM

#

warm breach you would get the ast of a string

I haven't thought too hard about it, but I don't see why not? You'd take the AST of that string, and you'd wrap it up in the AST of a function call to construct a type whose __format__ evaluates and returns that string

warm breach Feb 10, 2023, 11:35 PM

#

oh actually I think it would? but you'd need to provide your own ast I guess

#

!e since python ast would parse it without the field

import ast

print(ast.dump(ast.parse('"Guess what: {expensive_call()}"')))

fallen slateBOT Feb 10, 2023, 11:36 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

Module(body=[Expr(value=Constant(value='Guess what: {expensive_call()}'))], type_ignores=[])

warm breach Feb 10, 2023, 11:36 PM

#

but yeah that'd be a nice use case if it works

raven ridge Feb 10, 2023, 11:37 PM

#

I'm not sure whether the syntax would be nice with PEP 638 as it's proposed, but it's certainly something that macros could do in principle

warm breach Feb 10, 2023, 11:38 PM

#

yeah rust has f!() that does that

#

I think it's lazy?

raven ridge Feb 10, 2023, 11:38 PM

#

transforming the AST for f"Guess what: {expensive_call()}" into the AST for LazyFormat(lambda: f"Guess what: {expensive_call()}") doesn't seem like a big lift, as far as AST rewriting goes
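
To give a feel for the size of that rewrite, here is a sketch using a plain `ast.NodeTransformer`. This is not the PEP 638 macro mechanism, only the same tree transformation done by hand, and `LazyFormat` and `WrapFStrings` are hypothetical names. The transformer deliberately does not recurse into the f-string, because format specs are themselves `JoinedStr` nodes and must stay untouched.

```python
import ast


class LazyFormat:
    """Hypothetical helper: builds the string only when str() is called."""

    def __init__(self, make_message):
        self._make_message = make_message

    def __str__(self):
        return self._make_message()


class WrapFStrings(ast.NodeTransformer):
    """Rewrite every outermost f"..." into LazyFormat(lambda: f"...")."""

    def visit_JoinedStr(self, node):
        no_args = ast.arguments(
            posonlyargs=[], args=[], vararg=None,
            kwonlyargs=[], kw_defaults=[], kwarg=None, defaults=[],
        )
        wrapped = ast.Call(
            func=ast.Name(id="LazyFormat", ctx=ast.Load()),
            args=[ast.Lambda(args=no_args, body=node)],
            keywords=[],
        )
        return ast.copy_location(wrapped, node)


calls = []


def expensive_call():
    calls.append("called")
    return 42


source = 'message = f"Guess what: {expensive_call()}"'
tree = ast.fix_missing_locations(WrapFStrings().visit(ast.parse(source)))

namespace = {"LazyFormat": LazyFormat, "expensive_call": expensive_call}
exec(compile(tree, "<rewritten>", "exec"), namespace)

message = namespace["message"]
calls_before_str = list(calls)  # the assignment alone did not run the call
rendered = str(message)         # formatting happens here
```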

raven ridge Feb 10, 2023, 11:45 PM

#

feral island honestly the whole thing about not using f-strings in logging mostly feels like ...

the argument not to do it for performance reasons is premature optimization, I think - if the string formatting will kill you, odds are that the calls to the logging methods are already too expensive.

But there's another reason, in addition to performance: interpolation failures are caught and reported, without the exception escaping from logging
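
A short sketch of that difference, with a deliberately wrong format code. With `%`-style arguments the bad interpolation happens inside the handler, which catches it and reports it on stderr (a "--- Logging error ---" traceback, by default) without raising. With an f-string the failure happens before `logging` is even called, so it propagates like any other exception. The logger name is made up for the example.

```python
import io
import logging

stream = io.StringIO()
logger = logging.getLogger("interpolation_demo")
logger.addHandler(logging.StreamHandler(stream))
logger.propagate = False

# %d with a str argument: the TypeError is raised while the handler formats
# the record, and the handler swallows it after reporting it on stderr.
logger.error("count: %d", "not a number")
survived_lazy_call = True

# The f-string is evaluated first, so the ValueError escapes to the caller.
try:
    logger.error(f"count: {'not a number':d}")
except ValueError as exc:
    eager_error = exc
else:
    eager_error = None
```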

warm breach Feb 10, 2023, 11:47 PM

#

raven ridge the argument not to do it for performance reasons is premature optimization, I t...

you could just surround your log with if __debug__ and that can get compiled away with -O

#

then use whatever f strings inside you want

#

but that's a bit verbose for a lot of inline stuff

#

in any case I think the function call to logging will take longer than any non-lazy f string

raven ridge Feb 10, 2023, 11:49 PM

#

the performance advice might be more reasonable if it weren't for the fact that arguments get evaluated eagerly anyway - so logging.debug("result: %s", some_expensive_call()) saves the cost of the interpolation, but not the cost of the expensive call
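
A quick sketch of the point about eager evaluation. The call in the argument list runs even though the DEBUG record is thrown away; only an explicit `isEnabledFor()` guard avoids it. The logger name and the function are made up for the example.

```python
import logging

logger = logging.getLogger("eager_args_demo")
logger.addHandler(logging.NullHandler())
logger.propagate = False
logger.setLevel(logging.WARNING)  # DEBUG records are discarded

calls = []


def some_expensive_call():
    # Stand-in for real work; records that it ran.
    calls.append("called")
    return {"answer": 42}


# The argument is evaluated before logger.debug() is even entered.
logger.debug("result: %s", some_expensive_call())
calls_unguarded = len(calls)

# With a guard, the call is skipped when DEBUG is disabled.
if logger.isEnabledFor(logging.DEBUG):
    logger.debug("result: %s", some_expensive_call())
calls_guarded = len(calls) - calls_unguarded
```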

warm breach Feb 10, 2023, 11:50 PM

#

raven ridge the performance advice might be more reasonable if it weren't for the fact that ...

I think the argument here is that str(some_expensive_call) may be very expensive

#

but I don't really see that as too common

raven ridge Feb 10, 2023, 11:51 PM

#

yeah, it's much more common that the call is expensive than that str() on the result of the call is.

#

well, I dunno. big dicts are slow to stringify, I guess.

grave jolt Feb 10, 2023, 11:54 PM

#

!e
Speaking of templating, I have invented this hack to emulate jinja-style {% if %}s

class Yes:
    def __format__(self, spec): return spec.strip()
class No:
    def __format__(self, spec): return ""

template = """
thing.on("userLogin", (user) => {{
    {alert_sentry:
      sentry.send(`Login. ${{user.name}}`)}
    {alert_log:
      console.log(`Login. ${{user.name}}`)}
    user.confirmLogin()
}})
"""

print(template.format(alert_sentry=Yes(), alert_log=No()))

fallen slateBOT Feb 10, 2023, 11:54 PM

#

@grave jolt :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 
002 | thing.on("userLogin", (user) => {
003 |     sentry.send(`Login. ${user.name}`)
004 |     
005 |     user.confirmLogin()
006 | })

grave jolt Feb 10, 2023, 11:55 PM

#

the power is truly terrifying

lone sun Feb 11, 2023, 2:14 AM

#

He doesn't actually prove that Python is non-context-free. He just says, "That makes the language context sensitive, in my opinion." I'm not going to deny him his right to an opinion, but either it is or it isn't, and he hasn't given a sufficient argument one way or the other.

However, there is a perfectly simple mathematical proof of a toy version of this. If I remember correctly, a language of the form {s^3 : s in Σ^*}, where Σ is some alphabet, is not context-free (as long as the alphabet has more than one character). (I think the 3 in the exponent is right, but it might be something else?) This is a consequence of the pumping lemma for context-free languages. It follows that similar languages, like {stsusv : s, t, u, v in Σ^*}, are also not context-free. So you can't recognize that three consecutive lines begin with the same string of whitespace using a context-free grammar.

This doesn't mean that we're using the wrong tools. The type of context-sensitivity we need is quite simple. You just need to remember what the leading whitespace of the most recent line was and update it as necessary (pushing or popping; it's a stack). And sure, in principle stacks let you do interesting computations, but in practice we're really not doing much.
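
The stack bookkeeping described above can be sketched roughly like this. It is a toy, not CPython's tokenizer: it assumes indentation is spaces only, ignores blank lines, and knows nothing about comments, brackets, or line continuations. The function name is hypothetical.

```python
def indentation_tokens(lines):
    """Turn leading whitespace into INDENT/DEDENT markers using a stack."""
    stack = [0]  # indentation widths of the currently open blocks
    tokens = []
    for line in lines:
        if not line.strip():
            continue  # blank lines do not affect indentation
        width = len(line) - len(line.lstrip(" "))
        if width > stack[-1]:
            stack.append(width)
            tokens.append("INDENT")
        else:
            while width < stack[-1]:
                stack.pop()
                tokens.append("DEDENT")
            if width != stack[-1]:
                raise IndentationError(
                    "unindent does not match any outer indentation level"
                )
        tokens.append(("LINE", line.strip()))
    # Close any blocks still open at the end of the input.
    while len(stack) > 1:
        stack.pop()
        tokens.append("DEDENT")
    return tokens


example = [
    "if x:",
    "    y = 1",
    "    if z:",
    "        w = 2",
    "v = 3",
]
tokens = indentation_tokens(example)
```

Once the tokenizer emits INDENT and DEDENT like this, the grammar on top of it can treat them as ordinary bracket-like tokens.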

deep nova Feb 11, 2023, 8:21 AM

#

Just in case y'all are curious: https://hastebin.com/share/penerufeta.py

Hastebin

Hastebin is a free web-based pastebin service for storing and sharing text and code snippets with anyone. Get started now.

flat gazelle Feb 11, 2023, 1:31 PM

#

according to https://github.com/python/cpython/blob/3.10/Python/ceval.c#L2700-L2718, it would appear that END_ASYNC_FOR simply assumes the stack has a single exception triple and the async iterable. But https://docs.python.org/3.10/library/dis.html#opcode-END_ASYNC_FOR says that it uses 7 stack elements, which just seems odd to me. Which is true?

plush dragon Feb 11, 2023, 7:15 PM

#

Genuine question: which PEPs do you think are the most essential to know when collaborating on Python code? I'm thinking PEP 8 and PEP 20 at least, since they describe how Python coders think. Do you know of any other "easter eggs" or must-know PEPs that you eat, drink, and breathe all day? I've always liked PEP 8 and PEP 20. Maybe there's one more I could add to my bookmarks, if you have any

#

Sorry if my English is a bit broken, it's my 2nd language

#

I'm still learning

feral island Feb 11, 2023, 7:17 PM

#

Most PEPs aren't really relevant to normal coding, they are change proposals that were either accepted or rejected. You're usually better off reading the documentation at docs.python.org. PEP 8 and 20 are unusual in that regard

#

Most PEPs are relevant only if you are actually working on the development of the language or interested in language design

plush dragon Feb 11, 2023, 7:20 PM

#

Ohh. Thank you! @feral island

rose schooner Feb 11, 2023, 10:31 PM

#

flat gazelle according to <https://github.com/python/cpython/blob/3.10/Python/ceval.c#L2700-L...

the other popped stack elements must be from UNWIND_EXCEPT_HANDLER()
https://github.com/python/cpython/blob/3.10/Python/ceval.c#L1456-L1475

flat gazelle Feb 11, 2023, 10:45 PM

#

Ah, thanks

surreal sun Feb 11, 2023, 11:44 PM

#

reading through PEP 638 (syntactic macros) i never rlly got the purpose of them

#

from what i'm understanding they change the AST and u can do stuff like DSLs and other cool stuff with it right?

#

but how is it even defined bc i'm not rlly understanding it in the PEP, is it just a function that changes the ast based on the ast node

gray galleon Feb 12, 2023, 1:59 AM

#

surreal sun but how is it even defined bc i'm not rlly understanding it in the PEP, is it ju...

which is what it does
at compiletime

surreal sun Feb 12, 2023, 2:00 AM

#

ohh

deep nova Feb 12, 2023, 2:47 AM

#

It's gonna take me a few weeks to understand this grammar of python's

#

But I'm starting to go through it. I'm still a bit curious about the distinction between a compound statement and a simple statement

#

As best I can tell — it all comes down to the semicolon?

rich cradle Feb 12, 2023, 2:48 AM

#

Compound statements contain (groups of) other statements; they affect or control the execution of those other statements in some way. In general, compound statements span multiple lines, although in simple incarnations a whole compound statement may be contained in one line.
https://docs.python.org/3/reference/compound_stmts.html
A simple statement is comprised within a single logical line. Several simple statements may occur on a single line separated by semicolons.
https://docs.python.org/3/reference/simple_stmts.html

deep nova Feb 12, 2023, 2:48 AM

#

So you can do things like from some_module import thing; thing.func()

deep nova Feb 12, 2023, 2:49 AM

#

rich cradle > Compound statements contain (groups of) other statements; they affect or contr...

Oh

#

This makes so much more sense than the other thing I read

#

It's a compound statement because it literally is compounded from multiple other statements

#

Another question, while I'm here

#

I think I understand that this group of rules:

#

single_input: NEWLINE | simple_stmts | compound_stmt NEWLINE;
file_input: (NEWLINE | stmt)* EOF;
eval_input: testlist NEWLINE* EOF;

#

Just so I totally understand...

#

file_input: (NEWLINE | stmt)* ENDMARKER (from the actual python grammar this time, not a knockoff version)

A file consists of any number of statements and newlines, followed by an end marker. How does this relate to the "flattening of simple statements"? Is it that I collect a sequence of semi-colon-delimited simpler statements in a single pass, but in the resulting AST they should not be grouped within simple-statement collections but rather directly as children of the main File node?
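
One way to check the "flattening" part empirically is to ask the `ast` module. In this sketch (the source snippet is made up), three semicolon-separated simple statements and one compound statement all end up as direct children of the `Module` node, with no intermediate grouping node:

```python
import ast

source = """\
from math import pi; radius = 2; area = pi * radius ** 2
if area > 10:
    print(area)
"""

module = ast.parse(source)
kinds = [type(node).__name__ for node in module.body]
print(kinds)  # ['ImportFrom', 'Assign', 'Assign', 'If']
```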

#

Ahhh, here we go: file[mod_ty]: a=[statements] ENDMARKER { _PyPegen_make_module(p, a) }

#

In this one, there is no or NEWLINE clause. Does this mean that every statement will have its own rules for consuming a newline at its termination?

deep nova Feb 12, 2023, 3:40 AM

#

I'm having a hard time understanding star_expressions

gray galleon Feb 12, 2023, 3:49 AM

#

star expressions = star + expression

#

i think star have the same precedence as unary operators

rose schooner Feb 12, 2023, 3:56 AM

#

gray galleon i think star have the same precedence as unary operators

no
it has different precedences depending on the context

rose schooner Feb 12, 2023, 3:57 AM

#

rose schooner no it has different precedences depending on the context

actually maybe just one

#

precedence just below a bitwise OR expression

gray galleon Feb 12, 2023, 4:05 AM

#

wait so [*3+3] is parsed as [*(3+3)]?

#

til

deep nova Feb 12, 2023, 4:12 AM

#

I've never seen a star used as a unary operator

#

I've seen it used in iterable unpacking, and that's what I was assuming it was

gray galleon Feb 12, 2023, 4:44 AM

#

gray galleon wait so `[*3+3]` is parsed as `[*(3+3)]`?

python be like fuck consistency

raven ridge Feb 12, 2023, 4:48 AM

#

!e ```py
x = "foo"
print(*x + "bar")
```

fallen slateBOT Feb 12, 2023, 4:48 AM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

f o o b a r

raven ridge Feb 12, 2023, 4:48 AM

#

so yep, parsed as *(x + "bar")

deep nova Feb 12, 2023, 4:50 AM

#

I have absolutely no idea what's happening

raven ridge Feb 12, 2023, 4:50 AM

#

it pretty much has to be, right? Parsing it as (*x) + "bar" wouldn't make sense.
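
The same thing can be confirmed from the parse tree rather than from the output. A small sketch: in both expressions the `Starred` node wraps the whole `BinOp`, which is what `'*' bitwise_or` in the grammar predicts. Parsing `[*3+3]` is fine even though evaluating it would fail, since an int is not iterable.

```python
import ast

# print(*x + "bar"): the star applies to the whole sum.
call = ast.parse('print(*x + "bar")', mode="eval").body
starred_arg = call.args[0]

# [*3+3]: same shape inside a list display.
list_node = ast.parse("[*3+3]", mode="eval").body
starred_element = list_node.elts[0]

print(type(starred_arg).__name__, type(starred_arg.value).__name__)
print(type(starred_element).__name__, type(starred_element.value).__name__)
```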

sour thistle Feb 12, 2023, 4:51 AM

#

deep nova I have absolutely no idea what's happening

* can be used for two completely different purposes:

multiplication, 2*3=6
unpacking - for example, if numbers = [1, 2, 3] then doing foo(*numbers) is the same as doing foo(1, 2, 3)

deep nova Feb 12, 2023, 4:51 AM

#

Ohhhhh

#

So, by way of operator precedence

#

*x + "bar" becomes *(x + "bar")

sour thistle Feb 12, 2023, 4:52 AM

#

pretty much

deep nova Feb 12, 2023, 4:54 AM

#

So, in the grammar, is the star operator (wrt unpacking) the lowest precedence operation in the chain and hence, referenced constantly as the go-to for any type of expression in that chain?

#

(Hoping that makes sense)

unkempt rock Feb 12, 2023, 4:54 AM

#

does anyone knw what this means

#

this keeps popping up randomly for me

#

after i close

sour thistle Feb 12, 2023, 4:54 AM

#

deep nova So, in the grammar, is the star operator (wrt to unpacking) the lowest precedenc...

I'm too tired to properly determine whether or not that is technically correct derp

deep nova Feb 12, 2023, 4:55 AM

#

🧠

sour thistle Feb 12, 2023, 4:56 AM

#

unkempt rock does anyone knw what this means

if it is about python at all: open a post in #1035199133436354600 with more details
if it isn't: you can try asking in some offtopic channel, but I'd recommend looking for another more on topic server

deep nova Feb 12, 2023, 4:58 AM

#

O.O There's a more on topic channel than here to ask about python's grammar?

sour thistle Feb 12, 2023, 5:01 AM

#

redd's question is not about python grammar at all

deep nova Feb 12, 2023, 5:15 AM

#

assignment:
    | NAME ':' expression ['=' annotated_rhs ] 
    ...alternatives

annotated_rhs: 
    | yield_expr 
    | star_expressions

yield_expr:
    | 'yield' 'from' expression 
    | 'yield' [star_expressions] 

star_expressions:
    | star_expression (',' star_expression )+ [','] 
    | star_expression ',' 
    | star_expression

star_expression:
    | '*' bitwise_or 
    | expression

bitwise_or:
    | bitwise_or '|' bitwise_xor 
    | bitwise_xor
bitwise_xor:
    | bitwise_xor '^' bitwise_and 
    | bitwise_and
bitwise_and:
    | bitwise_and '&' shift_expr 
    | shift_expr
shift_expr:
    | shift_expr '<<' sum 
    | shift_expr '>>' sum 
    | sum

sum:
    | sum '+' term 
    | sum '-' term 
    | term
term:
    | term '*' factor 
    | term '/' factor 
    | term '//' factor 
    | term '%' factor 
    | term '@' factor 
    | factor

...and so on, all the way down to atomics

#

I just want to make sure I'm interpreting this correctly. NAME ':' expression ['=' annotated_rhs ] translates to:

Name and type-annotation (which may be an expression) optionally followed by = some-kind-of-expression

#

So I can have an "assignment" that doesn't assign anything but rather just declares, such as a: int

#

And then there could be a right hand side to it, the value of which will be some kind of expression. The top-level expression rule in this case seems to be annotated_rhs which degrades into yield or starred, etc

raven ridge Feb 12, 2023, 5:37 AM

#

that all sounds right
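
This reading matches what the `ast` module reports. A small sketch: a bare annotation parses to an `AnnAssign` node whose `value` is `None`, and the node's `simple` flag records whether the target was a plain, unparenthesized name.

```python
import ast

declaration = ast.parse("a: int").body[0]
assignment = ast.parse("a: int = 4").body[0]
parenthesized = ast.parse("(x): int = 4").body[0]

# Annotation only: there is no right-hand side at all.
print(type(declaration).__name__, declaration.value)

# Annotation plus value: the right-hand side is an ordinary expression node.
print(type(assignment.value).__name__)

# simple is 1 for a bare name target and 0 for a parenthesized one.
print(declaration.simple, parenthesized.simple)
```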

deep nova Feb 12, 2023, 5:44 AM

#

assignment:
('(' single_target ')' 
         | single_subscript_attribute_target) ':' expression ['=' annotated_rhs ]

So, the left-hand side could be a single-target between parentheses OR a single_subscript_attribute_target (which I assume to be something like a.b.c or a[6].someattr). But single_target degrades directly into single_subscript_attribute_target as well as into '(' single_target ')'

#

Isn't all that a) wildly confusing and b) pointless? Wouldn't just saying single_target not cover all of this?

raven ridge Feb 12, 2023, 5:51 AM

#

no, that wouldn't allow parentheses

#

!e ```py
(x): int = 4
print(x)
```

fallen slateBOT Feb 12, 2023, 5:53 AM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

deep nova Feb 12, 2023, 5:59 AM

#

I've never really seen objects put in parentheses for assignment

#

The only use-case I can think of might be something like
```
a, (b, c), d = 1, (2, 3), 4
```

#

Is the point to allow for recursive assignment to nested targets?

raven ridge Feb 12, 2023, 6:02 AM

#

Black reformats x,=4 as (x,) = 4

#

I've never really seen (x) = val, but I suppose it makes sense to allow it as a degenerate case of (x, y, z) = val

deep nova Feb 12, 2023, 6:03 AM

#

https://docs.python.org/3/reference/simple_stmts.html#grammar-token-python-grammar-annotated_assignment_stmt

Python documentation

7. Simple statements

A simple statement is comprised within a single logical line. Several simple statements may occur on a single line separated by semicolons. The syntax for simple statements is: Expression statement...

#

The grammar showcased here is totally different from the one I see on Github

#

Is it modified for better readability, while the "real" grammar is designed for efficiency or something?

grave jolt Feb 12, 2023, 6:59 AM

#

raven ridge Black reformats `x,=4` as `(x,) = 4`

no touchie the pistol operator!!

rose schooner Feb 12, 2023, 7:05 AM

#

gray galleon python be like fuck consistency

it's just like not but it feels "less natural" because usually * has a high precedence

deep nova Feb 12, 2023, 7:33 AM

#

I'm sure it'll become plain soon enough, but

#

Why are the top-level expressions in assignment yield_expression and starred_expression?

#

Why those two and not some others?

gray galleon Feb 12, 2023, 7:52 AM

#

because they are the only expressions i think

rose schooner Feb 12, 2023, 7:54 AM

#

deep nova Why are the top-level expressions in assignment `yield_expression` and `starred_...

starred_expressions allows for stuff like this: `a = *b, *c`, which makes a tuple of the elements of b and c and assigns it to a

#

actually it's star_expressions, not starred_expression/starred_expressions

gray galleon Feb 12, 2023, 7:55 AM

#

but you shouldn't be able to do
```
a = *b
```

grave jolt Feb 12, 2023, 7:56 AM

#

make it dereference b brainmon

rose schooner Feb 12, 2023, 7:56 AM

#

yield_expression basically just allows for top-level expression assignment to a yield, `a = yield b`, which is useful when you wanna receive values from outside that uses .send()
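
A minimal sketch of that pattern, with a made-up generator: each `value = yield total` hands `total` out to the caller and then receives whatever the caller passes to `.send()`.

```python
def running_total():
    """Yield the running sum of every value sent in."""
    total = 0
    while True:
        # The yield expression is the right-hand side of the assignment:
        # it yields total, then evaluates to the value passed to send().
        value = yield total
        total += value


gen = running_total()
first = next(gen)      # run to the first yield; produces 0
second = gen.send(5)   # value = 5, total becomes 5
third = gen.send(7)    # value = 7, total becomes 12
print(first, second, third)  # 0 5 12
```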

rose schooner Feb 12, 2023, 7:56 AM

#

gray galleon but you shouldn't be able to do``` a = *b ```

nope

gray galleon Feb 12, 2023, 7:57 AM

#

rose schooner nope

so whats the point of including star expressions in assignment

rose schooner Feb 12, 2023, 7:57 AM

#

gray galleon so whats the point of including star expressions in assignment

allows to unpack into a tuple

#

you can do a = *b,

deep nova Feb 12, 2023, 7:58 AM

#

That explains what those expressions are, but why are they the top level expressions (the ones which degrade into all others) of all the assignment statements?

gray galleon Feb 12, 2023, 7:58 AM

#

rose schooner allows to unpack into a tuple

but then thats in tuple syntax

rose schooner Feb 12, 2023, 7:58 AM

#

gray galleon so whats the point of including star expressions in assignment

actually nvm you can do that but it errors because the star isn't used anywhere

gray galleon Feb 12, 2023, 7:58 AM

#

gray galleon but then thats in tuple syntax

not toplevel assignment

rose schooner Feb 12, 2023, 7:58 AM

#

rose schooner actually nvm you *can* do that but it errors because the star isn't used anywher...

which leads to this error with not much information
```pycon
>>> a = *b
  File "<stdin>", line 1
SyntaxError: can't use starred expression here
```

rose schooner Feb 12, 2023, 7:59 AM

#

gray galleon but then thats in tuple syntax

it's embedded in the rule

#

the actual tuple rule uses parentheses

rose schooner Feb 12, 2023, 8:00 AM

#

rose schooner which leads to this error with not much information ```pycon >>> a = *b File "...

by "isn't used anywhere" i actually mean "doesn't resolve to a single expression"
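
Putting the cases from this exchange side by side, as a small sketch: a star is accepted on the right-hand side as long as a comma turns the whole thing into a tuple, while a lone starred expression is rejected at compile time. `compile()` is used here only so the `SyntaxError` can be caught inside one script.

```python
b = [1, 2]
c = [3]

# With commas, the right-hand side is a tuple display built by unpacking.
a = *b, *c
single = *b,

# Without a comma there is nothing to unpack into, so compilation fails.
try:
    compile("a = *b", "<example>", "exec")
except SyntaxError as exc:
    bare_star_error = exc
else:
    bare_star_error = None

print(a, single)  # (1, 2, 3) (1, 2)
```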

grave jolt Feb 12, 2023, 8:03 AM

#

grave jolt make it dereference `b` <:brainmon:439516188771483658>

!e

class ptr:
    def __init__(self, obj):
        self._obj = obj

    def __iter__(self):
        yield self._obj

    def __imul__(self, other):
        self._obj = other
        return self

px = ptr(37)
py = ptr(5)
pz = ptr(0)
(x, y) = (*px, *py)
pz *= x + y
print(*pz)

fallen slateBOT Feb 12, 2023, 8:03 AM

#

@grave jolt :white_check_mark: Your 3.11 eval job has completed with return code 0.

raven ridge Feb 12, 2023, 8:30 AM

#

deep nova That explains what those expressions are, but why are they the top level express...

Because they're the ones with the lowest precedence

deep nova Feb 12, 2023, 8:33 AM

#

This is what I needed to know XD

raven ridge Feb 12, 2023, 8:36 AM

#

Something needs to be at the top level, and the only thing that's special about that top level thing is that it needs to be able to match all the other things

deep nova Feb 12, 2023, 8:42 AM

#

I was hoping/suspecting as much. I just wanted to check to make sure there wasn't anything particularly special or complicated about those expression categories in particular

#

Personally, I'd favor a top-level-expression rule, or simply reserve the term expression for that purpose

#

I love python, I really do, but its internals are some of the least semantic code I've seen in my life

raven ridge Feb 12, 2023, 8:54 AM

#

deep nova Personally, I'd favor a `top-level-expression` rule, or simply reserver the term...

Isn't annotated_rhs a reasonable name for the stuff on the right side of an =?

deep nova Feb 12, 2023, 8:55 AM

#

Nope

#

Not even close

#

I had no idea what that was supposed to mean

raven ridge Feb 12, 2023, 8:55 AM

#

It's the right hand side argument of an annotated assignment

deep nova Feb 12, 2023, 8:55 AM

#

assignment_rhs or simply expression would have gotten the point across much better

#

annotated_rhs might have made more sense if the rule name was annotated_assignment. When I started trying to understand the rule (whose name was assignment) there was no clear indication that that particular part of the rule references annotated assignment specifically. In the context of assignment as a broader set of rules, using the term annotated_rhs was quite confusing. In fact, not using separate rules for the different types of assignment is confusing af

#

That particular set of rules looks more like something a machine would have spit out after having digested and optimized a much clearer, semantically focused equivalent

raven ridge Feb 12, 2023, 9:01 AM

#

deep nova `assignment_rhs` or simply `expression` would have gotten the point across much ...

expression is already used by the grammar to mean something else, though, so you'd need to rename that too

#

I suspect it's not as easy as you'd imagine to come up with good names for each of the intermediate productions in a grammar

deep nova Feb 12, 2023, 9:02 AM

#

Oh, I'm sure it's a total pain

raven ridge Feb 12, 2023, 9:03 AM

#

In fact, the rule you posted above had an expression in it as well

#

The type annotation of an annotated assignment is matched by expression

deep nova Feb 12, 2023, 9:03 AM

#

That proves my point though

#

Why is annotated_rhs (itself an analogue for yield_expression | star_expressions) different from just expression? What about the former is different from the latter, and what makes one the required rule for an annotation but not for an assignment right-hand side?

raven ridge Feb 12, 2023, 9:05 AM

#

They match different stuff

deep nova Feb 12, 2023, 9:06 AM

#

https://tenor.com/view/you-dont-say-nicholas-cage-gif-13481115

Tenor

#

My point is that what they match and why that particular entry point to the expression-fission-chain is used there should be obvious.

raven ridge Feb 12, 2023, 9:09 AM

#

People would say that both the thing after the : and the thing after the = are expressions. Trying to come up with different words for "ok, this is an expression, but it's not an expression that can start with yield or *" for every one of these hundreds of productions isn't easy. Tack on to that the fact that this grammar evolved over time - it's reasonably likely that at one point expression was at the top level, and then new changes to the grammar required a new production above expression

deep nova Feb 12, 2023, 9:10 AM

#

Ehhhhhhhhhhhhhhhhhhhhhhhhh

#

I mean, yeah, sure

raven ridge Feb 12, 2023, 9:11 AM

#

yield was added in, what, 2.5?

deep nova Feb 12, 2023, 9:12 AM

#

But the grammar is the literal definition of the language in so far as such a thing might exist. Confusion is not an option

raven ridge Feb 12, 2023, 9:12 AM

#

And * unpacking for assignments was added even later than that, I think

deep nova Feb 12, 2023, 9:13 AM

#

Besides, we're smart people. We've all written oodles of essays and technical documents. Python is run by a steering committee and is peer reviewed out the wazoo I'm sure. I'm not sure that "it's too hard to keep straight" is a reasonable rationale for a confusing document

raven ridge Feb 12, 2023, 9:14 AM

#

I didn't say that it's too hard to keep straight, I said that the names are essentially arbitrary by virtue of the fact that grammars force you to choose way too many names, and that evolution over time accounts for cruft

raven ridge Feb 12, 2023, 9:15 AM

#

deep nova But the grammar is the *literal definition of the language* in so far as such a ...

And Python doesn't have a standard, just a reference implementation. The grammar isn't the definition of the language, "what CPython does" is.

deep nova Feb 12, 2023, 9:15 AM

#

That's why I said "in so far as such a thing exists"

#

Anyway, I've got no particular loathing towards the document. I do think it could use a sprucing up, though

raven ridge Feb 12, 2023, 9:17 AM

#

🤷‍♀️ go for it 🙂

deep nova Feb 12, 2023, 9:17 AM

#

I mean

#

In writing my own language's grammar, that's basically what I'm doing

#

So gimme a month or two, and I'll get back to you I suppose XD

raven ridge Feb 12, 2023, 9:18 AM

#

CPython is open source. If you see ways to improve on the grammar, send PRs. If the core devs agree that they're improvements, they'll get merged.

deep nova Feb 12, 2023, 9:18 AM

#

Hmmmmmmm

#

Rewriting the grammar for readability does sound like a fun time...

#

And it would be a good excuse to learn the beast inside and out

#

While you're here — one quick question

#

Why is the grammar shown here in the "docs" different (very different) from the one in the actual grammar file?

#

https://docs.python.org/3/reference/simple_stmts.html

Python documentation

7. Simple statements

A simple statement is comprised within a single logical line. Several simple statements may occur on a single line separated by semicolons. The syntax for simple statements is: Expression statement...

raven ridge Feb 12, 2023, 9:22 AM

#

Dunno

#

It might be simplified for readability, or it might be a place where docs didn't keep up with changes to implementation details

deep nova Feb 12, 2023, 9:23 AM

#

Or both?

raven ridge Feb 12, 2023, 9:24 AM

#

Possibly

deep nova Feb 12, 2023, 9:24 AM

#

Cool. I just wanted to know if there was a method to the madness

raven ridge Feb 12, 2023, 9:24 AM

#

You might check if the grammar in the docs matched the old, non PEG, grammar more closely

deep nova Feb 12, 2023, 9:25 AM

#

I love the peg grammar's syntax

#

Maybe not the actual content, but the syntax is quite graceful

warm breach Feb 12, 2023, 9:27 AM

#

grave jolt make it dereference `b` <:brainmon:439516188771483658>

can't really dereference it anymore though

#

every python variable access is already an implicit dereference

#

oh though maybe, * of ints dereferences the pyobject at that address?

#

super cursed

#

!e

from einspect.structs import PyObject
from einspect import impl

@impl(int)
def __iter__(self):
    return iter((PyObject.from_address(self).into_object(),))

x = id("hello")

print(x)
print(*x)

fallen slateBOT Feb 12, 2023, 9:32 AM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 140716785662064
002 | hello

deep nova Feb 12, 2023, 9:37 AM

#

WHAT!?

#

https://pypi.org/project/einspect/0.5.3/

PyPI

einspect

Extended Inspect - view and modify memory structs of runtime objects.

#

When did this happen!?

gray galleon Feb 12, 2023, 9:46 AM

#

deep nova In writing my own language's grammar, that's basically what I'm doing

btw you can use lark

#

!pypi lark

fallen slateBOT Feb 12, 2023, 9:47 AM

#

lark v1.1.5

a modern parsing library

graceful pelican Feb 12, 2023, 10:58 AM

#

hope this is the right channel to ask this: does python bind methods when they are accessed, or when the class is instantiated?

class Test:
  def test(self): pass

Test().test # does this bind `test` to `Test()`, or is it already bound?

dusk comet Feb 12, 2023, 11:01 AM

#

graceful pelican hope this is the right channel to ask this: does python bind methods when they a...

!e ```py
class Test:
    def test(self): pass
print(Test.test)
print(Test().test)
print(Test().test)
```

fallen slateBOT Feb 12, 2023, 11:01 AM

#

@dusk comet :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | <function Test.test at 0x7f8330ea5c60>
002 | <bound method Test.test of <__main__.Test object at 0x7f8330c643d0>>
003 | <bound method Test.test of <__main__.Test object at 0x7f8330c64410>>

graceful pelican Feb 12, 2023, 11:05 AM

#

that doesn't answer my question, i guess what i'm trying to ask is does the LOAD_ATTR instruction bind the method to the receiver somehow, or does it load an already bound method?

#

i'm not familiar with cpython source code, so i thought maybe someone already familiar could answer that question or point me in the right direction

graceful pelican Feb 12, 2023, 11:09 AM

#

fallen slate <@575681145929203724> :white_check_mark: Your 3.11 eval job has completed with r...

this does show that the addresses are different, but it could also be cloning the bound method for some reason

prime estuary Feb 12, 2023, 11:12 AM

#

It's when they're accessed yes.

#

They're "just" regular functions in the class.

graceful pelican Feb 12, 2023, 11:13 AM

#

okay, thank you

prime estuary Feb 12, 2023, 11:14 AM

#

All through the magic of the descriptor protocol, same way @property works.
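A minimal sketch of what that binding looks like when done by hand, reusing the `Test` class from the question. The function lives in the class `__dict__`, and its `__get__` method produces a fresh bound method on each access.

```python
class Test:
    def test(self):
        pass


obj = Test()
plain = Test.__dict__["test"]     # the plain function stored on the class
bound = plain.__get__(obj, Test)  # manual use of the descriptor protocol

print(type(plain).__name__, type(bound).__name__)
print(bound.__self__ is obj, bound.__func__ is plain)
# Equal to what attribute access returns, but every access builds a new object.
print(obj.test == bound, obj.test is obj.test)
print(vars(obj))  # nothing is stored on the instance itself
```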

rose schooner Feb 12, 2023, 11:37 AM

#

graceful pelican that doesn't answer my question, i guess what i'm trying to ask is does the `LOA...

it's actually LOAD_METHOD

#

it's specialized for loading an attribute that is to be called

prime estuary Feb 12, 2023, 12:19 PM

#

That's only when it's immediately called, but it's not really too important - it's just an optimisation, so it shouldn't change semantics or be observable from your code.

deep nova Feb 13, 2023, 3:46 AM

#

Hey guys, I'm trying to understand a nuance in Python's Grammar's syntax

#

assignment[statement] ::=
    | targets=( (t=target_list '=' {t})+ ) expr=expression {{ parse_assignment(targets, expr) }}

#

From my own grammar, trying to emulate python's.

The issue stems from needing to repeat (target_list '=') + as so, but, only wanting to actually collect the target_list node, ignoring the consumed =

#

So, if I'm interpreting this right...

#

(t=target_list '=' {t})+ says "each time I collect a target_list followed by an =, bind it to the name t and collect that (as opposed to, well, I don't really know)"

#

All the ts get collected via a while loop and put into a collection, which is then bound to the name targets?

gray galleon Feb 13, 2023, 4:00 AM

#

deep nova From my own grammar, trying to emulate python's. The issue stems from needing t...

just use lark 😉

deep nova Feb 13, 2023, 4:00 AM

#

https://tenor.com/view/i-wont-do-it-stephen-abootman-south-park-s12e4-canada-on-strike-gif-21161390

Tenor

#

Rebuilding python's parser and parser generator is exactly what I want to do. It's absolutely exhilarating, and, it'll look great on my github

#

And I'll have superpowers when I'm done

gray galleon Feb 13, 2023, 4:02 AM

#

in seriousness
removing punctuations like that should be done at ast generation

#

the job of the parser
is to parse

deep nova Feb 13, 2023, 4:04 AM

#

_>

#

<_<

#

Yes, I agree. Hence the grammar

gray galleon Feb 13, 2023, 4:14 AM

#

deep nova ``` assignment[statement] ::= | targets=( (t=target_list '=' {t})+ ) expr=ex...

what grammar are you looking at
the python grammar spec doesn't look like this

deep nova Feb 13, 2023, 4:14 AM

#

This is a grammar of my own design, but I think I'm sticking pretty close to Python's

#

I'm basing it more off of this, which looks like a modified version designed for readability: https://docs.python.org/3/reference/simple_stmts.html#grammar-token-python-grammar-augtarget

#

But I'm also doing my best to be consistent with this where possible: https://github.com/python/cpython/blob/3.11/Grammar/python.gram

GitHub

cpython/python.gram at 3.11 · python/cpython

The Python programming language. Contribute to python/cpython development by creating an account on GitHub.

gray galleon Feb 13, 2023, 4:29 AM

#

actually i still don't know what your problem is

deep nova Feb 13, 2023, 4:48 AM

#

Python's grammar is basically a programming language unto itself

#

It has variables and function calls. It's really quite beautiful

#

```
some_rule ::=
    | a=sub_rule_1 sub_rule_2, b=sub_rule_3 {{ parse_some_rule(a, b) }}
```
This contains everything the parser generator needs to know to build the parser, including how to bind the results of calling some other rule to a name, and, how to pass the collected child nodes to the desired parsing function

#

But things are a bit weird when you're collecting one-or-more or zero-or-more of something

#

```
some_rule ::=
    | a=( some_other_rule * ) {{ parse_some_rule(a) }}
```
I *assume* this basically says "collect zero or more of `some_other_rule` and place them in a collection. Bind that collection to the name `a`, and pass that collection on". Alright, easy enough

#

What about this?

#

some_rule ::=
    | a=( ( some_other_rule '=' ) * ) {{ parse_some_rule(a) }}

#

Collect some rule zero or more times, as before. But you're also consuming a token. What, then, are the elements of a? Tuples of the form tuple[some_other_rule, Token]? Does the parser implicitly ignore the collected token? Maybe it will only place into the "result" of a repeated group items that are named ( (x=some_other_rule some_ignored_rule) * )

#

That's what I'm asking about. What I think it's doing is this: ( (x=some_other_rule some_ignored_rule {x}) *)

#

Basically, any parenthesized group can have a return statement {something} at the end. If so, the "result" or "contents" of the parenthesized group will be those items in the return statement.

#

I think.
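A rough hand-written analogue of a repeated group with an action, such as `(t=NAME '=' { t })+`. This is only a sketch over a list of string tokens, and `parse_targets` is a hypothetical name, not part of any real parser generator. It illustrates the reading above: the `=` is consumed but only the value named in the action is collected.

```python
def parse_targets(tokens, pos=0):
    """Analogue of `(t=NAME '=' { t })+` over a list of string tokens.

    Returns (collected_names, new_pos), or None if the group never matches.
    """
    collected = []
    while True:
        start = pos
        # t=NAME
        if pos < len(tokens) and tokens[pos].isidentifier():
            t = tokens[pos]
            pos += 1
            # '=' is consumed but never stored
            if pos < len(tokens) and tokens[pos] == "=":
                pos += 1
                collected.append(t)  # the `{ t }` action: the group's value is t
                continue
        pos = start  # this repetition failed, so backtrack and stop looping
        break
    return (collected, pos) if collected else None


print(parse_targets(["a", "=", "b", "=", "c"]))
print(parse_targets(["1"]))
```

In the first call the final `c` is left unconsumed, because it is not followed by `=`, so it remains available for whatever rule comes next.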

deep nova Feb 13, 2023, 5:21 AM

#

I think I get it now. yield_expr and star_expressions are not necessarily on top of the expression fission chain, but they require special handling in the context of assignment (and maybe in a few other cases)

rose schooner Feb 13, 2023, 10:25 AM

#

deep nova Rebuilding python's parser and parser generator is exactly what I want to do. It...

i am also doing that for my programming language currently put on a hiatus

grave jolt Feb 13, 2023, 10:43 AM

#

deep nova It has variables and function calls. Its really quite beautiful

https://en.wikipedia.org/wiki/Greenspun's_tenth_rule

Any sufficiently complicated C or Fortran program contains an ad hoc, informally-specified, bug-ridden, slow implementation of half of Common Lisp.

#

🙂

deep nova Feb 13, 2023, 5:03 PM

#

grave jolt https://en.wikipedia.org/wiki/Greenspun%27s_tenth_rule > Any sufficiently compli...

I don't think I understand

grave jolt Feb 13, 2023, 5:04 PM

#

deep nova I don't think I understand

well, it sounds to me like Python's grammar got a lisp inside of it

deep nova Feb 13, 2023, 5:07 PM

#

Oh I see

#

Hehe

deep nova Feb 13, 2023, 5:23 PM

#

deep nova I think I get it now. `yield_expr` and `star_expressions` are not necessarily on...

about this

#

I can certainly see why a yield expression requires special handling. It has "limited usage" in that it can only appear as part of an assignment expression, or in a yield statement (which is probably just a statement wrapper around a yield expression, but I haven't looked)

feral island Feb 13, 2023, 5:25 PM

#

deep nova I can certainly see why a yield expression requires special handling. It has "li...

that's not true

#

!e def f(): (yield x) + a((yield y))

fallen slateBOT Feb 13, 2023, 5:25 PM

#

@feral island :warning: Your 3.11 eval job has completed with return code 0.

[No output]

feral island Feb 13, 2023, 5:26 PM

#

this is perfectly legal

deep nova Feb 13, 2023, 5:26 PM

#

Damn XD

#

This grammar is confusing!

#

Here I thought I'd figured it out

feral island Feb 13, 2023, 5:28 PM

#

I haven't looked at the formal grammar but a weirdness around yield is that it sometimes needs extra parentheses; e.g., f(await y) is allowed but f(yield y) is not

#

Might have something to do with yield without an argument being legal
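A quick way to check which forms the parser accepts, using `ast.parse` and a hypothetical helper named `parses`. The source strings are illustrative only; they are parsed, never executed.

```python
import ast


def parses(source: str) -> bool:
    """Return True if the source parses, False on SyntaxError."""
    try:
        ast.parse(source)
    except SyntaxError:
        return False
    return True


candidates = [
    "def f():\n    g(yield 1)",          # bare yield as a call argument
    "def f():\n    g((yield 1))",        # same, with its own parentheses
    "def f():\n    x = yield",           # yield without an argument
    "async def f():\n    g(await y)",    # await needs no extra parentheses
]
results = {source: parses(source) for source in candidates}
for source, ok in results.items():
    print(ok, repr(source))
```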

deep nova Feb 13, 2023, 5:29 PM

#

My gut instinct is that that's an inconsistency. But I don't really know enough about it to say anything

grave jolt Feb 13, 2023, 5:41 PM

#

!e

def f():
    g(yield 42069)

fallen slateBOT Feb 13, 2023, 5:41 PM

#

@grave jolt :x: Your 3.11 eval job has completed with return code 1.

001 |   File "<string>", line 2
002 |     g(yield 42069)
003 |       ^^^^^
004 | SyntaxError: invalid syntax

grave jolt Feb 13, 2023, 5:41 PM

#

yo wtf

feral island Feb 13, 2023, 5:41 PM

#

this is something I run into a lot because of https://github.com/quora/asynq 🙂

grave jolt Feb 13, 2023, 5:44 PM

#

feral island I haven't looked at the formal grammar but a weirdness around yield is that it s...

oh I think I kinda get it?
Like, if you have x = yield - 5, it's not clear whether it's ((yield) - 5) or (yield (-5))

feral island Feb 13, 2023, 5:45 PM

#

!e

def f():
    yield - 5
print(list(f()))

fallen slateBOT Feb 13, 2023, 5:45 PM

#

@feral island :white_check_mark: Your 3.11 eval job has completed with return code 0.

[-5]

grave jolt Feb 13, 2023, 5:45 PM

#

feral island !e ```def f(): yield - 5 print(list(f()))```

Yeah, in this form it's alright in order to not break code written before yield expressions were a thing

feral island Feb 13, 2023, 5:46 PM

#

!e

def f():
    print((yield - 5))
print(list(f()))

fallen slateBOT Feb 13, 2023, 5:46 PM

#

@feral island :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | None
002 | [-5]

deep nova Feb 13, 2023, 6:06 PM

#

I don't suppose there's a detailed, thorough write up of python's grammar anywhere?

#

Not just the grammar itself, but a description of how and why it works?

feral island Feb 13, 2023, 6:07 PM

#

deep nova I don't suppose there's a detail, thorough write up of python's grammar anywhere...

there is https://devguide.python.org/internals/parser/, but it's more about how the parser works, not so much why the grammar is the way it is

deep nova Feb 13, 2023, 6:08 PM

#

Awesome

#

This is a start, at least 🙂

#

Better question — is there any one here who understands the grammar quite deeply from whom I could buy or borrow an hour or two

vernal girder Feb 13, 2023, 7:58 PM

#

Gang

halcyon trail Feb 13, 2023, 7:59 PM

#

Might want to start with a question or two instead of asking right out for an hour or two...

deep nova Feb 13, 2023, 8:01 PM

#

I have zillions of questions, which I've been asking one by one

#

But I feel like it's going to take far longer, for all parties, to do it that way

halcyon trail Feb 13, 2023, 9:22 PM

#

maybe, it's just a huge ask for a place like this to essentially ask someone for a commitment.
generally you just ask questions and whoever feels like answering, answers. If they enjoy the convo and want to keep answering, they'll do it. If not, they stop answering whenever.
Obviously you can still ask for an hour or two but IME in places like this, usually writing something like that results in crickets. Best of luck though.
btw I think you changed it to "buy or borrow" - now I have to ask, what rate are you offering 😛

deep nova Feb 14, 2023, 1:14 AM

#

Whatever the standard rate for such a thing is, I suppose?

#

Whatever the purveyor of the knowledge thought was fair and appropriate

raven ridge Feb 14, 2023, 1:17 AM

#

!rule 9 regardless

fallen slateBOT Feb 14, 2023, 1:17 AM

#

Rules

9. Do not offer or ask for paid work of any kind.

deep nova Feb 14, 2023, 1:29 AM

#

Fair enough :3

#

Sorry

deep nova Feb 14, 2023, 1:50 AM

#

With respect to specific questions

#

    | a=('(' b=single_target ')' { b } | single_subscript_attribute_target) ':' b=expression c=['=' d=annotated_rhs { d }]

#

I don't think I understand this. Line 150 of the grammar

#

The assignment target can be either a single target in parentheses or a single_subscript_attribute_target. The latter does pretty much what you'd expect it to do. The former can either be another single_subscript_attribute_target or an identifier, or itself in parentheses

#

Looking closer, this is one of two rules for annotated assignment. It seems to allow for (a) : int = 1, ((a)) : int = 1, a.b.c : int = 1 and such. This seems very strange

raven ridge Feb 14, 2023, 2:10 AM

#

why?

#

given that (a, b) = (b, a) is allowed, there's no particular reason to disallow (a) = (b)

deep nova Feb 14, 2023, 2:19 AM

#

I guess I'm a bit confused about parentheses wrapped around assignment targets in the first place

#

I understand it in the case of a (b, c), d = 1, some_iterable, 2

raven ridge Feb 14, 2023, 2:20 AM

#

deep nova I guess I'm a bit confused about parentheses wrapped around assignment targets i...

C and C++ both allow it. So does Java. What languages don't?

deep nova Feb 14, 2023, 2:21 AM

#

Well I don't know

#

But I've never seen such a thing before, and I guess I just don't understand the usefulness, except in the case I mentioned earlier

raven ridge Feb 14, 2023, 2:26 AM

#

Zig allows it... Rust allows it... Nim allows it... I'm not sure that I've ever seen a language that doesn't.

deep nova Feb 14, 2023, 2:27 AM

#

That line seems to be doing one of two things: wrapping a single_target (a name, a subscripted-target of some kind, another single_target in parens) in parentheses; OR, directly supplying a subscripted-target w/o parens

#

Taken in conjunction with the rule above, which uses only a NAME token as the target, you're able to annotate either a name, a subscripted target, or either nested arbitrarily deeply within parens

#

Why it's broken into two rules, I can't see

#

All told, as best I can tell, the two rules seem to specify the lhs-cases of single-target assignment with annotation

raven ridge Feb 14, 2023, 2:44 AM

#

deep nova Why its broken into two rules, I can't see

I'm not sure I understand the question you're asking here - what two rules are you talking about? single_target and single_subscript_attribute_target?

deep nova Feb 14, 2023, 2:45 AM

#

The first two cases here

#

# NOTE: annotated_rhs may start with 'yield'; yield_expr must start with 'yield'
assignment[stmt_ty]:
    | a=NAME ':' b=expression c=['=' d=annotated_rhs { d }] {
        CHECK_VERSION(
            stmt_ty,
            6,
            "Variable annotation syntax is",
            _PyAST_AnnAssign(CHECK(expr_ty, _PyPegen_set_expr_context(p, a, Store)), b, c, 1, EXTRA)
        ) }
    | a=('(' b=single_target ')' { b }
         | single_subscript_attribute_target) ':' b=expression c=['=' d=annotated_rhs { d }] {
        CHECK_VERSION(stmt_ty, 6, "Variable annotations syntax is", _PyAST_AnnAssign(a, b, c, 0, EXTRA)) }
    | a[asdl_expr_seq*]=(z=star_targets '=' { z })+ b=(yield_expr | star_expressions) !'=' tc=[TYPE_COMMENT] {
         _PyAST_Assign(a, b, NEW_TYPE_COMMENT(p, tc), EXTRA) }
    | a=single_target b=augassign ~ c=(yield_expr | star_expressions) {
         _PyAST_AugAssign(a, b->kind, c, EXTRA) }
    | invalid_assignment

raven ridge Feb 14, 2023, 2:46 AM

#

by "first two cases" you mean this is 1:

    | a=NAME ':' b=expression c=['=' d=annotated_rhs { d }] {
        CHECK_VERSION(
            stmt_ty,
            6,
            "Variable annotation syntax is",
            _PyAST_AnnAssign(CHECK(expr_ty, _PyPegen_set_expr_context(p, a, Store)), b, c, 1, EXTRA)
        ) }

and this is 2:

    | a=('(' b=single_target ')' { b }
         | single_subscript_attribute_target) ':' b=expression c=['=' d=annotated_rhs { d }] {
        CHECK_VERSION(stmt_ty, 6, "Variable annotations syntax is", _PyAST_AnnAssign(a, b, c, 0, EXTRA)) }

?

deep nova Feb 14, 2023, 2:46 AM

#

Yeah

raven ridge Feb 14, 2023, 2:50 AM

#

and you're asking why those aren't merged into one case with a more complicated pattern for a?

deep nova Feb 14, 2023, 2:52 AM

#

Well, I was more just remarking in passing

#

It looks to me like this rule could have been expressed as ```
| a=(NAME | '(' b=single_target ')' | single_subscript_attribute_target)
    ':' b=expression c=['=' d=annotated_rhs {d}]
```

raven ridge Feb 14, 2023, 2:53 AM

#

they seem to pass different values to the _PyAST_AnnAssign - do they result in different ASTs?

deep nova Feb 14, 2023, 2:55 AM

#

Oh, you're right. I have no idea what those arguments are for (I haven't gotten that far yet)

raven ridge Feb 14, 2023, 2:56 AM

#

!e ```py
import ast
print(ast.dump(ast.parse("x: int = y")))
print(ast.dump(ast.parse("(x): int = y")))
```

fallen slateBOT Feb 14, 2023, 2:56 AM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | Module(body=[AnnAssign(target=Name(id='x', ctx=Store()), annotation=Name(id='int', ctx=Load()), value=Name(id='y', ctx=Load()), simple=1)], type_ignores=[])
002 | Module(body=[AnnAssign(target=Name(id='x', ctx=Store()), annotation=Name(id='int', ctx=Load()), value=Name(id='y', ctx=Load()), simple=0)], type_ignores=[])

deep nova Feb 14, 2023, 2:57 AM

#

They seem to differ in their simple argument, which leads me to suspect different handling (likely on account of one target being just an identifier, the other being a compound object)

raven ridge Feb 14, 2023, 2:57 AM

#

!d ast.AnnAssign

fallen slateBOT Feb 14, 2023, 2:57 AM

#

ast.AnnAssign


```
class ast.AnnAssign(target, annotation, value, simple)
```
An assignment with a type annotation. `target` is a single node and can be a [`Name`](https://docs.python.org/3/library/ast.html#ast.Name "ast.Name"), a [`Attribute`](https://docs.python.org/3/library/ast.html#ast.Attribute "ast.Attribute") or a [`Subscript`](https://docs.python.org/3/library/ast.html#ast.Subscript "ast.Subscript"). `annotation` is the annotation, such as a [`Constant`](https://docs.python.org/3/library/ast.html#ast.Constant "ast.Constant") or [`Name`](https://docs.python.org/3/library/ast.html#ast.Name "ast.Name") node. `value` is a single optional node. `simple` is a boolean integer set to True for a [`Name`](https://docs.python.org/3/library/ast.html#ast.Name "ast.Name") node in `target` that do not appear in between parenthesis and are hence pure names and not expressions.

raven ridge Feb 14, 2023, 2:57 AM

#

literally just a flag to tell you if it was or wasn't just a name.
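A short sketch extending the check above to attribute and subscript targets, using a hypothetical helper named `simple_flag`. Only a bare, unparenthesized name sets the flag.

```python
import ast


def simple_flag(source: str) -> int:
    """Parse one annotated assignment and return its `simple` field."""
    stmt = ast.parse(source).body[0]
    assert isinstance(stmt, ast.AnnAssign)
    return stmt.simple


sources = ["x: int", "(x): int", "a.b: int", "a[0]: int"]
flags = [simple_flag(source) for source in sources]
for source, flag in zip(sources, flags):
    print(f"{source!r:12} simple={flag}")
```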

deep nova Feb 14, 2023, 2:57 AM

#

Ahhhhhhh, there it is

deep nova Feb 14, 2023, 2:58 AM

#

raven ridge !d ast.AnnAssign

This, btw, is going to be super useful. Thanks for showing me

deep nova Feb 14, 2023, 3:56 AM

#

yield_expr[expr_ty]:
    | 'yield' 'from' a=expression { _PyAST_YieldFrom(a, EXTRA) }
    | 'yield' a=[star_expressions] { _PyAST_Yield(a, EXTRA) }

#

So, you can yield from a singular expression

#

Or, you can yield many expressions, comma separated, some of which may be starred?

#

Is there any reason for this? The behaviour I'd have expected from yield from 1, 2, 3, 4 would be to automatically convert the multiple values into a tuple, and yield them all

raven ridge Feb 14, 2023, 4:02 AM

#

yield from 1, 2, 3, 4 isn't valid syntax at all.

#

possibly because it's not obvious whether it should be parsed as (yield from 1), 2, 3, 4 or as yield from (1, 2, 3, 4)

deep nova Feb 14, 2023, 4:03 AM

#

Hmmmmm

#

That seems to be the consensus on the other server as well

gray galleon Feb 14, 2023, 4:35 AM

#

yield from expressions?

deep nova Feb 14, 2023, 5:02 AM

#

Wait, so

#

If yield from 1, 2, 3, 4 is ambiguous because it could mean (yield from 1), 2, 3, 4 or yield from (1, 2, 3, 4)

#

Why is yield 1, 2, 3, 4 not ambiguous? It could mean the same thing

deep nova Feb 14, 2023, 5:36 AM

#

primary[expr_ty]:
    | a=primary '.' b=NAME { _PyAST_Attribute(a, b->v.Name.id, Load, EXTRA) }
    | a=primary b=genexp { _PyAST_Call(a, CHECK(asdl_expr_seq*, (asdl_expr_seq*)_PyPegen_singleton_seq(p, b)), NULL, EXTRA) }
    | a=primary '(' b=[arguments] ')' {
        _PyAST_Call(a,
                 (b) ? ((expr_ty) b)->v.Call.args : NULL,
                 (b) ? ((expr_ty) b)->v.Call.keywords : NULL,
                 EXTRA) }
    | a=primary '[' b=slices ']' { _PyAST_Subscript(a, b, Load, EXTRA) }
    | atom

#

What is this? | a=primary b=genexp { _PyAST_Call(a, CHECK(asdl_expr_seq*,

#

It looks like a.b.c[i for i in range(10)] or some such

gray galleon Feb 14, 2023, 6:13 AM

#

deep nova Why is `yield 1, 2, 3, 4` not ambiguous? It could mean the same thing

!e ```py
def a():
    b = yield 1, 2, 3, 4

print(next(a()))
```

fallen slateBOT Feb 14, 2023, 6:13 AM

#

@gray galleon :white_check_mark: Your 3.11 eval job has completed with return code 0.

(1, 2, 3, 4)

gray galleon Feb 14, 2023, 6:15 AM

#

yield takes precedence over ,

#

not sure what happened with yield from
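A small comparison of the three forms discussed, with hypothetical generator names. `yield 1, 2, 3, 4` produces a single tuple, `yield from (1, 2, 3, 4)` delegates to the tuple and produces four values, and the unparenthesized `yield from` form is rejected by the parser.

```python
import ast


def gen_tuple():
    yield 1, 2, 3, 4         # one value: the tuple (1, 2, 3, 4)


def gen_delegate():
    yield from (1, 2, 3, 4)  # four values, taken from the tuple


print(list(gen_tuple()))
print(list(gen_delegate()))

try:
    ast.parse("def g():\n    yield from 1, 2, 3, 4")
    unparenthesized_ok = True
except SyntaxError:
    unparenthesized_ok = False
print(unparenthesized_ok)
```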

radiant garden Feb 14, 2023, 6:22 AM

#

makes a little more sense when thinking of it as await

raven ridge Feb 14, 2023, 6:29 AM

#

deep nova What is this? ` | a=primary b=genexp { _PyAST_Call(a, CHECK(asdl_expr_seq*, `

!e That might be for ```py
print(i for i in range(10))
```

fallen slateBOT Feb 14, 2023, 6:29 AM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

<generator object <genexpr> at 0x7f97157d81e0>

raven ridge Feb 14, 2023, 6:31 AM

#

As a special case, Python lets you pass a generator expression to a callable using one set of parentheses instead of 2, as long as it's the only argument
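A brief sketch of that special case and its limit, using `sum` as the callable and a hypothetical helper named `parses`. Once a second argument is present, the generator expression needs its own parentheses.

```python
def parses(source: str) -> bool:
    """Return True if Python accepts the source, False on SyntaxError."""
    try:
        compile(source, "<example>", "eval")
    except SyntaxError:
        return False
    return True


total = sum(i * i for i in range(4))                   # sole argument
total_with_start = sum((i * i for i in range(4)), 10)  # second argument present
print(total, total_with_start)

print(parses("sum(i for i in range(4), 10)"))    # genexp not parenthesized
print(parses("sum((i for i in range(4)), 10)"))
```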

deep nova Feb 14, 2023, 6:34 AM

#

Ohhhhhh, yeah, that sounds right

#

Nice

rose schooner Feb 14, 2023, 10:24 AM

#

gray galleon `yield` takes precedence over `,`

or rather below ,

gray galleon Feb 14, 2023, 3:51 PM

#

why does python raise an exception to indicate the end of iteration?
why don’t iterators have a .done() or .running() method to check the state of the iterator and design for loops (or any other iteration structure) around it?
raising and catching exceptions are expensive but this is just a function call plus a boolean check

dusk comet Feb 14, 2023, 4:00 PM

#

gray galleon why does python raise an exception to indicate the end of iteration? why don’t i...

what should happen if iterator with .done() is exhausted but you are still calling next(it)?

gray galleon Feb 14, 2023, 4:01 PM

#

still raising an exception ig
just that normal iteration doesn’t have to involve exceptions

dusk comet Feb 14, 2023, 4:02 PM

#

There should be one-- and preferably only one --obvious way to do it.

gray galleon Feb 14, 2023, 4:03 PM

#

gray galleon still raising an exception ig just that normal iteration doesn’t have to involve...

like this is how you would implement for ```py
it = iter(iterable)
while it.running():
    x = next(it)
    # code here
```

feral island Feb 14, 2023, 4:04 PM

#

at the C level I'm not sure the exception is actually expensive. e.g. listiter_next doesn't even set the StopIteration exception, it's implicit https://github.com/python/cpython/blob/main/Objects/listobject.c#L3235

fallen slateBOT Feb 14, 2023, 4:04 PM

#

Objects/listobject.c line 3235

```c
listiter_next(_PyListIterObject *it)
```

feral island Feb 14, 2023, 4:04 PM

#

so the caller only has to check whether the return value is NULL, and if it is, call PyErr_Occurred (a few pointer comparisons)

gray galleon Feb 14, 2023, 4:05 PM

#

gray galleon like this is how you would implement `for` ```py it = iter(iterable) while it.ru...

looks simpler than original `for`
```py
it = iter(iterable)
while True:
    try:
        x = next(it)
        # code here
    except StopIteration:
        break
```

gray galleon Feb 14, 2023, 4:05 PM

#

feral island at the C level I'm not sure the exception is actually expensive. e.g. listiter_n...

but for the general case?

feral island Feb 14, 2023, 4:06 PM

#

the general case is that 99% of the time the iterable is consumed in C code

#

instead of calling next() directly

gray galleon Feb 14, 2023, 4:06 PM

#

ok

gray galleon Feb 14, 2023, 4:16 PM

#

dusk comet > There should be one-- and preferably only one --obvious way to do it.

i think after .done() is introduced catching StopIteration isn’t very obvious anymore and people gravitate towards .done()

feral island Feb 14, 2023, 4:18 PM

#

and now you have a new set of possible bugs where .done() is out of sync with whether __next__() throws

gray galleon Feb 14, 2023, 4:37 PM

#

https://tenor.com/view/crying-emoji-dies-gif-21956120

warm breach Feb 14, 2023, 5:17 PM

#

gray galleon like this is how you would implement `for` ```py it = iter(iterable) while it.ru...

but why do you need to implement for anyways pithink

gray galleon Feb 14, 2023, 5:36 PM

#

uh no
that is the pseudocode that cpython has to implement

gray galleon Feb 14, 2023, 5:36 PM

#

gray galleon why does python raise an exception to indicate the end of iteration? why don’t i...

according to this

quick trellis Feb 14, 2023, 6:11 PM

#

hey sorry to bother you, anyone experimented with building a c extension on nixos?
i can't get it to find the header files and functions

sacred yew Feb 14, 2023, 8:13 PM

#

quick trellis hey sorry to bother you, anyone experimented with building a c extension on nixo...

#c-extensions

quick trellis Feb 14, 2023, 8:13 PM

#

inactive

grave jolt Feb 14, 2023, 8:14 PM

#

A channel won't magically become active if you don't post anything to it 🙂

#

C extensions are indeed not the hottest thing these days

quick trellis Feb 14, 2023, 8:15 PM

#

oh wow neat observation

#

i did

#

lol

cursive wharf Feb 14, 2023, 9:32 PM

#

@sacred yew thanks, TIL #c-extensions is a thing on this Discord server.

warm breach Feb 14, 2023, 9:40 PM

#

gray galleon uh no that is the pseudocode that cpython has to implement

but CPython is in C though

#

there's not really try, the iter function just returns NULL when it's exhausted

warm breach Feb 14, 2023, 9:49 PM

#

gray galleon like this is how you would implement `for` ```py it = iter(iterable) while it.ru...

but essentially this wouldn't actually be valid safe code because the iterator could be exhausted after you check running(), perhaps by another thread
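
A deterministic sketch of that check-then-act problem. `RunningIter` is a hypothetical class written only for this example (no such protocol exists in Python), and the "other thread" is simulated by a plain second call.

```python
class RunningIter:
    """Hypothetical iterator with a running() method; not a real Python API."""

    def __init__(self, items):
        self._items = list(items)
        self._pos = 0

    def running(self):
        return self._pos < len(self._items)

    def __iter__(self):
        return self

    def __next__(self):
        if not self.running():
            raise StopIteration
        value = self._items[self._pos]
        self._pos += 1
        return value


it = RunningIter([1])
checked = it.running()   # consumer A checks and sees True
stolen = next(it)        # consumer B (say, another thread) takes the last item
try:
    next(it)             # consumer A now acts on a stale check
    outcome = "got a value"
except StopIteration:
    outcome = "StopIteration anyway"

print(checked, stolen, outcome)
```

So even with `running()`, callers that share an iterator would still have to be ready for the exception.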

#

so realistically you would need next() to return StopIteration instead of raising perhaps. But that would complicate most nested code. And since 3.11 a try block costs almost nothing when no exception is raised, so it can be cheaper than an explicit conditional

elder blade Feb 14, 2023, 9:54 PM

#

gray galleon why does python raise an exception to indicate the end of iteration? why don’t i...

Python doesn't design their APIs around method name calls like this (all classes with a __iter__ and __next__ would now need these methods). Exceptions come quite naturally as a sentinel value here
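
For cases where a sentinel value really is more convenient than catching the exception, the built-in `next()` already accepts a default as its second argument. A rough sketch, where `_MISSING` and `pairs` are names invented for the example:

```python
_MISSING = object()  # private sentinel, so None can still be a legitimate item


def pairs(iterable):
    """Group items two at a time, padding the last pair with None."""
    it = iter(iterable)
    result = []
    while True:
        first = next(it, _MISSING)   # returns the sentinel instead of raising
        if first is _MISSING:
            break
        second = next(it, None)
        result.append((first, second))
    return result


grouped = pairs([1, 2, 3])
print(grouped)
```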

spark magnet Feb 14, 2023, 9:55 PM

#

gray galleon why does python raise an exception to indicate the end of iteration? why don’t i...

are you sure raising exceptions are too expensive?

elder blade Feb 14, 2023, 9:58 PM

#

Yeah I would think an extra Python function call is much more expensive than just setting a variable and walking back a stack (calling the function would require working with the stack anyways)

deep nova Feb 14, 2023, 10:16 PM

#

Do any of y'all know how the parser's packrat-left-recursion hack works? I can't find any good non-academic articles on it

warm breach Feb 14, 2023, 10:30 PM

#

should python have a mutable string class?

#

!e

from sys import getsizeof


s = "hm🤔" * 100_000

print(getsizeof(s) // 1000, "KB")

ls = list(s)
items = sum(map(getsizeof, ls))
print((items + getsizeof(ls)) // 1000, "KB")

fallen slateBOT Feb 14, 2023, 10:31 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 1200 KB
002 | 20400 KB

warm breach Feb 14, 2023, 10:31 PM

#

this list of characters of a 1.2MB str is 20.4MB

pliant tusk Feb 14, 2023, 11:25 PM

#

warm breach should python have a mutable string class?

You can just encode to bytes then use a bytearray

warm breach Feb 14, 2023, 11:26 PM

#

that wouldn't handle utf-8 chars though 😔
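
One way around the variable-width problem, as a sketch only: encode to UTF-32 so every code point takes exactly 4 bytes, then index the bytearray in 4-byte steps. `WIDTH` and `set_char` are names made up for the example, and the cost is 4 bytes per character even for ASCII.

```python
WIDTH = 4  # bytes per code point in UTF-32

text = "hm🤔"
# The "-le" variant fixes the byte order and writes no BOM.
buf = bytearray(text.encode("utf-32-le"))


def set_char(buffer, index, char):
    """Overwrite the code point at `index` in place."""
    buffer[index * WIDTH:(index + 1) * WIDTH] = char.encode("utf-32-le")


set_char(buf, 1, "🐍")
result = buf.decode("utf-32-le")
print(result, len(buf))
```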

feral island Feb 14, 2023, 11:29 PM

#

you can write your own library to provide a memory-efficient mutable Unicode string class

#

if it becomes very widely useful it can be added to the stdlib. Personally I haven't often seen a need for it

#

actually seems like you can use https://docs.python.org/3.10/library/array.html with the u code?

gray galleon Feb 14, 2023, 11:34 PM

#

warm breach should python have a mutable string class?

character list and bytearray

warm breach Feb 14, 2023, 11:36 PM

#

!e

from string import ascii_lowercase

s = "🐍🤔" * 1000

ls = list(s)

print(len(set(map(id, ls))))

fallen slateBOT Feb 14, 2023, 11:36 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

warm breach Feb 14, 2023, 11:37 PM

#

is there a reason we can't intern non-ascii strings

gray galleon Feb 14, 2023, 11:38 PM

#

it is not known at compile time
so it can’t intern

warm breach Feb 14, 2023, 11:48 PM

#

!e

s = eval("'abc123'")
s *= 1000

ls = [s[i:i+5] for i in range(995)]
print(len(set(ls)))
print(len(set(map(id, ls))))

fallen slateBOT Feb 14, 2023, 11:48 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 6
002 | 995

warm breach Feb 14, 2023, 11:48 PM

#

hm curious, I guess we don't dynamically intern in this situation

prime estuary Feb 14, 2023, 11:48 PM

#

This isn't interning, Python has singletons for ASCII characters similar to the small ints from -5 to 256. We could have more singletons, but that'd mean a massive array to put them in...
Interning is only done normally for strings that are valid identifiers.

#

And it's done during compile time
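
A quick way to see the singleton effect, with the caveat that object identity here is a CPython implementation detail and the exact counts may differ on other interpreters or versions:

```python
ascii_chars = list("ab" * 1000)
emoji_chars = list("🐍🤔" * 1000)

# Count distinct objects while the lists keep every element alive.
ascii_ids = len(set(map(id, ascii_chars)))
emoji_ids = len(set(map(id, emoji_chars)))

# On CPython the ASCII count is expected to be tiny (shared singletons),
# while the emoji count is expected to be much larger (fresh objects).
print(ascii_ids, emoji_ids)
```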

warm breach Feb 14, 2023, 11:49 PM

#

right yeah, but we could dynamically intern strings? as an optimization

prime estuary Feb 14, 2023, 11:49 PM

#

Well it'd be a pessimisation most of the time.

warm breach Feb 14, 2023, 11:51 PM

#

you'd be looking at 50 / 80 bytes base for an empty string, then the size of the string bytes for each duplication

prime estuary Feb 14, 2023, 11:51 PM

#

It's only useful if you can expect that code is going to be doing comparisons or dict lookups with the string, and that there's going to be duplicates elsewhere.

warm breach Feb 14, 2023, 11:51 PM

#

vs 4 bytes for a reference

prime estuary Feb 14, 2023, 11:52 PM

#

But it costs a dict lookup to do the interning. So a hash calculation on top of that.

#

Wasteful in random string ops.

#

Better to leave that to the application, since that knows the uses of the string. An HTML parser for instance probably would want to intern the attribute names, while something parsing user accounts doesn't need to intern usernames...

warm breach Feb 15, 2023, 12:01 AM

#

hm yeah fair

gray galleon Feb 15, 2023, 12:28 AM

#

warm breach so realistically you would need `next()` to return `StopIteration` instead of ra...

is there any case where bubbling up StopIteration are useful?

warm breach Feb 15, 2023, 12:29 AM

#

gray galleon is there any case where bubbling up `StopIteration` are useful?

it also carries the return value of a generator
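
A short sketch of that: the `return` value of a generator travels on `StopIteration.value`, and `yield from` unpacks it for you. The generator names are invented for the example.

```python
def worker():
    yield "step 1"
    yield "step 2"
    return "final result"   # becomes StopIteration.value


def delegator():
    # yield from re-yields the steps and evaluates to the return value.
    result = yield from worker()
    yield f"worker returned: {result}"


gen = worker()
steps = []
try:
    while True:
        steps.append(next(gen))
except StopIteration as stop:
    returned = stop.value

delegated = list(delegator())
print(steps, returned, delegated)
```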

gray galleon Feb 15, 2023, 12:30 AM

#

again when are generator return value useful

#

coroutines?

warm breach Feb 15, 2023, 12:32 AM

#

gray galleon is there any case where bubbling up `StopIteration` are useful?

it's also usually simpler than checking every return result of next (the alternative)

with suppress(StopIteration):
    seq.append(next(it))
    other |= next(it)

vs

temp = next(it)
if not isinstance(temp, StopIteration):
    seq.append(temp)
temp2 = next(it)
if not isinstance(temp2, StopIteration):
    other |= temp2

gray galleon Feb 15, 2023, 12:32 AM

#

gray galleon again when are generator return value useful

and you can just return StopIteration(somevalue)?

warm breach Feb 15, 2023, 12:32 AM

#

warm breach it's also usually simpler than checking every return result of next (the alterna...

yeah this is what that would cause ^

#

looks worse imo

warm breach Feb 15, 2023, 12:35 AM

#

gray galleon coroutines?

essentially yeah

#

it makes it easy to make event loops that send and return values while yielding during some events

warm breach Feb 15, 2023, 12:37 AM

#

gray galleon and you can just `return StopIteration(somevalue)`?

that's a more general thing regarding exceptions I imagine

#

you can argue int() can return ValueError instead of raising as well

#

but python was just built around exceptions and making an one-off change like this would be odd and affect pretty much everyone

gray galleon Feb 15, 2023, 12:45 AM

#

!e
```py
print(StopIteration.mro())
```

fallen slateBOT Feb 15, 2023, 12:45 AM

#

@gray galleon :white_check_mark: Your 3.11 eval job has completed with return code 0.

[<class 'StopIteration'>, <class 'Exception'>, <class 'BaseException'>, <class 'object'>]

gray galleon Feb 15, 2023, 12:45 AM

#

why does it subclass Exception

sour thistle Feb 15, 2023, 12:46 AM

#

gray galleon why does it subclass `Exception`

because it is an exception?

#

!e
```py
next(iter([]))
```

fallen slateBOT Feb 15, 2023, 12:47 AM

#

@sour thistle :x: Your 3.11 eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 1, in <module>
003 | StopIteration

sour thistle Feb 15, 2023, 12:47 AM

#

it is also used when a generator returns something, but even then it is still an exception

raven ridge Feb 15, 2023, 1:39 AM

#

warm breach right yeah, but we could dynamically intern strings? as an optimization

!e Sure?
```py
import sys
sys.intern("🙂")
```

fallen slateBOT Feb 15, 2023, 1:39 AM

#

@raven ridge :warning: Your 3.11 eval job has completed with return code 0.

[No output]

warm breach Feb 15, 2023, 1:39 AM

#

warm breach !e ```py s = eval("'abc123'") s *= 1000 ls = [s[i:i+5] for i in range(995)] pri...

I mean like for something like this @raven ridge

#

why 995 new strings with only 6 unique ones

raven ridge Feb 15, 2023, 1:41 AM

#

interning would work by creating the new string, then looking up a canonical instance of that new string

gray galleon Feb 15, 2023, 1:43 AM

#

warm breach why 995 new strings with only 6 unique ones

how would that work
it would require hashing
which might be more expensive

raven ridge Feb 15, 2023, 1:43 AM

#

right. There are languages that intern every string, but yeah - that's how it'd be done.

#

interning strings is an optimization that trades off increased CPU usage for decreased memory usage
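
Applied to the earlier slicing example, opting in with `sys.intern` looks roughly like this. Each slice is still created first and then swapped for the canonical copy, which is where the extra CPU goes.

```python
import sys

s = "abc123" * 1000

plain = [s[i:i + 5] for i in range(995)]
interned = [sys.intern(s[i:i + 5]) for i in range(995)]

unique_values = len(set(plain))
# Every equal string now refers to one shared object.
interned_ids = len(set(map(id, interned)))

print(unique_values, interned_ids)
```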

swift imp Feb 15, 2023, 2:04 AM

#

warm breach it's also usually simpler than checking every return result of next (the alterna...

What is this |= stuff

warm breach Feb 15, 2023, 2:04 AM

#

a = a | b -> a |= b

swift imp Feb 15, 2023, 2:05 AM

#

No I understand that but I don't get the context of it

raven ridge Feb 15, 2023, 2:08 AM

#

I think it's just an example of an operation where you call next twice

rich cradle Feb 15, 2023, 5:39 AM

#

i feel like i'm reading this wrong. https://docs.python.org/3/reference/lexical_analysis.html#identifiers

am i correct in my understanding that an identifier may start with any character in XID_Start (or an underscore), followed by any number of characters from XID_Continue?

sour thistle Feb 15, 2023, 6:05 AM

#

!e
```py
import unicodedata as u
char = u.lookup('SCRIPT CAPITAL P')
print(char)
exec(f'{char} = 123')
exec(f'{char}abc = 456')
print(eval(char))
exec(f'print({char}abc)')
```

fallen slateBOT Feb 15, 2023, 6:05 AM

#

@sour thistle :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | ℘
002 | 123
003 | 456

sour thistle Feb 15, 2023, 6:06 AM

#

looks like you got it right?.. though I hope for the day I see that character being used in real code base never to come
edit; not sure, the hiragana/katakana marker is not working despite being in Other_ID_Start
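
For quick checks without `exec`, `str.isidentifier()` applies the same identifier definition from the language reference. A small sketch, where the `checks` dict is only there for display:

```python
import unicodedata

script_p = unicodedata.lookup("SCRIPT CAPITAL P")

checks = {
    "_x": "_x".isidentifier(),                            # underscore may start
    script_p + "abc": (script_p + "abc").isidentifier(),  # as in the eval above
    "x1": "x1".isidentifier(),                            # digits may continue
    "1x": "1x".isidentifier(),                            # digits may not start
}

for name, ok in checks.items():
    print(repr(name), ok)
```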

violet tendon Feb 15, 2023, 9:51 AM

#

if anyone has python interview questions, please do send them?

gray galleon Feb 15, 2023, 10:15 AM

#

this is not where to ask it

cursive wharf Feb 15, 2023, 4:14 PM

#

sour thistle looks like you got it right?.. though I hope for the day I see that character be...

That the Weierstrass elliptic function up there?

rich cradle Feb 15, 2023, 10:39 PM

#

sour thistle looks like you got it right?.. though I hope for the day I see that character be...

thanks! but uh, what's the hiragana marker?

sour thistle Feb 15, 2023, 10:48 PM

#

rich cradle thanks! but uh, what's the hiragana marker?

Other_ID_Start - explicit list of characters in PropList.txt to support backwards compatibility
https://www.unicode.org/Public/14.0.0/ucd/PropList.txt
309B..309C ; Other_ID_Start # Sk [2] KATAKANA-HIRAGANA VOICED SOUND MARK..KATAKANA-HIRAGANA SEMI-VOICED SOUND MARK

#

if you meant like what they are in the Japanese language, https://en.wikipedia.org/wiki/Dakuten_and_handakuten, but I'll assume that you just wanted to know the code points

rich cradle Feb 15, 2023, 10:51 PM

#

yeah, the codepoints. hm, that's interesting.

#

huh. the nfkc normalizations are also within that file (at least, whatever unicodedata.normalize("NFKC", ...) gives me).

#

oh, hmmmm

#

maybe that entire file isn't supported?

#

since that's part of XID_Continue according to these tables i'm looking at, but not XID_Start

#

and it's indeed valid as the second char in an ident

rich cradle Feb 16, 2023, 12:12 AM

#

does cpython have a parser testsuite somewhere?

fallen slateBOT Feb 16, 2023, 2:56 AM

#

:incoming_envelope: :ok_hand: applied mute to @north prawn until <t:1676516766:f> (10 minutes) (reason: duplicates rule: sent 4 duplicated messages in 10s).

The <@&831776746206265384> have been alerted for review.

warm breach Feb 16, 2023, 8:42 AM

#

@chilaxan#3116 do you know if there are places where python unconditionally assumes that some PyMethods exist on specific types?

#

not sure if making those pointers null after allocation is safe

sacred yew Feb 16, 2023, 9:14 AM

#

@pliant tusk your ping failed

pliant tusk Feb 16, 2023, 1:24 PM

#

warm breach not sure if making those pointers null after allocation is safe

I think mapping proxy probably does

deep nova Feb 16, 2023, 5:05 PM

#

I'm a bit confused about python's PEG parser

#

How exactly does backtracking/lookahead work?

#

In the case of lookahead, wouldn't one need to memoize the current parser state, capture a boolean representing whether or not the whatever is parsed properly, and then restore the original state?

#

As well, does python's parser use a streaming lexer, or does it lex the entire source and maintain a list of the tokens?

rose schooner Feb 16, 2023, 11:13 PM

#

deep nova In the case of lookahead, wouldn't one need to memoize the current parser state,...

i think it just looks ahead one token at a time

rose schooner Feb 16, 2023, 11:13 PM

#

deep nova As well, does python's parser use a streaming lexer, or does it lex the entire s...

seems like a streaming lexer

rose schooner Feb 16, 2023, 11:16 PM

#

rose schooner i think it just looks ahead one token at a time

so this makes it simple
https://github.com/python/cpython/blob/main/Parser/pegen.c#L333-L340
but yes they do store the current state, parse, and bring the state back again

fallen slateBOT Feb 16, 2023, 11:16 PM

#

Parser/pegen.c lines 333 to 340

```c
int
_PyPegen_lookahead_with_int(int positive, Token *(func)(Parser *, int), Parser *p, int arg)
{
    int mark = p->mark;
    void *res = func(p, arg);
    p->mark = mark;
    return (res != NULL) == positive;
}
```

rose schooner Feb 16, 2023, 11:17 PM

#

actually it's not a "state"
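
The same save, try, restore idea in a few lines of Python. This is a toy sketch, not CPython's parser: `ToyParser`, `expect` and `lookahead` are invented names, and the only "state" saved is an index into the token list.

```python
class ToyParser:
    """Toy token-list parser; `mark` is just an index into `tokens`."""

    def __init__(self, tokens):
        self.tokens = tokens
        self.mark = 0

    def expect(self, token):
        """Consume `token` if it is next, otherwise return None."""
        if self.mark < len(self.tokens) and self.tokens[self.mark] == token:
            self.mark += 1
            return token
        return None

    def lookahead(self, positive, func, *args):
        """Run `func`, rewind, and report whether it matched as expected."""
        mark = self.mark
        res = func(*args)
        self.mark = mark          # nothing is consumed, match or not
        return (res is not None) == positive


p = ToyParser(["a", "b"])
saw_a = p.lookahead(True, p.expect, "a")     # positive lookahead: &'a'
not_b = p.lookahead(False, p.expect, "b")    # negative lookahead: !'b'
position_after = p.mark
print(saw_a, not_b, position_after)
```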

neat delta Feb 16, 2023, 11:19 PM

#

I ran into an issue with scientific notation a bit ago, and did some experimenting, finding out that 1eX, which is a float, does not equal 10**X (an int) for X > 22. why 22? Is it because 1e22 is slightly less than 2**74 (~73.1), and 1e23 is above (~76.4)? If so, why 74 and not 64?

#

!e

print(10**22, f'{1e22:f}')
print(10**23, f'{1e23:f}')

fallen slateBOT Feb 16, 2023, 11:29 PM

#

@neat delta :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 10000000000000000000000 10000000000000000000000.000000
002 | 100000000000000000000000 99999999999999991611392.000000

rose schooner Feb 16, 2023, 11:29 PM

#

fallen slate <@298248019646611468> :white_check_mark: Your 3.11 eval job has completed with r...

i guess that's your answer

neat delta Feb 16, 2023, 11:29 PM

#

the question is why it happens - that's just a example for the curious

raven ridge Feb 16, 2023, 11:30 PM

#

!e
```py
print(2**53)
print(2**53 + 1)
print(2**53 + 1.0)
```

fallen slateBOT Feb 16, 2023, 11:30 PM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 9007199254740992
002 | 9007199254740993
003 | 9007199254740992.0

feral island Feb 16, 2023, 11:30 PM

#

floats can only represent integer values exactly up to some limit

raven ridge Feb 16, 2023, 11:30 PM

#

namely, 2**53

#

the 53 is because a 64-bit float has a 53-bit significand (52 stored bits plus an implicit leading 1)

charred pilot Feb 16, 2023, 11:30 PM

#

But 1e22 is much larger than 2**53. Shouldn't this error show up before 22?

raven ridge Feb 16, 2023, 11:31 PM

#

after 2**53, floats can represent every other integer.
after 2**54, they can represent every 4th integer.
after 2**55, they can represent every 8th integer.
etc.
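
That spacing can be read off directly with `math.ulp` (Python 3.9+), which returns the gap between a float and the next representable one. A sketch, with variable names chosen for the example:

```python
import math

# Gap between neighbouring floats at each power of two.
spacing = {exp: math.ulp(2.0 ** exp) for exp in (52, 53, 54, 55)}

gap_1e22 = math.ulp(1e22)
gap_1e23 = math.ulp(1e23)

# An integer in that range is exactly representable only if it is a
# multiple of the gap.
exact_22 = 10**22 % int(gap_1e22) == 0
exact_23 = 10**23 % int(gap_1e23) == 0

print(spacing)
print(gap_1e22, exact_22)
print(gap_1e23, exact_23)
```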

rose schooner Feb 16, 2023, 11:32 PM

#

neat delta the question is *why* it happens - that's just a example for the curious

https://github.com/python/cpython/blob/main/Objects/floatobject.c#L501-L503
if the float has the same amount of bits as the integer, it goes to this line

>>> from math import frexp, modf
>>> frexp(1e22)[1] == len(bin(10**22))-2 # 1e22 passes this check
True
>>> frexp(1e23)[1] == len(bin(10**23))-2 # 1e23 passes this check
True
>>> _, intpart = modf(1e22)
>>> intpart, int(intpart)
(1e+22, 10000000000000000000000)
>>> _, intpart = modf(1e23)
>>> intpart, int(intpart) # here's the problem
(1e+23, 99999999999999991611392)

fallen slateBOT Feb 16, 2023, 11:32 PM

#

Objects/floatobject.c lines 501 to 503

```c
/* v and w have the same number of bits before the radix
 * point.  Construct two ints that have the same comparison
 * outcome.
```

rose schooner Feb 16, 2023, 11:33 PM

#

the _ variable here is also checked but it doesn't matter in this case since it'll just be 0.0

neat delta Feb 17, 2023, 12:19 AM

#

i found a very grokkable answer: the significand for 10**23, which is 5**23, is 54 bits long, and thus cannot fit into a 64-bit float. 5**22 is only 52 bits
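
Spelled out as a sketch: `10**n == 2**n * 5**n`, and the power of two only moves the exponent, so the odd factor `5**n` is what has to fit in the 53-bit significand.

```python
bits_22 = (5 ** 22).bit_length()
bits_23 = (5 ** 23).bit_length()

# Round-trip through float to see which one survives unchanged.
roundtrip_22 = int(float(10 ** 22)) == 10 ** 22
roundtrip_23 = int(float(10 ** 23)) == 10 ** 23

print(bits_22, roundtrip_22)
print(bits_23, roundtrip_23)
```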

raven ridge Feb 17, 2023, 12:20 AM

#

raven ridge after `2**53`, floats can represent every other integer. after `2**54`, they can...

If you're willing to accept this

#

!e Then here's a tidy proof of why 23 is the cutoff point:
```py
import math

def modulus(power_of_two):
    return 2**(max(power_of_two - 52, 0))

for power in range(16, 24):
    val = 10**power
    prev_power_of_two = math.floor(math.log2(val))
    difference = val - 2**prev_power_of_two
    every_nth = modulus(prev_power_of_two)
    print(f"{val=:<25d} {prev_power_of_two=} {difference=:<23d} {every_nth=:<8d} {difference % every_nth=}")
```

fallen slateBOT Feb 17, 2023, 12:21 AM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | val=10000000000000000         prev_power_of_two=53 difference=992800745259008         every_nth=2        difference % every_nth=0
002 | val=100000000000000000        prev_power_of_two=56 difference=27942405962072064       every_nth=16       difference % every_nth=0
003 | val=1000000000000000000       prev_power_of_two=59 difference=423539247696576512      every_nth=128      difference % every_nth=0
004 | val=10000000000000000000      prev_power_of_two=63 difference=776627963145224192      every_nth=2048     difference % every_nth=0
005 | val=100000000000000000000     prev_power_of_two=66 difference=26213023705161793536    every_nth=16384    difference % every_nth=0
006 | val=1000000000000000000000    prev_power_of_two=69 difference=409704189641294348288   every_nth=131072   difference % every_nth=0
007 | val=10000000000000000000000   prev_power_of_two=73 difference=555267034260709572608   every_nth=2097152  difference % every_nth=0
008 | val=100000000000000000000000  prev_power_o
... (truncated - too long)

Full output: https://paste.pythondiscord.com/qifequfata.txt?noredirect

raven ridge Feb 17, 2023, 12:26 AM

#

it's the first power of 10 where
```py
(10**x - 2**math.floor(math.log2(10**x))) % 2**(max(math.floor(math.log2(10**x)) - 52, 0)) != 0
```

raven ridge Feb 17, 2023, 12:39 AM

#

neat delta i found a very grokkable answer: the significand for `10**23`, which is `5**23`,...

!e it's a bit trickier than that. That doesn't explain why
```py
print(100000000000000008388608 > 10**23)
print(100000000000000008388608 == 100000000000000008388608.0)
```

fallen slateBOT Feb 17, 2023, 12:40 AM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | True
002 | True

deep nova Feb 17, 2023, 12:45 AM

#

rose schooner so this makes it simple https://github.com/python/cpython/blob/main/Parser/pegen...

After reading through Guido's blog posts, it looks as though the whole source code is lexed prior to parsing

#

Which makes sense — I've looked at it every which way, and that's the only sensible option if you're backtracking

rich cradle Feb 17, 2023, 1:17 AM

#

a separate lexing step also makes handling semantic whitespace rather nice

#

the parser, which is generally a decent bit more complex, can just deal with tokens like newline, indent, and dedent, rather than messing around with that in the parser + all the rest of the syntax
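
The standard library's `tokenize` module shows what that token stream looks like. It is a separate entry point from the tokenizer the C parser uses, so treat this as an illustration of the idea rather than of the internals.

```python
import io
import tokenize

source = "if x:\n    y = 1\nz = 2\n"

# Whitespace structure arrives as explicit INDENT / DEDENT / NEWLINE tokens.
names = [
    tokenize.tok_name[tok.type]
    for tok in tokenize.generate_tokens(io.StringIO(source).readline)
]

print(names)
```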

gray galleon Feb 17, 2023, 4:52 AM

#

is parsing done in python or in C

feral island Feb 17, 2023, 5:52 AM

#

gray galleon is parsing done in python or in C

C

#

otherwise who would parse the Python code for the parser

raven ridge Feb 17, 2023, 6:01 AM

#

!otn a who parses the parsers

fallen slateBOT Feb 17, 2023, 6:01 AM

#

:ok_hand: Added who-parses-the-parsers to the names list.

signal river Feb 17, 2023, 6:13 AM

#

/imagine ja

#

hii

#

(╯°□°)╯︵ ┻━┻

gray galleon Feb 17, 2023, 8:03 AM

#

feral island otherwise who would parse the Python code for the parser

can't the parser code be precompiled?

#

if its python

dusk comet Feb 17, 2023, 8:18 AM

#

Who would compile it? To compile it you first should parse it, but you dont have parser

elder blade Feb 17, 2023, 8:46 AM

#

Well to be fair a parser is often able to parse itself

#

C language compilers are commonly written in C. You start by writing and using a different compiler, then once the initial compiler is done you can start compiling the compiler.

In the case of the very first C compiler, it was presumably written in Assembly or something like Fortran

radiant garden Feb 17, 2023, 8:57 AM

#

bootstrapping compilers are most useful for, well, compilers

gray galleon Feb 17, 2023, 8:58 AM

#

dusk comet Who would compile it? To compile it you first should parse it, but you dont have...

you compile it with the previous version

radiant garden Feb 17, 2023, 8:58 AM

#

you can't self host an entire interpreter since you'd still need some external VM at the core of it

#

and if that VM happens to be your CPU, then, well, congratulations on the compiler

#

the benefits for interpreted languages like Python aren't quite as major, the best you can get is the self-hosted bytecode compiler which is still fine but you'll need to use C to run it

gray galleon Feb 17, 2023, 9:04 AM

#

radiant garden you can't self host an entire interpreter since you'd still need some external V...

the parser and bytecode compiler are the most likely parts to self-host
because they produce and consume python objects (ast nodes)
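
Those AST nodes are already visible from Python through the `ast` module, which is what makes the idea plausible. A small sketch of the parse, compile, run pipeline using only public functions:

```python
import ast

tree = ast.parse("x = 1 + 2")           # source text -> AST (plain Python objects)
assign = tree.body[0]

code = compile(tree, filename="<ast>", mode="exec")   # AST -> code object

namespace = {}
exec(code, namespace)                    # the C interpreter still runs the bytecode

print(type(assign).__name__, namespace["x"])
```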

radiant garden Feb 17, 2023, 9:04 AM

#

Yes

gray galleon Feb 17, 2023, 9:07 AM

#

but they are not smh

rose schooner Feb 17, 2023, 9:07 AM

#

gray galleon if its python

yes but C is faster

gray galleon Feb 17, 2023, 9:09 AM

#

does that matter much for parsing and compiling?

rose schooner Feb 17, 2023, 9:10 AM

#

gray galleon does that matter much for parsing and compiling?

yep

gray galleon Feb 17, 2023, 9:11 AM

#

using python: slow
using C: have to deal with desugared API

rose schooner Feb 17, 2023, 10:08 AM

#

gray galleon using python: slow using C: have to deal with desugared API

too late to change cpython
despite there being criticisms about C it's still usable and the general rule for programming is that "if it works, it works"

deep nova Feb 17, 2023, 4:09 PM

#

Is there a reason that assignment expressions are bound more loosely than everything else? I think I'd like to try putting them at the bottom of the expression chain instead of at the top

#

You end up with situations like these...

#

```py
if not (something := some_expression):
    ...
```
or
```py
if something := some_expression and something_else:
    ...
```

#

The second case is a bit ambiguous 😐
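
How the second form actually parses, as a sketch with placeholder values: because `:=` binds most loosely, the name captures the whole `and` expression unless the walrus is parenthesized.

```python
a, b = "left", ""

# Parenthesized: `first` gets just `a`.
if (first := a) and b:
    pass

# Unparenthesized: `second` gets the result of `a and b`.
if second := a and b:
    pass

print(repr(first), repr(second))
```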

grave jolt Feb 17, 2023, 7:01 PM

#

||To make walrus expressions even more cluttered||

raven ridge Feb 17, 2023, 8:10 PM

#

deep nova Is there a reason that assignment expressions are bound more loosely than everyt...

Where do other languages place assignment expressions in the operator precedence hierarchy? C and C++ put it at nearly the very bottom as well (lower than or equal to everything but the comma operator)

#

It's the lowest in Java and C#

deep nova Feb 17, 2023, 8:16 PM

#

Hehe

#

Alrighty then

rich cradle Feb 17, 2023, 9:57 PM

#

does cpython have a parser test suite somewhere? i would like to make sure i'm doing this right, and don't trust the few tests i'm coming up with.

feral island Feb 17, 2023, 10:05 PM

#

rich cradle does cpython have a parser test suite somewhere? i would like to make sure i'm d...

it's scattered in Lib/test I think, e.g test_syntax.py

rich cradle Feb 17, 2023, 10:07 PM

#

oh wow, that's really scattered. there's some badsyntax_*.py in there too. but thanks, that's a helpful start.

#

seems like this isn't exactly in a format where i can easily throw it into another parser, but it's plenty helpful nonetheless

#

the reference mentions this. what does this really mean?

Indentation is rejected as inconsistent if a source file mixes tabs and spaces in a way that makes the meaning dependent on the worth of a tab in spaces; a TabError is raised in that case.
https://docs.python.org/3/reference/lexical_analysis.html#indentation

grave jolt Feb 17, 2023, 10:29 PM

#

rich cradle the reference mentions this. what does this really mean? > Indentation is reject...

like
```py
if foo():
    if bar():
        fizz()
<tab>buzz()
```

#

The meaning of this would change depending on whether <tab> is 4 characters or 8

rich cradle Feb 17, 2023, 10:30 PM

#

ugh, right, thanks.

grave jolt Feb 17, 2023, 10:31 PM

#

tl;dr tab bad

rose schooner Feb 17, 2023, 10:31 PM

#

grave jolt The meaning of this would change depending on whether `<tab>` is 4 characters or...

it's by default 8

grave jolt Feb 17, 2023, 10:31 PM

#

??

#

no there's no default

#

what kind of default? who sets it?

quick snow Feb 17, 2023, 10:32 PM

#

It's even stricter than it claims:

if foo():
<tab>    bar()
    <tab>bat()

Would be unambiguous, but is rejected

rose schooner Feb 17, 2023, 10:32 PM

#

grave jolt what kind of default? who sets it?

https://github.com/python/cpython/blob/main/Parser/tokenizer.c#L74
https://github.com/python/cpython/blob/main/Parser/tokenizer.c#L36-L37

fallen slateBOT Feb 17, 2023, 10:32 PM

#

Parser/tokenizer.c line 74

```c
tok->tabsize = TABSIZE;
```
Parser/tokenizer.c lines 36 to 37

```c
/* Don't ever change this -- it would break the portability of Python code */
#define TABSIZE 8
```

grave jolt Feb 17, 2023, 10:32 PM

#

o

quick snow Feb 17, 2023, 10:33 PM

#

o_O why?

rose schooner Feb 17, 2023, 10:33 PM

#

actually by the looks of the comment it's required to be 8 (at least in CPython)

grave jolt Feb 17, 2023, 10:33 PM

#

TIL

#

well that's cursed

rich cradle Feb 17, 2023, 10:33 PM

#

Tabs are replaced (from left to right) by one to eight spaces such that the total number of characters up to and including the replacement is a multiple of eight (this is intended to be the same rule as used by Unix). The total number of spaces preceding the first non-blank character then determines the line’s indentation. Indentation cannot be split over multiple physical lines using backslashes; the whitespace up to the first backslash determines the indentation.
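
A sketch of both halves of that rule. `str.expandtabs(8)` mimics the "next multiple of eight" replacement, and compiling a block whose two lines only line up if a tab is worth eight spaces shows the rejection. The source string is made up for the example.

```python
# A tab advances to the next multiple of eight columns.
expanded = "ab\tc".expandtabs(8)

# Line 2 is indented with eight spaces, line 3 with a single tab. They only
# match if a tab counts as eight, so the tokenizer refuses to guess.
source = "if True:\n        x = 1\n\ty = 2\n"
try:
    compile(source, "<example>", "exec")
    outcome = "compiled"
except TabError as exc:
    outcome = f"TabError: {exc.msg}"

print(repr(expanded))
print(outcome)
```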

#

the thing i linked earlier says this

rose schooner Feb 17, 2023, 10:35 PM

#

quick snow It's even stricter than it claims: ```py if foo(): <tab> bar() <tab>bat()...

there are some cases you can somehow bypass this
```py
def foo():
<tab> if bar():
<tab><tab><tab>baz()
```

#

well probably not technically "bypass"

#

but it's not consistent all the time

deep nova Feb 18, 2023, 3:45 AM

#

rich cradle > Tabs are replaced (from left to right) by one to eight spaces such that the to...

I've read this document over and over

#

I've tried to implement it a few times

#

And I will never understand this

#

Personally, I think the easier solution would be to restrict it to only tabs, and then enforce that an indentation may only be exactly one tab

#

Though, I've heard it should actually be possible to do the indentation matching right in the parser. Either way, it's a no win scenario

grave jolt Feb 18, 2023, 3:54 AM

#

I would restrict to only spaces

#

tab = poop

raven ridge Feb 18, 2023, 4:02 AM

#

deep nova Personally, I think the easier solution would be to restrict it to only tabs, an...

the Make language requires tabs, and that absolutely confuses the hell out of people.

#

for one notable issue with that, it makes it very hard to copy-paste code off the internet

#

https://stackoverflow.com/questions/16931770/makefile4-missing-separator-stop
https://stackoverflow.com/questions/920413/make-error-missing-separator
https://stackoverflow.com/questions/14109724/makefile-missing-separator
https://stackoverflow.com/questions/23927212/makefile2-missing-separator-stop
etc...

#

granted the arcane error message Make gives doesn't help, but if you make it impossible for people to copy-paste code out of a browser into their editor, even if your language gives a very nice error message about how lines can't start with leading spaces, it'll make things harder for your users.

gray galleon Feb 18, 2023, 5:38 AM

#

tab bad space good

flat gazelle Feb 18, 2023, 7:52 AM

#

A tabulator is for making tables, a space is for spacing.

sacred yew Feb 18, 2023, 11:09 AM

#

space bad tab good

magic rune Feb 18, 2023, 11:27 AM

#

Does anyone know why doing `i & 0x1` is slower than `i % 2`? I thought bitwise operations should be faster. (Btw it is faster when I tried it with numpy)

rose schooner Feb 18, 2023, 11:34 AM

#

magic rune Does anyone know why doing `i & 0x1` is slower than `i % 2`? I thought bitwise o...

how small were the numbers that you tested it on?

rose schooner Feb 18, 2023, 11:35 AM

#

magic rune Does anyone know why doing `i & 0x1` is slower than `i % 2`? I thought bitwise o...

ok so i can't seem to reproduce this

rose schooner Feb 18, 2023, 11:36 AM

#

magic rune Does anyone know why doing `i & 0x1` is slower than `i % 2`? I thought bitwise o...

!ti ```py
[n&1 for n in range(10000)]

fallen slateBOT Feb 18, 2023, 11:36 AM

#

@rose schooner You've already got a job running - please wait for it to finish!

#

@rose schooner :white_check_mark: Your 3.11 timeit job has completed with return code 0.

500 loops, best of 5: 574 usec per loop

rose schooner Feb 18, 2023, 11:36 AM

#

!ti ```py
[n%2 for n in range(10000)]

fallen slateBOT Feb 18, 2023, 11:36 AM

#

@rose schooner :white_check_mark: Your 3.11 timeit job has completed with return code 0.

500 loops, best of 5: 628 usec per loop

rose schooner Feb 18, 2023, 11:36 AM

#

@magic rune i don't see it

magic rune Feb 18, 2023, 11:39 AM

#

This is what i tried:

rose schooner Feb 18, 2023, 11:52 AM

#

magic rune This is what i tried:

what python version?

#

!ti ```py
for i in range(1_000): i & 0x1

fallen slateBOT Feb 18, 2023, 11:52 AM

#

@rose schooner :white_check_mark: Your 3.11 timeit job has completed with return code 0.

5000 loops, best of 5: 48.6 usec per loop

rose schooner Feb 18, 2023, 11:52 AM

#

!ti ```py
for i in range(1_000): i % 2

fallen slateBOT Feb 18, 2023, 11:52 AM

#

@rose schooner :white_check_mark: Your 3.11 timeit job has completed with return code 0.

5000 loops, best of 5: 54.6 usec per loop

rose schooner Feb 18, 2023, 11:53 AM

#

@magic rune still can't reproduce

#

3.11+ has improved in these areas i think

magic rune Feb 18, 2023, 11:57 AM

#

rose schooner 3.11+ has improved in these areas i think

Oh i see. I'm using 3.10. I thought maybe it had something to do with the fact that python ints aren't of fixed size compared to numpy's but I don't really know how they're implemented actually

rose schooner Feb 18, 2023, 12:09 PM

#

magic rune Oh i see. I'm using 3.10. I thought maybe it had something to do with the fact t...

i think they've added a code path to make it faster for ints with an absolute value less than 2**30 (default)

magic rune Feb 18, 2023, 12:18 PM

#

!ti ```py
for i in range(2**30, 2**30 + 1000): i % 2

fallen slateBOT Feb 18, 2023, 12:18 PM

#

@magic rune :white_check_mark: Your 3.10 timeit job has completed with return code 0.

5000 loops, best of 5: 85.6 usec per loop

magic rune Feb 18, 2023, 12:18 PM

#

!ti ```py
for i in range(2**30, 2**30 + 1000): i & 0x1

fallen slateBOT Feb 18, 2023, 12:18 PM

#

@magic rune :white_check_mark: Your 3.10 timeit job has completed with return code 0.

5000 loops, best of 5: 72.1 usec per loop

magic rune Feb 18, 2023, 12:19 PM

#

@rose schooner Yeah, the modulus operator seems to be a bit slower in 3.10. Thanks for the help!
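
For anyone who wants to repeat the comparison locally, here is a minimal sketch using the standard `timeit` module. The helper name `time_parity_ops` and the loop counts are only illustrative, and absolute numbers will vary with interpreter version and machine.

```python
import timeit


def time_parity_ops(start, count=1_000, number=500, repeat=5):
    """Return best-of-`repeat` seconds per loop for `i & 1` and `i % 2`.

    `start` lets you compare small ints against ints at or above 2**30.
    """
    setup = f"r = range({start}, {start} + {count})"
    statements = {
        "and": "for i in r: i & 1",
        "mod": "for i in r: i % 2",
    }
    results = {}
    for label, stmt in statements.items():
        best = min(timeit.repeat(stmt, setup=setup, number=number, repeat=repeat))
        results[label] = best / number
    return results


if __name__ == "__main__":
    for start in (0, 2**30):
        timings = time_parity_ops(start)
        print(start, {k: f"{v * 1e6:.1f} usec" for k, v in timings.items()})
```

Running it under both 3.10 and 3.11 should show whether the gap between the two operators depends on the version and on the size of the ints.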

unkempt rock Feb 19, 2023, 5:43 AM

#

guys what does def do

boreal umbra Feb 19, 2023, 12:53 PM

#

unkempt rock guys what does def do

It's the keyword for defining a function

warm breach Feb 19, 2023, 6:21 PM

#

is there a way to stop python from garbage collecting a ctypes.Structure instance

raven ridge Feb 19, 2023, 6:21 PM

#

keep a reference to it? 😛

warm breach Feb 19, 2023, 6:22 PM

#

https://docs.python.org/3/c-api/typeobj.html#quick-reference

#

so I'm allocating these pointers to PyMethods structs

#

how does it normally work in C, who keeps the reference to them?

#

https://github.com/python/cpython/blob/main/Objects/longobject.c#L6247

fallen slateBOT Feb 19, 2023, 6:25 PM

#

Objects/longobject.c line 6247

static PyNumberMethods long_as_number = {```

warm breach Feb 19, 2023, 6:25 PM

#

oh they're static? hm

#

I suppose I could just PyMem_Malloc the size of the struct instead of making a ctypes.Structure instance

#

wait no that would never get freed

raven ridge Feb 19, 2023, 6:28 PM

#

they're static for static types, but they're dynamic for heap types. I'm not sure how that actually works for the heap types - I'm guessing that the type object itself holds pointers to them, and knows to free them when it is garbage collected

warm breach Feb 19, 2023, 6:31 PM

#

where are python heap types even defined in c

#

PyType_New..?

#

https://github.com/python/cpython/blob/3.11/Objects/typeobject.c#L2757

fallen slateBOT Feb 19, 2023, 6:34 PM

#

Objects/typeobject.c line 2757

PyHeapTypeObject *et = (PyHeapTypeObject *)type;```

warm breach Feb 19, 2023, 6:34 PM

#

PyHeapTypeObject apparently

raven ridge Feb 19, 2023, 6:42 PM

#

ah, this seems to be the answer: https://github.com/python/cpython/blob/bdc93b8a3563b4a3adb25fa902c0c879ccf427f6/Include/internal/pycore_object.h#L357-L360

fallen slateBOT Feb 19, 2023, 6:42 PM

#

Include/internal/pycore_object.h lines 357 to 360

// Access macro to the members which are floating "behind" the object
static inline PyMemberDef* _PyHeapType_GET_MEMBERS(PyHeapTypeObject *etype) {
    return (PyMemberDef*)((char*)etype + Py_TYPE(etype)->tp_basicsize);
}```

raven ridge Feb 19, 2023, 6:42 PM

#

the members of a heap type are stored on the heap after the type

warm breach Feb 19, 2023, 6:43 PM

#

hm

#

I guess heap types frees that itself?

#

I suppose I can make a WeakKeyDictionary with type keys and values of lists of PyMethod ctypes.Structure instances

#

so the PyMethods GC should come after the type...?

warm breach Feb 19, 2023, 8:42 PM

#

@pliant tusk btw what is even going on here with this lambda 👀 https://github.com/chilaxan/fishhook/blob/master/fishhook/fishhook.py#L89

fallen slateBOT Feb 19, 2023, 8:42 PM

#

fishhook/fishhook.py line 89

def getdict(cls, E=type('',(),{'__eq__':lambda s,o:o})()):```

warm breach Feb 19, 2023, 8:42 PM

#

that type thing gets the dict of a mapping proxy?

pliant tusk Feb 19, 2023, 8:42 PM

#

Exploits a bug in mapping proxies to get the wrapped mapping

#

*a bug that has been explicitly marked will not fix

pliant tusk Feb 19, 2023, 8:45 PM

#

warm breach that type thing gets the dict of a mapping proxy?

It's caused by the order of operations: first proxy->wrapped.__eq__(E), which returns NotImplemented, then E.__eq__(proxy->wrapped), which returns whatever it is passed, in this case the wrapped mapping
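
A minimal sketch of that fallback, using a plain dict behind `types.MappingProxyType` rather than a type's `__dict__`. The class name `Echo` is made up for illustration; the behaviour is as described in this thread for CPython 3.11.

```python
from types import MappingProxyType


class Echo:
    """Helper whose __eq__ returns whatever it is compared against."""

    def __eq__(self, other):
        return other


backing = {"a": 1}
proxy = MappingProxyType(backing)

# The proxy forwards the comparison to the wrapped dict. dict.__eq__ does not
# know Echo and returns NotImplemented, so Python tries the reflected call
# Echo().__eq__(backing), which hands back the wrapped dict itself.
leaked = proxy == Echo()

# The proxy is read-only, but the leaked dict is not.
leaked["b"] = 2
```

After this, `proxy["b"]` is visible through the read-only view, because `leaked` and `backing` are the same object.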

warm breach Feb 19, 2023, 8:49 PM

#

interesting 👀

warm breach Feb 19, 2023, 9:36 PM

#

!e

import sys
from einspect.structs import PyTypeObject

v = vars(sys)["int_info"]
t = PyTypeObject(v)
print(t.tp_name)
print(t.tp_name.decode())

fallen slateBOT Feb 19, 2023, 9:36 PM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | b'\x14\xca\x9a;'
002 | ʚ;

warm breach Feb 19, 2023, 9:36 PM

#

uh

#

what is up with this

#

https://github.com/python/cpython/blob/main/Objects/longobject.c#L6328

fallen slateBOT Feb 19, 2023, 9:39 PM

#

Objects/longobject.c line 6328

static PyTypeObject Int_InfoType;```

warm breach Feb 19, 2023, 9:43 PM

#

oh nvm type(v) is <class 'sys.int_info'> here

warm breach Feb 20, 2023, 9:56 AM

#

@pliant tusk finally have object allocations working now, with your mro patch https://github.com/ionite34/einspect/blob/main/src/einspect/views/view_type.py#L114-L158

#

also spent an hour wondering where random segfaults coming from before realizing I didn't keep a reference to the new struct and it got GC'd https://github.com/ionite34/einspect/blob/main/src/einspect/views/view_type.py#L142

fallen slateBOT Feb 20, 2023, 9:58 AM

#

src/einspect/views/view_type.py line 142

PY_METHOD_STRUCTS.setdefault(object, []).append(base)```

pliant tusk Feb 20, 2023, 3:27 PM

#

warm breach @pliant tusk finally have object allocations working now, with your mro...

nice, I would recommend caching object for use inside of the custom __base__ descriptor to prevent a user from changing object in __builtins__ which would break your check

raven ridge Feb 20, 2023, 5:00 PM

#

pliant tusk nice, I would recommend caching `object` for use inside of the custom `__base__`...

!e If you want a way to get object without __builtins__, perhaps ```py
print((1).__class__.__bases__[-1])

fallen slateBOT Feb 20, 2023, 5:00 PM

#

@raven ridge :white_check_mark: Your 3.11 eval job has completed with return code 0.

<class 'object'>

pliant tusk Feb 20, 2023, 5:06 PM

#

raven ridge !e If you want a way to get `object` without `__builtins__`, perhaps ```py print...

Oh I was referring to just caching `object`; it's fine to retrieve it at import time imo

#

fishhook stores it as a default arg at import time

raven ridge Feb 20, 2023, 5:07 PM

#

pliant tusk Oh I was referring to just caching `object` its fine to retrieve it at import ti...

> it's fine to retrieve it at import time imo
What if the user modifies __builtins__ before importing fishhook?

pliant tusk Feb 20, 2023, 5:09 PM

#

i considered that an acceptable risk. I figure that libraries like fishhook and einspect are being used by users likely to modify things that are normally consistent, but that those changes would typically be facilitated by libs like fishhook or einspect (so any odd state would come after import)

raven ridge Feb 20, 2023, 5:11 PM

#

🤷‍♂️ Python doesn't really have "import time" as a concept. Everything that happens in a Python program happens "at import time". Your program kicks off when the interpreter imports your code as __main__, and interpreter finalization starts as soon as __main__ is done being imported

pliant tusk Feb 20, 2023, 5:13 PM

#

it does conceptually for imported libraries. things that happen in global state of the module I would argue could be considered import time execution, vs things that happen when a user uses the library which I would consider run/use time

#

for example, fishhook calls patch_object() at import time to patch the object type, vs the patches that happen passively to other types as the lib is used

raven ridge Feb 20, 2023, 5:14 PM

#

pliant tusk it does conceptually for imported libraries. things that happen in global state ...

sure, but both of those are "at import time" with respect to the imports of other modules

pliant tusk Feb 20, 2023, 5:15 PM

#

fair enough, I guess it is better thought of as order of execution

warm breach Feb 20, 2023, 5:47 PM

#

pliant tusk nice, I would recommend caching `object` for use inside of the custom `__base__`...

huh, I thought object would get captured as a constant

#

guess not, it's just a normal name lookup

pliant tusk Feb 20, 2023, 5:48 PM

#

Yea, only upper locals would get captured like that
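
A small sketch of the difference between a plain name lookup and caching `object` as a default argument. The function names are made up for illustration, and rebinding `builtins.object` is done here only briefly to simulate a user who has changed the builtin.

```python
import builtins


def is_object_late(x):
    # `object` is resolved on every call: module globals first, then builtins.
    return x is object


def is_object_early(x, _object=object):
    # The default is evaluated once, when the `def` statement runs.
    return x is _object


real_object = builtins.object
builtins.object = int  # simulate a user rebinding the builtin name
try:
    late_result = is_object_late(real_object)
    early_result = is_object_early(real_object)
finally:
    builtins.object = real_object  # always restore the real builtin
```

Here `late_result` ends up `False` because the lookup found `int`, while `early_result` stays `True` because the default was bound before the rebinding happened.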

warm breach Feb 20, 2023, 5:49 PM

#

pliant tusk Oh I was referring to just caching `object` its fine to retrieve it at import ti...

oh yeah I made mine not import time either, it runs on the first attempted allocation on object

#

https://github.com/ionite34/einspect/blob/main/src/einspect/views/view_type.py#L175-L176

fallen slateBOT Feb 20, 2023, 5:50 PM

#

src/einspect/views/view_type.py lines 175 to 176

if obj == object:
    _patch_object_base()```

rose schooner Feb 21, 2023, 10:20 AM

#

warm breach <https://github.com/ionite34/einspect/blob/main/src/einspect/views/view_type.py#...

why not is?

warm breach Feb 21, 2023, 10:21 AM

#

rose schooner why not `is`?

obj is a PyTypeObject there

#

__eq__ is true for other PyObjects at the same address or python objects at the same address

rose schooner Feb 21, 2023, 10:22 AM

#

warm breach `__eq__` is true for other PyObjects at the same address or python objects at th...

ok

warm breach Feb 21, 2023, 10:22 AM

#

might be a bit too implicit I dunno

#

but I was doing a bunch of obj.address == address(object) before

#

so I just added it to `__eq__`
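
Roughly what an address-based `__eq__` looks like, as a sketch. `AddressView` is a hypothetical stand-in, not the real einspect `PyTypeObject` struct, and it relies on CPython's `id()` being the object's memory address.

```python
class AddressView:
    """Hypothetical stand-in for a struct view that wraps an object's address."""

    def __init__(self, obj):
        self.address = id(obj)  # in CPython, id() is the memory address

    def __eq__(self, other):
        if isinstance(other, AddressView):
            # Two views are equal when they point at the same address.
            return self.address == other.address
        # Any other Python object: compare against that object's own address.
        return self.address == id(other)


view = AddressView(object)
```

With this, `view == object` is true even though `view is object` is false, which is why `is` would not work for the check.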

spring musk Feb 21, 2023, 11:55 AM

#

hi>

rancid shadow Feb 21, 2023, 1:29 PM

#

In search of a sco person help me in India

little marlin Feb 21, 2023, 2:21 PM

#

Does anyone know why there is not a more expressive description for the error raised when trying to create a file with the same name as an existing directory (on Windows)? Is there some limitation that prevents finding out that this is the issue or is it just deemed unimportant?

#

(it results in a PermissionError)

warm breach Feb 21, 2023, 3:36 PM

#

little marlin Does anyone know why there is not more a expressive description for the error ra...

I'm not sure windows tells you anything different when you either really lack permissions or the destination is a directory

little marlin Feb 21, 2023, 3:37 PM

#

Hmm, I might try and find the relevant source code later, but it's been ages since I've read/written any C

flat gazelle Feb 21, 2023, 3:37 PM

#

I am pretty sure python just builds the error message it gets from windows into an exception

warm breach Feb 21, 2023, 3:38 PM

#

same thing in C# as well

#

opening a directory throws UnauthorizedAccessException

little marlin Feb 21, 2023, 3:38 PM

#

ah, figures that it'd be a windows issue

warm breach Feb 21, 2023, 3:39 PM

#

we could technically make windows file io explicitly check for directories first

#

not sure if that would have other problems

little marlin Feb 21, 2023, 3:39 PM

#

well I feel like windows should at least know at the point where it's denying you permission to write to that address

#

maybe there's some security related reason to prevent enumeration?

warm breach Feb 21, 2023, 3:40 PM

#

well the reason is you just can't open a directory in read mode

#

and how it prevents you is not giving you permission

#

you can modify other metadata attributes of a directory

little marlin Feb 21, 2023, 3:41 PM

#

yeah but if I try to write a file to the same path as a directory it also just says no permission

feral island Feb 21, 2023, 3:41 PM

#

warm breach we could technically make windows file io explicitly check for directories first

that would be vulnerable to race conditions, no?

little marlin Feb 21, 2023, 3:42 PM

#

True

#

I guess it was a more reasonable assumption that it was a windows limitation than something the people working on python didn't care to implement

#

I still wonder if there's an OS reason the windows error message isn't more expressive

warm breach Feb 21, 2023, 3:43 PM

#

feral island that would be vulnerable to race conditions, no?

yeah, though, it would only be a race condition that changes which of the 2 errors you get

#

which isn't too bad as things go I guess

little marlin Feb 21, 2023, 3:44 PM

#

since if you tried to open files with all sorts of names in a location you don't have access to you could enumerate the directory structure of the drive

warm breach Feb 21, 2023, 3:44 PM

#

it would be strange for python io opens to do anything else beyond actually opening the file though

warm breach Feb 21, 2023, 4:02 PM

#

feral island that would be vulnerable to race conditions, no?

https://github.com/python/cpython/blob/main/Modules/_io/fileio.c#L450-L453

fallen slateBOT Feb 21, 2023, 4:02 PM

#

Modules/_io/fileio.c lines 450 to 453

/* On Unix, open will succeed for directories.
   In Python, there should be no file objects referring to
   directories, so we need a check.  */
if (S_ISDIR(fdfstat.st_mode)) {```

warm breach Feb 21, 2023, 4:02 PM

#

we seem to do the same thing for unix (albeit after opening)?

feral island Feb 21, 2023, 4:02 PM

#

that doesn't do a new syscall does it?

#

or rather it does (it calls fstat on the fd a few lines up), but because it's called on the open fd, not a path, it's not vulnerable to race conditions where someone overwrites the path
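
The open-then-fstat pattern can be sketched in Python like this. `open_regular_file` is a hypothetical helper, not what `io.open` does internally, and on Windows `os.open` on a directory fails earlier with a `PermissionError`.

```python
import os
import stat


def open_regular_file(path, flags=os.O_RDONLY):
    """Open `path` and reject directories by inspecting the open descriptor.

    The check uses os.fstat(fd), so it describes the object that was actually
    opened, even if the path gets replaced by something else afterwards.
    """
    fd = os.open(path, flags)
    try:
        if stat.S_ISDIR(os.fstat(fd).st_mode):
            raise IsADirectoryError(f"{path!r} is a directory")
    except BaseException:
        os.close(fd)  # do not leak the descriptor on failure
        raise
    return fd
```

Checking the path with `os.path.isdir` before opening would leave a window between the check and the open; checking the descriptor afterwards does not.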

warm breach Feb 21, 2023, 4:10 PM

#

hm

#

does unix guarantee a file can't be deleted when in use?

#

iirc removals are scheduled after all fds are closed but wasn't sure if that was a standard or overridable

feral island Feb 21, 2023, 4:11 PM

#

a file can be deleted but the fd remains valid

#

this is a common pitfall around disk usage: sometimes your disk shows up as full but the files you can see don't account for all the used disk space
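
A quick sketch of that behaviour, assuming a POSIX system (on Windows the `os.unlink` call would normally fail while the file is open). The function name is made up for illustration.

```python
import os
import tempfile


def read_after_unlink(payload=b"still here"):
    """Write a temp file, delete its name, then read it through the open fd."""
    fd, path = tempfile.mkstemp()
    try:
        os.write(fd, payload)
        os.unlink(path)                     # the directory entry is gone...
        name_exists = os.path.exists(path)
        os.lseek(fd, 0, os.SEEK_SET)        # ...but the descriptor still works
        data = os.read(fd, len(payload))
    finally:
        os.close(fd)  # the space can be reclaimed once the last fd is closed
    return name_exists, data
```

Until that final close, the data still occupies disk space even though no listing of the directory will show the file.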

wind helm Feb 21, 2023, 10:20 PM

#

Hello, hopefully this is the right channel.

I've a question about something I've never dealt with before. I've written a module with a single function that might be useful in several projects.
I want to move it out of the project I'm working on for that reason, so I don't have to maintain the code in several places.

Do I necessarily need to make it a package?

I know I can import a module from a different folder via sys, but that still requires knowing my local path, which doesn't sound very elegant.
What's the best approach?

tribal dirge Feb 21, 2023, 10:34 PM

#

Hi

warm breach Feb 21, 2023, 11:42 PM

#

wind helm Hello, hopefully this is the right channel. I've a question for something I nev...

sure, sounds fine, define a pyproject.toml and you can pip install -e . it

wind helm Feb 22, 2023, 7:02 AM

#

@warm breach so no need to go for a full package process basically

wind helm Feb 22, 2023, 7:25 AM

#

Thanks. And when it comes to making updates, what will happen? Will I just run `pip install --upgrade package`?

dusk comet Feb 22, 2023, 10:17 AM

#

raven ridge 🤷‍♂️ Python does't really have "import time" as a concept. Everything that happ...

It is not true if you embed the interpreter in another app. You can initialize the interpreter, do some stuff and continue to do other things. The interpreter still exists, but it is doing nothing and all imports are finished

lunar harbor Feb 23, 2023, 3:38 PM

#

dusk comet It is not true if you embed interpreter in another app. You can initialize inter...

what @raven ridge was trying to say is that import is code execution. there is no "import phase" in python that is distinct from execution.

gray galleon Feb 23, 2023, 4:44 PM

#

which is probably why you can import in function bodies
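
A small sketch of "import is code execution": a throwaway module is written to a temporary directory and imported from inside a function body. The module name `import_time_demo` and the `EVENTS` list are made up for the example.

```python
import importlib
import os
import sys
import tempfile

MODULE_SOURCE = '''
EVENTS = ["module body ran"]

def use():
    EVENTS.append("use() called")
'''


def import_demo(name="import_time_demo"):
    """Write a throwaway module, import it, and return the module object."""
    with tempfile.TemporaryDirectory() as tmpdir:
        with open(os.path.join(tmpdir, name + ".py"), "w") as fh:
            fh.write(MODULE_SOURCE)
        sys.path.insert(0, tmpdir)
        try:
            importlib.invalidate_caches()
            # The import below runs the module body as ordinary code,
            # in the middle of this function call.
            return importlib.import_module(name)
        finally:
            sys.path.remove(tmpdir)


demo = import_demo()
```

By the time `import_demo()` returns, the module-level assignment has already run; `demo.use()` only runs later, when the caller decides to call it.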

rose schooner Feb 23, 2023, 10:49 PM

#

dusk comet It is not true if you embed interpreter in another app. You can initialize inter...

wait what does this even mean

#

if there's python code running, the interpreter is doing things

flat gazelle Feb 23, 2023, 10:53 PM

#

well, you can init the python interpreter, then do some random operations without yielding control to python, then later actually use your interpreter

#

at which point there are sort of two distinct times, one is at interpreter setup time, another is at actually using the interpreter time

molten elk Feb 24, 2023, 2:32 AM

#

lunar harbor what @raven ridge was trying to say is that import is code execution. ...

I think there is an import phase separate from execution. After all, execution is only a side effect of import.

The import phase would be either the process of sys.modules.__getitem__ or the process of working through the sys.path_hooks/sys.meta_path mechanism. Neither of these is particularly interesting in general.

I've litigated this point before in trying to characterise the distinction between “compile-time” and “run-time” in Python. Some of the same arguments may apply.

In practice, is there not a common and meaningful distinction between the execution of module-level code at something akin to a “compile-time” and execution of everything else at “run-time”? This is a meaningful distinction in practice, despite the former not really being “compile-time” (since the Python compiler historically did only, like, three interesting things).

raven ridge Feb 24, 2023, 3:12 AM

#

molten elk I think there is an import phase separate from execution. After all, execution i...

> I think there is an import phase separate from execution.
There isn't, though. Like I said, when you run a Python script, the Python interpreter imports whatever module or file you tell it to, and literally as soon as that module or file finishes being imported, the interpreter starts doing its teardown and getting ready to exit.

molten elk Feb 24, 2023, 3:16 AM

#

raven ridge > I think there is an import phase separate from execution. There isn't, though....

Doesn't that presume the standard entry point?

There are other ways into PyEval_EvalFrameEx that do not require passing through the import mechanism.

For example, python -c goes through pymain_run_command which goes straight to the “very high-level embedding” PyRun_SimpleStringFlags which I don't think ever passes through the import machinery itself.

molten elk Feb 24, 2023, 3:18 AM

#

molten elk Doesn't that presume the standard entry point? There are other ways into `PyEva...

It's incorrect to say that importing and programme execution are one and the same.

After all, when we say “import,” we are generally referring to the mechanisms surrounding import, which perform execution only as a side-effect.

raven ridge Feb 24, 2023, 4:15 AM

#

molten elk It's incorrect to say that importing and programme execution are one and the sam...

> Doesn't that presume the standard entry point?
> There are other ways into PyEval_EvalFrameEx that do not require passing through the import mechanism.
Fair enough, and that's true for the example about embedding the interpreter into another program as well. But python foo.py and python -m foo, which are the overwhelmingly common ways to run Python code, spend 100% of their time underneath an import call.
> It's incorrect to say that importing and programme execution are one and the same.
Which is exactly why I think it is correct to say that importing and program execution are one and the same, or at least so tightly coupled that it's not useful to distinguish between them.

#

> After all, when we say “import,” we are generally referring to the mechanisms surrounding import, which perform execution only as a side-effect.
I think that's distinctly not what was being referred to in the comment that I replied to when I kicked this whole conversation off, also.

raven ridge Feb 24, 2023, 4:17 AM

#

pliant tusk Oh I was referring to just caching `object` its fine to retrieve it at import ti...

this one this

molten elk Feb 24, 2023, 4:27 AM

#

raven ridge > After all, when we say “import,” we are generally referring to the mechanisms ...

As I understood, the original comment was referring to early-binding something from builtins.

I suppose the implication of the original comment was that there was some separate “import-time” mechanism that occurred like a pre-runtime compilation step and, therefore, preëmpt other runtime changes.

But your comment was that the import mechanism is so closely tied to module execution, and this happens during normal execution, so there is no distinct interval of time during which only import activities occur.

raven ridge Feb 24, 2023, 4:28 AM

#

right - my point was that nothing stops someone from having messed with builtins before importing the code that wants to early bind something from builtins.

molten elk Feb 24, 2023, 4:29 AM

#

molten elk As I understood, the original comment was referring to early-binding something f...

In this sense, it's meaningful to deny the presence of an import time, since the majority of module execution, as you note, occurs nested under some hierarchy of PyImport_ImportModule* calls.

molten elk Feb 24, 2023, 4:34 AM

#

raven ridge right - my point was that nothing stops someone from having messed with builtins...

I get the thrill of writing these patching/introspection/interception libraries and the gimmick of doing this at runtime, but I don't quite see the advantage of not just writing some (much simpler) C code to patch things (which would probably be even easier if you also swap out the entry point.)

warm breach Feb 24, 2023, 9:07 AM

#

molten elk I get the thrill of writing these patching/introspection/interception libraries ...

Are C extensions really easier? Don't you have to compile them and then import it

#

instead of just interacting with python code normally that works with any interactive session like repl / jupyter

molten elk Feb 24, 2023, 9:47 AM

#

warm breach Are C extensions really easier? Don't you have to compile them and then import i...

You can distribute binary packages via PyPI, so the target machine doesn’t need a compiler tool chain.

You can avoid a lot of the contortions involved with, e.g., finding things, since (the current state of) the Python C-API exposes a lot of the symbols you want.

warm breach Feb 24, 2023, 9:50 AM

#

sure yeah it might be more performant but binaries can be annoying too, non stable ABI and struct attributes regularly change between python versions

#

but if you're just exploring or debugging I've found being able to access internal attributes from live python to be useful

molten elk Feb 24, 2023, 10:00 AM

#

I don’t know that it will be any faster to run, but it surely should be much easier to write!

molten elk Feb 24, 2023, 10:06 AM

#

molten elk I don’t know that it will be any faster to run, but it surely should be much eas...

The secured interpreter environments already don’t allow ctypes, cffi, pywin32, NumPy, &c. so only true pure-Python approaches (like bytecode or /proc/self) will likely work.

That minimizes the distinction between a pure Python library that uses ctypes for patching and one which just writes a C-extension module.

warm breach Feb 24, 2023, 10:10 AM

#

molten elk The secured interpreter environments already don’t allow ctypes, cffi, pywin32, ...

I agree they're not distinct security wise, yeah

grave jolt Feb 24, 2023, 10:31 AM

#

I assume you can do a lot of cursed stuff with just patching bytecode of code objects

warm breach Feb 24, 2023, 10:36 AM

#

grave jolt I assume you can do a lot of cursed stuff with just patching bytecode of code ob...

!e don't really even need to mess with bytecode 🥴

import gc

class Cursed:
    def __length_hint__(self):
        return 1
    
    def __iter__(self):
        for obj in gc.get_objects():
            if isinstance(obj, tuple):
                try:
                    0 in obj
                except SystemError:
                    yield obj
                    break
                
self_tuple = tuple(Cursed())
print(self_tuple)

fallen slateBOT Feb 24, 2023, 10:36 AM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

((...),)

grave jolt Feb 24, 2023, 10:42 AM

#

Oof

#

How

#

Wh

#

Actually, can you do something like this without using any imports, exec or eval?

warm breach Feb 24, 2023, 10:45 AM

#

!e well there's this without any imports

def getdict(cls, x=type('',(),{'__eq__':lambda s,o:o})()):
    return cls.__dict__ == x

getdict(list)["wtf"] = "???"

print([].wtf)

fallen slateBOT Feb 24, 2023, 10:45 AM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

???

warm breach Feb 24, 2023, 10:45 AM

#

writing to the type dict of immutable types

#

supposedly this is not a bug (discovered by chilaxan)

prime estuary Feb 24, 2023, 10:47 AM

#

It's a bug, but not something easy/performant to solve.

rose schooner Feb 24, 2023, 10:48 AM

#

warm breach !e well there's this without any imports ```py def getdict(cls, x=type('',(),{'_...

i would write a TIL about comparison mms and the .__bool__() mm but i still have a 1h cooldown

prime estuary Feb 24, 2023, 10:48 AM

#

Problem is that mappingproxy's eq method forwards the call onto the internal dict, thus exposing it. But to solve that you'd need to do a whole new equality method, test it, etc...

warm breach Feb 24, 2023, 10:50 AM

#

prime estuary Problem is that mappingproxy's eq method forwards the call onto the internal dic...

yeah it's been closed as won't fix after some failed attempts

#

https://github.com/python/cpython/issues/88004#issuecomment-1093910942

GitHub

There is a way to access an underlying mapping in MappingProxyType ...

BPO 43838 Nosy @gvanrossum, @rhettinger, @ncoghlan, @serhiy-storchaka, @brandtbucher, @domdfcoding PRs #27300 Note: these values reflect the state of the issue at the time it was migrated and might...

rose schooner Feb 24, 2023, 10:51 AM

#

warm breach https://github.com/python/cpython/issues/88004#issuecomment-1093910942

seems to have included all of the core devs

prime estuary Feb 24, 2023, 10:52 AM

#

Yeah the problem is if both are proxy objects, it gets hairy and hard to solve without an expensive copy of either mapping,

warm breach Feb 24, 2023, 10:52 AM

#

also mapping proxy requires GC due to potential recursive references

#

so regardless you can expose the linkage with get_referrers

prime estuary Feb 24, 2023, 10:54 AM

#

Basically, if you try hard enough you can get through protection, it's there to stop you accidentally doing the wrong thing.

warm breach Feb 24, 2023, 10:54 AM

#

pretty much the only thing the interpreter actually prevents you doing is modifying types marked Py_TPFLAGS_IMMUTABLETYPE

#

which isn't something you can elect from python either, so things like frozen dataclass are easily mutable with object.__setattr__
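
For example, a frozen dataclass blocks normal assignment but not a direct `object.__setattr__` call. This is a minimal sketch; `Point` is just an illustrative class.

```python
from dataclasses import FrozenInstanceError, dataclass


@dataclass(frozen=True)
class Point:
    x: int
    y: int


p = Point(1, 2)

try:
    p.x = 10  # blocked by the __setattr__ that the dataclass generates
except FrozenInstanceError:
    blocked = True
else:
    blocked = False

# Calling object.__setattr__ directly skips the dataclass-generated guard.
object.__setattr__(p, "x", 10)
```

So `frozen=True` guards against accidental mutation, not against code that deliberately goes around it.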

rose schooner Feb 24, 2023, 10:57 AM

#

prime estuary Yeah the problem is if both are proxy objects, it gets hairy and hard to solve w...

what about checking for PyDict_CheckExact() or PyMappingProxy_Check() for both operands otherwise delegating to just passing the proxy itself to the comparison

prime estuary Feb 24, 2023, 10:59 AM

#

Well mapping-proxy works with any mapping, not just dicts. The problem is if both are proxies, you have to somehow implement the operator (including handling NotImplemented, the subclass exception, etc) without ever exposing either object to the other one.

grave jolt Feb 24, 2023, 10:59 AM

#

warm breach !e well there's this without any imports ```py def getdict(cls, x=type('',(),{'_...

ok I understand how this works, and this is 🅱️eyond cursed

warm breach Feb 24, 2023, 11:03 AM

#

!e optionally you can do the same thing with a Structure memory view and access mapping directly

from einspect import view

view(list.__dict__).mapping["wtf"] = "???"

print([].wtf)

fallen slateBOT Feb 24, 2023, 11:03 AM

#

@warm breach :white_check_mark: Your 3.11 eval job has completed with return code 0.

???

warm breach Feb 24, 2023, 11:04 AM

#

not sure if more or less cursed 😔

rose schooner Feb 24, 2023, 11:16 AM

#

prime estuary Well mapping-proxy works with any mapping, not just dicts. The problem is if bot...

i was thinking something like this but yeah the "any mapping" thing would be a problem
```c
static PyObject *
mappingproxy_richcompare(mappingproxyobject *v, PyObject *w, int op)
{
    /* v is already guaranteed to be a mappingproxy */
    if (PyDict_CheckExact(w)) {
        return PyObject_RichCompare(v->mapping, w, op);
    }
    if (PyObject_TypeCheck(w, &PyDictProxy_Type)) {
        /* w is a mappingproxy too, so compare the two wrapped mappings */
        return PyObject_RichCompare(v->mapping,
                                    ((mappingproxyobject *)w)->mapping, op);
    }
    return PyObject_RichCompare((PyObject *)v, w, op);
}
```

warm breach Feb 24, 2023, 11:26 AM

#

rose schooner i was thinking something like this but yeah the "any mapping" thing would be a p...

might not be too crazy just to reimplement comparisons manually with GetItem

#

though that's technically a behavior change

rose schooner Feb 24, 2023, 11:30 AM

#

warm breach might not be too crazy just to reimplement comparisons manually with GetItem

actually we might be able to do this for an "any mapping" implementation
```c
static PyObject *
mappingproxy_richcompare(mappingproxyobject *v, PyObject *w, int op)
{
    if (PyDict_Check(w)) {
        /* call dict's own comparison slot directly */
        return PyDict_Type.tp_richcompare(v->mapping, w, op);
    }
    if (PyObject_TypeCheck(w, &PyDictProxy_Type)) {
        return PyObject_RichCompare(v->mapping,
                                    ((mappingproxyobject *)w)->mapping, op);
    }
    return PyObject_RichCompare((PyObject *)v, w, op);
}
```

#

just directly use dict_richcompare

warm breach Feb 24, 2023, 11:30 AM

#

doesn't that still expose mapping

rose schooner Feb 24, 2023, 11:31 AM

#

warm breach doesn't that still expose mapping

eh well this time it's like loading globals
it just directly does that from dict

warm breach Feb 24, 2023, 11:31 AM

#

also

#

isn't that last line an infinite recursion

#

it'll end up calling this slot

rose schooner Feb 24, 2023, 11:32 AM

#

actually yes

#

it should be Py_RETURN_NOTIMPLEMENTED;
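A rough Python-level analogue of that fix, as a sketch only: returning `NotImplemented` from a comparison method tells the interpreter to try the other operand's reflected method, and for `==` to fall back to identity if both sides decline. Calling the generic comparison again from inside the method would just re-enter the same method. `Proxy` and `AlwaysEqual` are hypothetical classes, not CPython's implementation.

```python
class Proxy:
    """Hypothetical stand-in for a proxy type; not the real mappingproxy."""

    def __init__(self, mapping):
        self._mapping = mapping

    def __eq__(self, other):
        if isinstance(other, Proxy):
            return self._mapping == other._mapping
        if isinstance(other, dict):
            return self._mapping == other
        # Decline instead of recursing into `self == other` again.
        return NotImplemented


class AlwaysEqual:
    """Hypothetical type whose reflected __eq__ accepts anything."""

    def __eq__(self, other):
        return True


same = Proxy({1: 2}) == {1: 2}          # handled by Proxy.__eq__
declined = Proxy({}) == 5               # both sides decline, identity fallback
reflected = Proxy({}) == AlwaysEqual()  # falls through to AlwaysEqual.__eq__
print(same, declined, reflected)
```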

rose schooner Feb 24, 2023, 11:32 AM

#

rose schooner eh well this time it's like loading globals it just directly does that from `dic...

!e ```py
class B(dict):
    def __eq__(self, other):
        print('here')
        return False

print(dict.__eq__({1: 2}, B({1: 2})))
```

fallen slateBOT Feb 24, 2023, 11:32 AM

#

@rose schooner ✅ Your 3.11 eval job has completed with return code 0.

True

rose schooner Feb 24, 2023, 11:32 AM

#

like that

#

```c
static PyObject *
mappingproxy_richcompare(mappingproxyobject *v, PyObject *w, int op)
{
    if (PyDict_Check(w)) {
        return PyDict_Type.tp_richcompare(v->mapping, w, op);
    }
    if (PyObject_TypeCheck(w, &PyDictProxy_Type)) {
        /* w is a PyObject *, so cast before reaching into the proxy struct */
        return PyObject_RichCompare(v->mapping, ((mappingproxyobject *)w)->mapping, op);
    }
    Py_RETURN_NOTIMPLEMENTED;
}
``` also fixed

warm breach Feb 24, 2023, 11:34 AM

#

guess it would be fine if accepting that comparisons will be false for any dict subtypes or other custom mappings

#

not sure how many code usages in the wild depend on mapping proxies passing through custom __eq__ for innocent reasons

#

that being said

prime estuary Feb 24, 2023, 11:36 AM

#

It's documented and used with any mapping, so it has to support them.

warm breach Feb 24, 2023, 11:37 AM

#

!d types.MappingProxyType

fallen slateBOT Feb 24, 2023, 11:37 AM

#

types.MappingProxyType


```py
class types.MappingProxyType(mapping)
```
Read-only proxy of a mapping. It provides a dynamic view on the mapping’s entries, which means that when the mapping changes, the view reflects these changes.

New in version 3.3.

Changed in version 3.9: Updated to support the new union (`|`) operator from [**PEP 584**](https://peps.python.org/pep-0584/), which simply delegates to the underlying mapping.

warm breach Feb 24, 2023, 11:37 AM

#

first line does kind of say "read only" here 😔

#

though I guess that just means getitem and not eq?

#

but that's kind of a weird distinction
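To make the "read-only" wording concrete, here is a small sketch of what the proxy blocks and what it forwards. The variable names are only for illustration.

```python
from types import MappingProxyType

data = {"a": 1}
view = MappingProxyType(data)

# Writes through the proxy are rejected with TypeError...
try:
    view["b"] = 2
    write_allowed = True
except TypeError:
    write_allowed = False

# ...but the view is dynamic: changes to the wrapped mapping show through.
data["b"] = 2
sees_update = view["b"] == 2

# Comparison and the 3.9+ union operator are forwarded to the wrapped
# mapping, so `|` hands back a plain dict rather than another proxy.
equal_to_dict = view == {"a": 1, "b": 2}
merged = view | {"c": 3}

print(write_allowed, sees_update, equal_to_dict, type(merged).__name__)
```

So "read-only" covers item assignment and deletion on the proxy itself, while operators are simply delegated.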

rose schooner Feb 24, 2023, 11:39 AM

#

rose schooner ```c static PyObject * mappingproxy_richcompare(mappingproxyobject *v, PyObject ...

implementations that don't subclass dict but need to compare have these 2 alternatives ```py
def __eq__(self: Self, other: MappingProxyType) -> bool:
    return convert_to_dict_or_dict_subtype(self) == other

# or

def __eq__(self: Self, other: MappingProxyType) -> bool:
    for key, value in other.items():
        # manually compare, early return False assumed here
        ...
    return True
```

warm breach Feb 24, 2023, 11:41 AM

#

rose schooner implementations that don't subclass `dict` but need to compare have these 2 alte...

could just make a PyMapping_RichCompare

#

Mapping types already guarantee all the methods you need to compare

#

it'll just be slower since you need to call python functions

#

but otherwise just do everything dict does but with python calls
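A Python-level sketch of what such a generic comparison could look like, using only what the `Mapping` ABC guarantees (`__len__`, `__iter__`, `__getitem__`). `mapping_eq` is a hypothetical name, not an existing CPython API, and it only roughly mirrors how dict compares.

```python
from collections.abc import Mapping
from types import MappingProxyType


def mapping_eq(a, b):
    """Hypothetical generic equality for two arbitrary mappings."""
    if not (isinstance(a, Mapping) and isinstance(b, Mapping)):
        return NotImplemented
    if len(a) != len(b):
        return False
    for key in a:
        try:
            theirs = b[key]
        except KeyError:
            return False
        ours = a[key]
        # Identity shortcut first, then equality, similar to dict.
        if not (ours is theirs or ours == theirs):
            return False
    return True


print(mapping_eq(MappingProxyType({1: 2}), {1: 2}))
print(mapping_eq({1: 2}, {1: 3}))
```

Neither mapping is handed to the other one's `__eq__`, but keys and values still run their own `__hash__` and `__eq__`, so arbitrary Python code can still execute during the comparison.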

rose schooner Feb 24, 2023, 11:43 AM

#

warm breach could just make a PyMapping_RichCompare

i don't get how that would work

#

oh

#

general Mapping compare

rose schooner Feb 24, 2023, 11:44 AM

#

warm breach could just make a PyMapping_RichCompare

wouldn't that just be PyDict_Type.tp_richcompare

#

unless there are mappings that aren't dict (subclasses)

pliant tusk Feb 24, 2023, 3:30 PM

#

molten elk The secured interpreter environments already don’t allow ctypes, cffi, pywin32, ...

there are secured interpreter environments?

molten elk Feb 24, 2023, 3:32 PM

#

pliant tusk there are secured interpreter environments?

There are secured interpreter environments.

pliant tusk Feb 24, 2023, 3:33 PM

#

can you direct me to implementations

#

*ones that don't rely on sandboxing the interpreter, but only restrict python itself, since sandboxed interpreters can allow ctypes-esque stuff

molten elk Feb 24, 2023, 3:42 PM

#

pliant tusk *ones that don't rely on sandboxing the interpreter, but only restrict python it...

The secured interpreters are not general purpose tools for public use.

They are unlikely to include parts of the standard library like ctypes or include major third parties libraries like pywin32 or numpy or cffi. They won't provide you with any way to install new packages. And, even if you could install packages, they'll probably require code-signing of shared objects and Python .py files. (They'll probably disable bytecode cache.) They'll probably disable -c and -m modes and may even force -S. They may have some additional hardening for sys.meta_path and sys.path_hooks. They might run code execution through anti-malware pattern matching. They'll probably use PEP-578 audit hooks to log everything the interpreter does.
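For readers who have not used them, a minimal sketch of a PEP 578 audit hook, assuming Python 3.8+. The `WATCHED` set and the `audit` function are illustrative names; a real deployment would more likely install hooks from C before the interpreter starts and ship events to a log.

```python
import os
import sys

WATCHED = {"compile", "exec", "open"}  # illustrative subset of audit events
seen = []


def audit(event, args):
    # Hooks are called for every audit event in the process and cannot be
    # removed from Python code once added.
    if event in WATCHED:
        seen.append(event)


sys.addaudithook(audit)

code = compile("x = 1 + 1", "<sketch>", "exec")  # raises a "compile" event
exec(code, {})                                   # raises an "exec" event
with open(os.devnull) as f:                      # raises an "open" event
    pass

print(sorted(set(seen)))
```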

pliant tusk Feb 24, 2023, 3:48 PM

#

I am able to get full process memory r/w with no imports, no bytecode cache, and easily obfuscatable code. the only audit hooks that are called are compile and exec, I don't think the above would be enough, (still works with -S)

molten elk Feb 24, 2023, 3:50 PM

#

pliant tusk I am able to get full process memory r/w with no imports, no bytecode cache, and...

> I am able to get full process memory r/w with no imports, no bytecode cache, and easily obfuscatable code.

Sure, me, too…

with open('/proc/self/mem', 'r+b') as f:
    pass

pliant tusk Feb 24, 2023, 3:50 PM

#

it does not use /proc (it works cross platform)

#

no open audit event

molten elk Feb 24, 2023, 3:50 PM

#

molten elk > I am able to get full process memory r/w with no imports, no bytecode cache, a...

But most of this stuff is hardening. The code signing is the core of it.

pliant tusk Feb 24, 2023, 3:51 PM

#

molten elk But most of this stuff is hardening. The code signing is the core of it.

you can do a lot with just the functions compiled into the binary, and python has lots of ways to control code flow that do not involve running your own assembly code.

#

for example, with full process mem r/w you could disable audit hooks, and probably most if not all of the security harness that you are proposing, and then just use python to do your post exploitation, which would bypass all hypothetical code signing

pliant tusk Feb 24, 2023, 3:54 PM

#

molten elk There are secured interpreter environments.

fyi, the reason i was asking here was because I wanted to see if real-world implementations fell into the pitfalls that I assume they will fall into

molten elk Feb 24, 2023, 3:54 PM

#

pliant tusk for example, with full process mem r/w you could disable audit hooks, and probab...

How do you get your payload into the running interpreter?

pliant tusk Feb 24, 2023, 3:55 PM

#

copy + paste ? I assumed that the purpose of a hypothetical hardened interpreter is running untrusted code

molten elk Feb 24, 2023, 3:56 PM

#

pliant tusk copy + paste ? I assumed that the purpose of a hypothetical hardened interpreter...

There's no interactive console.

pliant tusk Feb 24, 2023, 3:57 PM

#

if the hypothetical hardened interpreter is in some way `eval`ing untrusted code then that does not matter

#

if it isnt, then why bother using a hardened interpreter

molten elk Feb 24, 2023, 3:58 PM

#

pliant tusk if the hypothetical hardened interpreter is by some way `eval`ing untrusted code...

The whole point is to prevent execution of code that has not already been signed.

pliant tusk Feb 24, 2023, 3:58 PM

#

whats the point then?

#

if it is closed-system -> closed-secure-interpreter -> closed-system then why bother using a secure interpreter

#

if all of your input is fully trusted then it does not matter

pliant tusk Feb 24, 2023, 3:59 PM

#

molten elk The whole point is to prevent execution of code that has not already been signed...

I figured the purpose of a secure interpreter would be to run untrusted python code

#

(most devs running across an implementation would also likely assume that)

molten elk Feb 24, 2023, 4:02 PM

#

pliant tusk I figured the purpose of a secure interpreter would be to run untrusted python c...

No, securing environments that run untrusted code is usually handled differently, using lightweight VMs or containers or similar.

I am describing environments where you want to prevent the running of untrusted code.

molten elk Feb 24, 2023, 4:03 PM

#

molten elk No, securing environments that run untrusted code is usually handled differently...

These would be environments like BMCs or single application containers or hypervisor hosts.

pliant tusk Feb 24, 2023, 4:03 PM

#

molten elk These would be environments like BMCs or single application containers or hyperv...

yea i know that is the best way to run untrusted code

#

but i am having trouble wrapping my head around an environment where you want to fully prevent untrusted python code, but also do not have any method for running untrusted python code. It feels redundant

molten elk Feb 24, 2023, 4:05 PM

#

pliant tusk but i am having trouble wrapping my head around an environment where you want to...

I think a good example is a BMC, which often has a fully-featured software stack these days. REST APIs and all sorts of bells and whistles.

pliant tusk Feb 24, 2023, 4:06 PM

#

molten elk I think a good example is a BMC, which often have fully-featured software stacks...

my point is how would an attacker get to a point where they can try to run untrusted code if you do not explicitly make an endpoint for it

#

the only hypothetical system i can think of would involve the following weird cases: the ability to drop arbitrary files with arbitrary file endings -> the ability to perform arbitrary imports

molten elk Feb 24, 2023, 4:07 PM

#

pliant tusk my point is how would an attacker get to a point where they can try to run untru...

By some means, they have managed to get console access into the BMC. You want to then prevent them from bringing a payload with them to further their access (and these payloads are often written in Python.) You can, of course, not deploy Python, but then you can't write BMC tooling that uses Python.

pliant tusk Feb 24, 2023, 4:08 PM

#

would they not have much higher access with console access to the BMC (and if they don't then what is the console for)

#

And if your point is that Console access would not be able to run untrusted python code in this case, then why have it (the console) at all? Presumably, it would take input and then respond "refused to run untrusted code"

#

it would not be super useful, so if that is your threat model, just remove it

#

but if you need console access as the dev or trusted user, then how can you implement that sort of hardening without also hampering the console into unusability?

#

in that case, I would focus on securing console access, not what can be done in the console

molten elk Feb 24, 2023, 4:11 PM

#

pliant tusk in that case, I would focus on securing console access, not what can be done in ...

Well, there's a lot of people doing that part, too.

pliant tusk Feb 24, 2023, 4:12 PM

#

molten elk Well, there's a lot of people doing that part, too.

probably because it makes more sense than neutering the console

#

I'm still confused about what a real world use case for that sort of tooling would be

molten elk Feb 24, 2023, 4:13 PM

#

pliant tusk I'm still confused about what a real world use case for that sort of tooling wou...

The audit hooks PEPs (551 and 578) go into a little bit of detail behind the motivation.

molten elk Feb 24, 2023, 4:15 PM

#

molten elk The audit hooks PEPs (551 and 578) go into a little bit of detail behind the mot...

In truth, Guido asked me very similar questions when these PEPs were pending approval, but it turns out that there are environments and situations where this is useful as one among many other lines-of-defence.

molten elk Feb 24, 2023, 4:17 PM

#

molten elk In truth, Guido asked me very similar questions when these PEPs were pending app...

It took the better part of an hour over lunch to convince him, too. The BMC or hypervisor host use-cases are probably pretty niche, but I think the single-application container use-case is broadly relevant.

pliant tusk Feb 24, 2023, 4:19 PM

#

I am still having trouble envisioning any of those environments where an attack would reach this point without a glaring vulnerability (which would likely be required for some feature, which these mitigations would disable)

molten elk Feb 24, 2023, 4:20 PM

#

molten elk It took the better part of an hour over lunch to convince him, too. The BMC or h...

Niche, as in, regular people don't usually care about this, but not as in “there are not billions of dollars of computing machinery doing this.”

pliant tusk Feb 24, 2023, 4:20 PM

#

molten elk Niche, as in, regular people don't usually care about this, but not as in “there...

yea i understood that

molten elk Feb 24, 2023, 4:20 PM

#

molten elk Niche, as in, regular people don't usually care about this, but not as in “there...

In terms of actual scope, BMC is probably the widest. After all, think about all the BMCs out in the cloud.

pliant tusk Feb 24, 2023, 4:21 PM

#

molten elk In terms of actual scope, BMC is probably the widest. After all, think about all...

are you referring to a system with no OS or a non-virtual system?

#

since Bare Metal Computing can mean either

molten elk Feb 24, 2023, 4:22 PM

#

pliant tusk are you referring to a system with no OS or a non-virtual system?

Baseboard Management Controllers.

pliant tusk Feb 24, 2023, 4:22 PM

#

ah google did not know that acronym at all

molten elk Feb 24, 2023, 4:22 PM

#

https://en.wikipedia.org/wiki/OpenBMC

OpenBMC

The OpenBMC project is a Linux Foundation collaborative open-source project whose goal is to produce an open source implementation of the Baseboard Management Controllers (BMC) Firmware Stack. OpenBMC is a Linux distribution for BMCs meant to work across heterogeneous systems that include enterprise, high-performance computing (HPC), telecommuni...

pliant tusk Feb 24, 2023, 4:24 PM

#

from what I am reading there, those are essentially embedded devices, and speed seems important. I cannot see a situation where you would want to use python

#

and if you did, you would likely use something like circuitpython which would still likely be too slow

molten elk Feb 24, 2023, 4:28 PM

#

pliant tusk from what I am reading there, those are essentially embedded devices, and speed ...

These things can easily have ≥1GiB of RAM.

#

They aren't small machines.

pliant tusk Feb 24, 2023, 4:28 PM

#

writing code in C or any other directly compilable language would still run vastly faster

#

also wouldn't clock speed have more influence on speed than RAM?

flat gazelle Feb 24, 2023, 4:31 PM

#

Huh, how come just wasting CPU cycles with useless computation isn't a threat?

#

Or RAM IG

molten elk Feb 24, 2023, 4:35 PM

#

pliant tusk writing code in `C` or any other directly compilable language would still run va...

Hmm, I ran this with hyperfine, and I guess the C code is faster?

#include <unistd.h>

int main(int argc, char *argv[]) {
  /* execv requires the argument vector to be NULL-terminated */
  char *args[] = {"/usr/bin/systemctl", "reboot", "-i", NULL};
  execv(args[0], args);
}

from subprocess import run
run('/usr/bin/systemctl reboot -i'.split())

#

I suppose the things you would want to do with Python in these environments may not benefit from the relative efficiency of C over Python.

pliant tusk Feb 24, 2023, 4:39 PM

#

that would make sense that the C code is faster, the python code is running thousands more lines of code and several dynamic allocations, the C code doesn't allocate any memory dynamically, but both are calling out to a C program systemctl

#

a better example would be like a hashing system or some algorithm implementation

molten elk Feb 24, 2023, 4:40 PM

#

pliant tusk a better example would be like a hashing system or some algorithm implementation

I imagine that the kind of code that someone might want to use a scripting language like Python for on a BMC or a hypervisor host might not often need one to find cycles in a doubly-linked list. Maybe dynamic programming might not even come up at all.

molten elk Feb 24, 2023, 4:41 PM

#

molten elk I imagine that the kind of code that someone might want to use a scripting langu...

I genuinely wouldn't know, because I've only ever done the work on the interpreter side of that. I've not actually written the code that runs within the interpreter.

pliant tusk Feb 24, 2023, 4:42 PM

#

fair enough

pliant tusk Feb 24, 2023, 4:42 PM

#

molten elk I imagine that the kind of code that someone might want to use a scripting langu...

I would imagine that python would not even be considered for this kind of thing until subinterpreters and multithreaded python works better

molten elk Feb 24, 2023, 4:44 PM

#

pliant tusk I would imagine that python would not even be considered for this kind of thing ...

What do you need subinterpreters for if you're, like, power-cycling or, like, calibrating optics?

pliant tusk Feb 24, 2023, 4:45 PM

#

i would assume that a system like that would need to do different things on different threads simultaneously

molten elk Feb 24, 2023, 4:45 PM

#

pliant tusk i would assume that a system like that would need to do different things on diff...

A BMC? BMCs are management controllers. Other than maybe data collection, I think they sit mostly idle.

#

I think the idea of using Python in that particular environment is the same reason to run Linux on those devices which is the same reason to stick so much RAM and CPU into those devices. It saves a lot of money in human effort, despite being computationally wasteful.

pliant tusk Feb 24, 2023, 4:47 PM

#

wouldn't that data collection require live monitoring of multiple systems?

#

imo, in reading about those systems, it seems like they would want to squeeze out efficiency

molten elk Feb 24, 2023, 4:52 PM

#

pliant tusk wouldn't that data collection require live monitoring of multiple systems?

So, the way I imagine it, like, you have a hypervisor host, which is the server that runs all the virtual machines for your clients. And that host is running on an actual physical machine running in some datacenter somewhere. And since you live in Palo Alto and the data center is in Nevada, you can't quite get up from your desk and walk over and power cycle the machine when it gets stuck, right? So you need an out-of-band controller connected to the physical hardware. So I would imagine, but I couldn't say for certain, that probably those devices are mostly idle.

pliant tusk Feb 24, 2023, 4:54 PM

#

But what you're describing still sounds like a system that you would want to secure externally

molten elk Feb 24, 2023, 4:56 PM

#

molten elk So, that way I imagine it, like, you have a hypervisor host, which is the server...

And the tasks you might want to do on those machines are monitoring, management, and remediation tasks. So the machine is probably going to do a lot of work that might require moderately ad hoc scripting. It might be really helpful to be able to run Python scripts to do those tasks. But you probably don't want those devices to be able to run arbitrary Python scripts.

pliant tusk Feb 24, 2023, 4:57 PM

#

Wouldn't ad hoc scripting be arbitrary python scripts?

molten elk Feb 24, 2023, 4:58 PM

#

pliant tusk Wouldn't ad hoc scripting be arbitrary python scripts?

Not necessarily, because these environments may be more heterogeneous in practice than you expect. So the axis across which these would be ad hoc is not per task but per device or per deployment cycle.

molten elk Feb 24, 2023, 5:00 PM

#

molten elk Not necessarily, because these environments may be more heterogeneous in practic...

Device of model XYZ with firmware 1.2.3 in data center ABC2 needs something slightly different.

pliant tusk Feb 24, 2023, 5:00 PM

#

It just feels unnecessary to add to a system that should 100% be secured externally

#

And if it's secured externally, you can just put normal python on there. It doesn't need to be some secured version.

sour thicket Feb 24, 2023, 7:57 PM

#

Hi, anyone know any cool projects to do? or any github repo with cool projects, something like that?

#

a beginner to intermediate level

warm breach Feb 25, 2023, 1:59 AM

#

@feral island (3.10) do I just do PyObject *iter = _PyEval_GetBuiltinId(&PyId_iter); here, it works the same way as the new _PyEval_GetBuiltin?

static PyObject *
bytearrayiter_reduce(bytesiterobject *it, PyObject *Py_UNUSED(ignored))
{
<<<<<<< HEAD
    _Py_IDENTIFIER(iter);
    if (it->it_seq != NULL) {
        return Py_BuildValue("N(O)n", _PyEval_GetBuiltinId(&PyId_iter),
                             it->it_seq, it->it_index);
    } else {
        return Py_BuildValue("N(())", _PyEval_GetBuiltinId(&PyId_iter));
=======
    PyObject *iter = _PyEval_GetBuiltin(&_Py_ID(iter));

    /* _PyEval_GetBuiltin can invoke arbitrary code,
     * call must be before access of iterator pointers.
     * see issue #101765 */

    if (it->it_seq != NULL) {
        return Py_BuildValue("N(O)n", iter, it->it_seq, it->it_index);
    } else {
        return Py_BuildValue("N(())", iter);
>>>>>>> 54dfa14c5a (gh-101765: Fix SystemError / segmentation fault in iter `__reduce__` when internal access of `builtins.__dict__` exhausts the iterator (#101769))
    }
}

feral island Feb 25, 2023, 3:32 AM

#

warm breach @feral island (3.10) do I just do `PyObject *iter = _PyEval_GetBuiltinId...

yes I think so

warm breach Feb 25, 2023, 3:33 AM

#

yeah tests seem fine so far python/cpython#102229 python/cpython#102228

neon troutBOT Feb 25, 2023, 3:33 AM

#

GitHub

PR Open [cpython] #102229 [3.10] gh-101765: Fix SystemError / segmentation fault in iter __reduce__ when internal access of builtins.__dict__ exhausts the iterator (GH-101769)
PR Open [cpython] #102228 [3.11] gh-101765: Fix SystemError / segmentation fault in iter __reduce__ when internal access of builtins.__dict__ exhausts the iterator (GH-101769)

gray galleon Feb 26, 2023, 1:34 AM

#

is python string indexing an O(n) operation (n is the index)
UTF-8 is a variable length encoding so it should be the case
or is it converted into a fixed length form

#

or does it create a lookup table for each character in the string

raven ridge Feb 26, 2023, 1:36 AM

#

gray galleon is python string indexing an O(n) operation (n is the index) UTF-8 is a variable...

no, it's constant time

raven ridge Feb 26, 2023, 1:37 AM

#

gray galleon or does it create a lookup table for each character in the string

no, internally the string holds an array of codepoints, and a slice just extracts the codepoints at the given offset(s)

gray galleon Feb 26, 2023, 1:40 AM

#

so 'ab' is represented like this?```
61 00 00 00 62 00 00 00

raven ridge Feb 26, 2023, 1:42 AM

#

it uses the minimum integer size that will fit all of the codepoints for the array

#

so no, that's represented as 61 62

#

but if you add a codepoint with a value above 255, you'd get padding 0's on the 61 and 62
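One way to see this from Python, as a CPython-specific sketch (the flexible string representation from PEP 393): measure how much `sys.getsizeof` grows per extra character. `bytes_per_char` is a hypothetical helper, and other implementations may store strings differently.

```python
import sys


def bytes_per_char(ch, n=100):
    """Extra bytes CPython uses for one more copy of `ch` in a string."""
    return sys.getsizeof(ch * (n + 1)) - sys.getsizeof(ch * n)


ascii_width = bytes_per_char("a")    # code point fits in 1 byte
bmp_width = bytes_per_char("Ω")      # U+03A9 needs 2 bytes per character
astral_width = bytes_per_char("😀")  # U+1F600 needs 4 bytes per character

# One wide character widens the storage of every character in the string.
mixed_growth = sys.getsizeof("a" * 100 + "😀") - sys.getsizeof("😀")

print(ascii_width, bmp_width, astral_width, mixed_growth)
```

Because every character in a given string has the same width, indexing is a direct offset calculation rather than a scan.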

gray galleon Feb 26, 2023, 1:54 AM

#

so adding a character outside bmp in a long ascii string is an expensive operation bc each character needs to be coerced from 8 bit to 32 bit

raven ridge Feb 26, 2023, 2:01 AM

#

gray galleon so adding a character outside bmp in a long ascii string is an expensive operati...

hm - no, it's not expensive. String concatenation is O(N + M), where N and M are the lengths of the two strings being concatenated - regardless of whether the characters in the string are ASCII or outside the BMP

#

remember that Python strings are immutable, so every concatenation creates a new string, and needs to copy over every character from each of the original two strings.

lone sun Feb 26, 2023, 2:30 AM

#

raven ridge Feb 26, 2023, 2:35 AM

#

ooh, I didn't know there was a PEP for that. TIL.

subtle phoenix Feb 26, 2023, 3:04 AM

#

Unicode geek here saying that this is so cool. The calculations for which length to use for storage are quick, and the savings are huge. I bow to whoever thought that up.

gray galleon Feb 26, 2023, 9:00 AM

#

https://glot.io/snippets/gil7ukq17o

Untitled

import time

def time_code(f, times=1000000):
    start = time.perf_counter()
    for _ in range(times):
        f()
    end = time.perf_counter()
    time_ = end - start
    return time_

tup = ('f

#

why are my timing results so inconsistent
sometimes the dict indexing version uses more time, sometimes it uses less

#

it is more often that tuple indexing wins, but still
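A sketch of a less noisy way to measure this, using `timeit.repeat` and taking the minimum of several runs, which the `timeit` documentation suggests because the higher values mostly reflect interference from other processes. The data and function names below are placeholders, since the original snippet is cut off.

```python
import timeit

tup = ("first", "second", "third")            # placeholder data
dct = {0: "first", 1: "second", 2: "third"}   # placeholder data


def index_tuple():
    return tup[1]


def index_dict():
    return dct[1]


def best_time(func, number=100_000, repeat=5):
    """Fastest of `repeat` runs, each calling `func` `number` times."""
    return min(timeit.repeat(func, number=number, repeat=repeat))


tuple_time = best_time(index_tuple)
dict_time = best_time(index_dict)
print(f"tuple: {tuple_time:.4f}s  dict: {dict_time:.4f}s")
```

With operations this cheap, most of the measured time is function-call overhead, so small differences between the two are expected to flip from run to run.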

cyan raven Feb 26, 2023, 11:52 AM

#

I'm not sure this is a good channel to ask, but I'll try.
so I have forked/cloned the cpython repository and installed the python native development environment thru Visual Studio.
How can I get the newly recompiled python after the build (PCbuild/build.bat)? It's not changing, so I can't test it out.
