#internals-and-peps | Python | Page 94

raven ridge Feb 17, 2021, 3:08 AM

#

So someone has passed me T("double", "1.0") and I need to a) parse it as a double, and b) do something with it.

bronze acorn Feb 17, 2021, 3:09 AM

#

Hmmm

#

Sorry, never tried to do that in c++

raven ridge Feb 17, 2021, 3:09 AM

#

it's a complete mess of switches and templates, heh

bronze acorn Feb 17, 2021, 3:09 AM

#

But there is c++ server that can help, i can dm if you want

#

Not my server

#

In case you think it is advertising

raven ridge Feb 17, 2021, 3:10 AM

#

I'm a professional C++ dev. It's not a matter of me not knowing the language well enough, it's a matter of the language being insufficiently expressive for what I want to do.

bronze acorn Feb 17, 2021, 3:10 AM

#

Yeah i mean, maybe there is a solution though

raven ridge Feb 17, 2021, 3:10 AM

#

(and I could cheat and use macros, if I thought it'd pass code review, heh)

visual shadow Feb 17, 2021, 3:11 AM

#

perfect. macros + esoteric obfuscation then, and we have a winner

raven ridge Feb 17, 2021, 3:11 AM

#

macros could do it, or type erasure

#

but if I go the type erasure route, I lose performance.

gleaming rover Feb 17, 2021, 3:11 AM

#

rewrite the whole project in JS

#

compilation is overrated

visual shadow Feb 17, 2021, 3:12 AM

#

btw, hows js in terms of speed vs python? similar i assume?

raven ridge Feb 17, 2021, 3:12 AM

#

substantially faster, actually

gleaming rover Feb 17, 2021, 3:12 AM

#

visual shadow btw, hows js in terms of speed vs python? similar i assume?

I believe JS is faster

#

assuming you don’t include extension modules

#

because of highly optimised JIT

visual shadow Feb 17, 2021, 3:13 AM

#

what makes js faster?

raven ridge Feb 17, 2021, 3:13 AM

#

but that said, JS is probably a good benchmark for how fast you could possibly get Python to be through just a JIT.

gleaming rover Feb 17, 2021, 3:13 AM

#

and AFAIK a big part of that is that JS has had to run in browsers

visual shadow Feb 17, 2021, 3:13 AM

#

i see, okay thats pretty interesting

raven ridge Feb 17, 2021, 3:13 AM

#

and had a ton of corporate investment into making it fast.

gleaming rover Feb 17, 2021, 3:13 AM

#

yeah

#

whereas in Python you can just write the compute intensive parts in another language

#

=> no incentive to throw $$/effort at the problem

visual shadow Feb 17, 2021, 3:14 AM

#

js is also dynamically typed and has objects for datatypes and all that jazz, yeah?

#

or does it have primitives

gleaming rover Feb 17, 2021, 3:15 AM

#

visual shadow or does it have primitives

it kind of does

#

but not really either

raven ridge Feb 17, 2021, 3:15 AM

#

it is dynamically typed - but also intrinsically single threaded, which removes a lot of complexity

bronze acorn Feb 17, 2021, 3:15 AM

#

Yeah js has no multithreading

raven ridge Feb 17, 2021, 3:16 AM

#

one of the reasons a JIT has trouble making assumptions about how Python code should behave is that another thread could be changing the type of an object.

gleaming rover Feb 17, 2021, 3:16 AM

#

visual shadow or does it have primitives

effectively no, I’d say

bronze acorn Feb 17, 2021, 3:16 AM

#

But asyncio is used alot.

visual shadow Feb 17, 2021, 3:16 AM

#

oh wow interesting, i didnt realise that

bronze acorn Feb 17, 2021, 3:17 AM

#

Has someone heard of "Jai" programming language

#

The devs said it will be better than c++ in game devs, but I suspect it

raven ridge Feb 17, 2021, 3:19 AM

#

better in what respect? It's pretty hard to beat C++ for performance, but pretty easy to beat it for ease of use or developer time or safety

bronze acorn Feb 17, 2021, 3:20 AM

#

raven ridge better in what respect? It's pretty hard to beat C++ for performance, but pretty...

Idk, ask the devs CF_Kekw

#

This is why i suspect it being better than c++

#

Even in game devs

raven ridge Feb 17, 2021, 3:22 AM

#

there are language that are as fast as C++. C and Rust are basically equally fast as C++, assuming they're programmed by a competent developer. And C and C++ leave some optimizations on the table that other languages could pick up - the aliasing rules make it impossible to optimize some things as well as should be possible

#

faster languages than C and C++ can exist.

bronze acorn Feb 17, 2021, 3:23 AM

#

raven ridge better in what respect? It's pretty hard to beat C++ for performance, but pretty...

What do you mean by "safety"?

raven ridge Feb 17, 2021, 3:23 AM

#

it's very easy to write buggy C++ code. And many - maybe even most - C++ bugs are actually security vulnerabilities.

bronze acorn Feb 17, 2021, 3:24 AM

#

Yeah i agree on that

#

Well, C-Lang is worse

#

If we are talking about security vulnerabilities

raven ridge Feb 17, 2021, 3:25 AM

#

I'm not sure that it is...

#

Maybe slightly, but not by a huge amount.

#

both of these are equally bad: ```cpp
int c[2] = {0, 1};
c[2];

std::vector<int> c{0, 1};
c[2];

visual shadow Feb 17, 2021, 3:31 AM

#

what does this do wrong? as an outsider looking in, without context, this looks perfectly fine

raven ridge Feb 17, 2021, 3:32 AM

#

accesses memory past the end of an allocated array/vector

visual shadow Feb 17, 2021, 3:32 AM

#

(and im assuming c[2] is saying make a array of length 2?)

raven ridge Feb 17, 2021, 3:32 AM

#

yep.

visual shadow Feb 17, 2021, 3:32 AM

#

oh. ohh

#

wait, no indexerrors?

feral cedar Feb 17, 2021, 3:32 AM

#

segfault ?

torpid bridge Feb 17, 2021, 3:32 AM

#

feral cedar segfault ?

If you're lucky

raven ridge Feb 17, 2021, 3:32 AM

#

possibly segv, more likely access arbitrary memory.

torpid bridge Feb 17, 2021, 3:32 AM

#

I can do the same in python

raven ridge Feb 17, 2021, 3:33 AM

#

and yeah, the segv is the better outcome.

visual shadow Feb 17, 2021, 3:33 AM

#

spooky

#

I took IndexError for granted, guess it's not as common as it could be

raven ridge Feb 17, 2021, 3:33 AM

#

visual shadow wait, no indexerrors?

vector has .at() which does raise an error for out of bounds values - but no one uses it.

feral cedar Feb 17, 2021, 3:33 AM

#

lul

visual shadow Feb 17, 2021, 3:33 AM

#

denied!

torpid bridge Feb 17, 2021, 3:34 AM

#

Or something. I'll figure it out in a moment

raven ridge Feb 17, 2021, 3:34 AM

#

😄

visual shadow Feb 17, 2021, 3:34 AM

#

wait this is amazing

torpid bridge Feb 17, 2021, 3:35 AM

#

!e ```py
import ctypes
a = b"1234"
print(ctypes.string_at(id(a)+32, 15))

raven ridge Feb 17, 2021, 3:35 AM

#

!e ```py
import ctypes as c

c.POINTER(c.c_void_p).from_address(id(int)+96)[0] = c.POINTER(c.c_void_p).from_address(id(int)+96)[1]
a = 5
b = 3
print(a+b)

fallen slateBOT Feb 17, 2021, 3:35 AM

#

@raven ridge :white_check_mark: Your eval job has completed with return code 0.

#

@torpid bridge :white_check_mark: Your eval job has completed with return code 0.

b'1234\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00'

torpid bridge Feb 17, 2021, 3:35 AM

#

There

gleaming rover Feb 17, 2021, 3:35 AM

#

visual shadow I took `IndexError` for granted, guess it's not as common as it could be

one of the big things about C++ is that you don't pay for what you don't use, and bounds checking has a cost

torpid bridge Feb 17, 2021, 3:35 AM

#

strings are different than bytes

#

and the bytes type header is 32 bytes away from its pointer

visual shadow Feb 17, 2021, 3:35 AM

#

gleaming rover one of the big things about C++ is that you don't pay for what you don't use, an...

aha. okay, that makes sense

#

tradeoffs. tradeoffs everywhere.

torpid bridge Feb 17, 2021, 3:36 AM

#

Anyway, you can see how I read ... 11 bytes past the end of a python bytes object

#

I got all zeros cause I was lucky

raven ridge Feb 17, 2021, 3:36 AM

#

make that 5000 and you'll SEGV for sure.

torpid bridge Feb 17, 2021, 3:37 AM

#

Honestly not sure what I'm hitting

visual shadow Feb 17, 2021, 3:37 AM

#

does this get limited/boxed into the memory reserved by python, or is this literally free to access any part of the memory?

torpid bridge Feb 17, 2021, 3:37 AM

#

!e ```py
import ctypes
a = b"1234"
b = b"1586test"
print(ctypes.string_at(id(a)+32, 15))
print(id(a), id(b))

fallen slateBOT Feb 17, 2021, 3:37 AM

#

@torpid bridge :white_check_mark: Your eval job has completed with return code 0.

001 | b'1234\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00'
002 | 139800389733472 139800389734000

torpid bridge Feb 17, 2021, 3:37 AM

#

looks like.. I'd need to read a few hundred

#

!e ```py
import ctypes
a = b"1234"
b = b"1586test"
print(ctypes.string_at(id(a)+32, 1000))
print(id(a), id(b))

fallen slateBOT Feb 17, 2021, 3:37 AM

#

@torpid bridge :white_check_mark: Your eval job has completed with return code 0.

001 | b'1234\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x04\x00\x00\x00\x00\x00\x00\x00\x00\xe9\xd0}\xfa\x7f\x00\x00\x01\x00\x00\x00\x00\x00\x00\x00\xff\xff\xff\xff\xff\xff\xff\xffb\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x000\\N}\xfa\x7f\x00\x00\xf0\xa0N}\xfa\x7f\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x01\x00\x00\x00\x00\x00\x00\x00\xc0:\xd2}\xfa\x7f\x00\x00\x01\x00\x00\x00\x00\x00\x00\x00\xf0\xc4N}\xfa\x7f\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\xc0:\xd2}\xfa\x7f\x00\x00\x01\x00\x00\x00\x00\x00\x00\x00\x00\xdaN}\xfa\x7f\x00\x00\x01\x00\x00\x00\x00\x00\x00\x00\x00\xe9\xd0}\xfa\x7f\x00\x00\n\x00\x00\x00\x00\x00\x00\x00\xff\xff\xff\xff\xff\xff\xff\xfft\x00|\x00|\x01\x83\x02S\x00\x00\x00\x00\x00\x00\x00\xb0\x1dN}\xfa\x7f\x00\x00\xc0\xe3\xd3}\xfa\
... (truncated - too long)

Full output: https://paste.pythondiscord.com/ludukaliqo.txt

gleaming rover Feb 17, 2021, 3:38 AM

#

visual shadow does this get limited/boxed into the memory reserved by python, or is this liter...

what is "this"

gusty wraith Feb 17, 2021, 3:38 AM

#

hey guys

raven ridge Feb 17, 2021, 3:38 AM

#

visual shadow does this get limited/boxed into the memory reserved by python, or is this liter...

it's virtual memory - it can access any memory segment that is mapped into the Python process's address space and is readable.

visual shadow Feb 17, 2021, 3:38 AM

#

oh, just the way of accessing arbitrary data using ctypes.string_at

torpid bridge Feb 17, 2021, 3:38 AM

#

fwiw b is in there, it's like, 500 characters after a

visual shadow Feb 17, 2021, 3:38 AM

#

okay, so it's effectively still boxed into python reserved memory*

torpid bridge Feb 17, 2021, 3:38 AM

#

I can just

gleaming rover Feb 17, 2021, 3:38 AM

#

visual shadow oh, just the way of accessing arbitrary data using `ctypes.string_at`

in general, a process can't look at any other process's memory without permission (there are ways, but you'd have to go through the OS)

#

AFAIK

torpid bridge Feb 17, 2021, 3:39 AM

#

virtual addressing

#

I think you straight up can but it's not likely

raven ridge Feb 17, 2021, 3:39 AM

#

gleaming rover in general, a process can't look at any other process's memory without permissio...

barring attacks on the hardware itself, like rowhammer.

#

or spectre.

gleaming rover Feb 17, 2021, 3:39 AM

#

ye

torpid bridge Feb 17, 2021, 3:40 AM

#

!e ```py
import ctypes
a = b"1234"
b = b"1586test"
data = ctypes.string_at(id(a)+32, 1000)
print(data[:15], data[600:800])
print(id(a), id(b))

fallen slateBOT Feb 17, 2021, 3:40 AM

#

@torpid bridge :white_check_mark: Your eval job has completed with return code 0.

001 | b'1234\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00' b'\x00\xe9\x01\xa9\x8c\x7f\x00\x00\x06\x00\x00\x00\x00\x00\x00\x00\xff\xff\xff\xff\xff\xff\xff\xffctypes\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x02\x00\x00\x00\x00\x00\x00\x00\x00\xe9\x01\xa9\x8c\x7f\x00\x00\x01\x00\x00\x00\x00\x00\x00\x00\xff\xff\xff\xff\xff\xff\xff\xff.\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x01\x00\x00\x00\x00\x00\x00\x00\x00\xe9\x01\xa9\x8c\x7f\x00\x00\t\x00\x00\x00\x00\x00\x00\x00\xff\xff\xff\xff\xff\xff\xff\xffstring_at\x00\x00\x00\x00\x00\x00\x00\x01\x00\x00\x00\x00\x00\x00\x00\x00\xe9\x01\xa9\x8c\x7f\x00\x00\n\x00\x00\x00\x00\x00\x00\x00\xff\xff\xff\xff\xff\xff\xff\xff\x08\x01\x04\x01\x04\x01\x14\x01\x1a\x01\x00\x00\x00\x00\x00\x00\x10\x1e\x7f\xa8\x8c\x7f\x00\x00 \x1e\x7f\xa8\x8c\x7f\x00\x00'
002 | 140242099051616 140242099052144

torpid bridge Feb 17, 2021, 3:40 AM

#

Ah, you can see, I'm reading through globals()

#

There's a way to access this mutably, so I could make things get really weird really quickly

raven ridge Feb 17, 2021, 3:41 AM

#

!e This thing that I pasted without comment above replaces integer addition with integer subtraction.

import ctypes as c

c.POINTER(c.c_void_p).from_address(id(int)+96)[0] = c.POINTER(c.c_void_p).from_address(id(int)+96)[1]
a = 5
b = 3
print(a+b)

fallen slateBOT Feb 17, 2021, 3:41 AM

#

@raven ridge :white_check_mark: Your eval job has completed with return code 0.

feral cedar Feb 17, 2021, 3:42 AM

#

otypes

torpid bridge Feb 17, 2021, 3:42 AM

#

that's intentional

feral cedar Feb 17, 2021, 3:42 AM

#

oh?

#

oh

torpid bridge Feb 17, 2021, 3:42 AM

#

I'm trying to overwrite something, one moment

raven ridge Feb 17, 2021, 3:43 AM

#

have you heard of #bot-commands? 😛

feral cedar Feb 17, 2021, 3:43 AM

#

or #esoteric-python actually

torpid bridge Feb 17, 2021, 3:43 AM

#

!e ```py
import ctypes
a = b"1234"
b = b"1586test"
data = ctypes.string_at(id(a)+32, 1000)
ctypes.c_char.from_address(id(a)+32+data.find(b"ctypes")).value = ord("o")
print(data[:15], data[600:800])
print(id(a), id(b))
print(otypes)

#

Anyway

signal tide Feb 17, 2021, 3:44 AM

#

raven ridge !e This thing that I pasted without comment above replaces integer addition with...

could you use this to overwrite builtins as well?

torpid bridge Feb 17, 2021, 3:44 AM

#

Yes

#

I used it to dynamically rewrite python's compiled bytecode to add a "jump to end" magic call for a with statement

raven ridge Feb 17, 2021, 3:44 AM

#

signal tide could you use this to overwrite builtins as well?

that is overwriting the builtin. It's replacing int.__add__ with int.__sub__, basically

torpid bridge Feb 17, 2021, 3:44 AM

#

Basically you can edit arbitrary ram

#

let me just

#

!e ```py
import ctypes
a = b"1234"
ctypes.c_char.from_address(id(a)+32+1).value = ord("o")
print(a)

fallen slateBOT Feb 17, 2021, 3:45 AM

#

@torpid bridge :white_check_mark: Your eval job has completed with return code 0.

b'1o34'

torpid bridge Feb 17, 2021, 3:45 AM

#

bytes objects are not so immutable when you oops your bounds checks

feral cedar Feb 17, 2021, 3:45 AM

#

bruh

visual shadow Feb 17, 2021, 3:45 AM

#

lol

torpid bridge Feb 17, 2021, 3:45 AM

#

In C, [] is literally syntactic sugar

#

for a pointer dereference

raven ridge Feb 17, 2021, 3:45 AM

#

at the C level, nothing's immutable. You can replace 0 with 1

torpid bridge Feb 17, 2021, 3:45 AM

#

that is c[5] is equivalent to *c+5 or something, I don't remember the syntax right

feral cedar Feb 17, 2021, 3:45 AM

#

wtf

signal tide Feb 17, 2021, 3:46 AM

#

huh

raven ridge Feb 17, 2021, 3:46 AM

#

*(c+5) but yeah.

torpid bridge Feb 17, 2021, 3:46 AM

#

which basically means "the memory address of variable c plus the index times sizeof *c"

raven ridge Feb 17, 2021, 3:46 AM

#

times sizeof *c

torpid bridge Feb 17, 2021, 3:46 AM

#

so if you go past the end you write into something else

#

thanks

raven ridge Feb 17, 2021, 3:47 AM

#

yeah. In fact, esoteric C: because x[5] is equivalent to *(x + 5), and *(x + 5) is equivalent to *(5 + x), it's possible to do 5[x] instead of x[5].

torpid bridge Feb 17, 2021, 3:47 AM

#

I made the assumption when I was writing the code before that python would try and allocate a and b next to each other but it looks like there's some optimizations there so I can't rely on that

#

because if they were pressed together, reading, say, the 6th character of the 4 character string asdf pressed up against the four character string test would give you e

#

C often has this issue because C strings are null-terminated, not size-prefixed

#

which means that you read until you see \0

#

unless someone forgot to tack that on, then you read into whatever until you happen upon one

raven ridge Feb 17, 2021, 3:49 AM

#

torpid bridge C often has this issue because C strings are null-terminated, not size-prefixed

that's one of the biggest performance mistakes in the entire C programming language, frankly.

torpid bridge Feb 17, 2021, 3:49 AM

#

Yeah

visual shadow Feb 17, 2021, 3:49 AM

#

(id(a)+32 this part makes me guess it would be asdf and then another jump of 32 first?

torpid bridge Feb 17, 2021, 3:49 AM

#

Many libraries add size-prefixed strings but then they have to use apis expecting \n

#

not quite

#

because this is python, a is not just a string of bytes in memory

#

it includes the size too

#

I can find the structure somewhere in the repo but

feral cedar Feb 17, 2021, 3:50 AM

#

each character is 2 bytes?

torpid bridge Feb 17, 2021, 3:50 AM

#

the "header" bit of bytes that contains all that data is 32 bytes in size

raven ridge Feb 17, 2021, 3:50 AM

#

the id of an object, in CPython, is the address of that object in memory. 32 bytes after the start of that bytes object is a dynamically allocated array of bytes containing the actual contents.

torpid bridge Feb 17, 2021, 3:50 AM

#

so a bytes object is:
[-----32 bytes of ref count and size and stuff ------]stringgoeshereasrawdata

feral cedar Feb 17, 2021, 3:50 AM

#

oh

torpid bridge Feb 17, 2021, 3:51 AM

#

So I basically do id(a) to get the address, which points at the start, then +32 to jump past the data I don't need

#

!e ```py
import ctypes
a = b"1234"
data = ctypes.string_at(id(a), 32+4)
print(data)

fallen slateBOT Feb 17, 2021, 3:52 AM

#

@torpid bridge :white_check_mark: Your eval job has completed with return code 0.

b'\x03\x00\x00\x00\x00\x00\x00\x00\x00\xc9u\x83\xc1\x7f\x00\x00\x04\x00\x00\x00\x00\x00\x00\x00[\xf2\xb2\n{\x11\x05\xbb1234'

raven ridge Feb 17, 2021, 3:52 AM

#

https://github.com/python/cpython/blob/master/Include/cpython/bytesobject.h#L5-L8 is the link to it in the CPython code.

torpid bridge Feb 17, 2021, 3:52 AM

#

This is the full bytes object, I think

#

thanks

raven ridge Feb 17, 2021, 3:53 AM

#

the PyObject_VAR_HEAD is 24 bytes, Py_hash_t is 8 bytes, so ob_sval starts 32 bytes after the start of the object.

visual shadow Feb 17, 2021, 3:53 AM

#

yeah, so im wondering if when you had 2 variables, would b start at id(a), 32+4 + 32?

raven ridge Feb 17, 2021, 3:53 AM

#

not necessarily, but possibly.

torpid bridge Feb 17, 2021, 3:55 AM

#

Python does pool allocations, so its more likely. But that relies on no other ones being allocated and having space and etc

#

There's a common thing that shows this, actually

#

so close

#

Anyway, one of those constructions on a live interpreter will give you identical values

#

because one has been deallocated and leaves exactly enough space for another to show up

raven ridge Feb 17, 2021, 3:58 AM

#

!e ```py
a = b"1234"
print(id(a))
del a

b = b"1234"
print(id(b))

fallen slateBOT Feb 17, 2021, 3:58 AM

#

@raven ridge :white_check_mark: Your eval job has completed with return code 0.

001 | 140092334993840
002 | 140092334993840

torpid bridge Feb 17, 2021, 3:58 AM

#

Yeah, like that

#

Use after free here can be used to edit a string you don't own

#

for example, if a is a string you own and are allowed to access, but access after it has been deleted and someone else controls b

#

like forgetting to change the locks on a storage unit

#

A lot of the recent ios exploits function off this, basically, you try and get the ios kernel to confuse two different datastructures (usually IOKit ones because they're full of pointers to good stuff and easy to get lots of), and once that happens you can get the kernel to change things that you shouldn't be able to.

open acorn Feb 17, 2021, 4:07 AM

#

Yo any one willing to help me with simple script that i maybe missing

#

How do i visit list of urls from excel column , i have imported cell value with Openpyxl . So what should i type in brackets driver.get(urls from excel) to get it to visit that url

boreal umbra Feb 17, 2021, 4:16 AM

#

@open acorn this is strictly a discussion channel. See #❓｜how-to-get-help

wide vector Feb 17, 2021, 6:22 AM

#

Hey folks! I have a question regarding the is operator, and furthermore, the way Python stores references when assigning variables. I noticed after a bit of digging that the is operator will return False if the two string variables are equal but contain characters that are not alphanumeric or an underscore. I traced it back to cpython's codeobject.c file which seems to only account for those alphanumeric (plus _) characters:

#define NAME_CHARS \
"0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ_abcdefghijklmnopqrstuvwxyz"

Is this done intentionally? Why are non A-N+_ strings stored with unique references? May be a lower level thing, I'm not sure.

terse pivot Feb 17, 2021, 6:24 AM

#

do u mean that ```py
a = "@"
b = "@"
a is b

#

@wide vector

wide vector Feb 17, 2021, 6:25 AM

#

Interesting, so that works. let me do a little more digging

terse pivot Feb 17, 2021, 6:26 AM

#

a = "@a"
b = "@a"
a is b
``` wouldnt

wide vector Feb 17, 2021, 6:26 AM

#

I think it is when you use in combo with alnum

#

yeah

#

I'm wondering why that is

terse pivot Feb 17, 2021, 6:27 AM

#

@wide vector is is used for checking refrence point of objects in memory

#

and the memory refrence changes for that @ character

#

>>> id(a)
2178931147184
>>> id(b)
2178931106480

wide vector Feb 17, 2021, 6:29 AM

#

Yes, however why does that occur only when either adding a non-alnum to an alnum, or having a sequence of non-alnums?

#

when using alnums, it doesn't seem to produce that issue. You can change the lengths and contents of both strings but as long as they hold the same alnum text it will output a True value

#

>>> a = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
>>> b = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
>>> a is b
True

#

I'm sure it is some low level implementation that causes this, I'm just wondering. Is there a reason to this inconsistency?

#

Maybe a limitation of C itself

amber nexus Feb 17, 2021, 6:43 AM

#

I assume it would be the same deal as how Python caches integers from -5 to 257 (someone correct me if my numbers are wrong).

#

It would store string combinations in order to save time that would otherwise be needed to construct the string

wide vector Feb 17, 2021, 6:44 AM

#

interesting optimization there...

terse pivot Feb 17, 2021, 6:46 AM

#

b = "@@"
a = "@@"
``` also are not refrenced pointed to same object

wide vector Feb 17, 2021, 6:51 AM

#

So with that we can determine that Python only caches alphanumeric strings, as well as the references to the individual non-alphanumeric characters. However, non-AN characters in combination with other non_AN OR AN characters will NOT be cached and therefore result in unique references.

#

Wow, that's interesting

safe hedge Feb 17, 2021, 6:52 AM

#

Single-character strings in the Latin1 range (U+0000 - U+00FF) are shared in CPython. This saves memory and CPU time of per-character processing of strings containing ASCII characters and characters from Latin based alphabets. But the users of languages that use non-Latin based alphabets are not so lucky. Proposed PR adds a cache for characters in BMP (U+0100 - U+FFFF) which covers most alphabetic scripts.

https://bugs.python.org/issue31484

wide vector Feb 17, 2021, 6:53 AM

#

So it's a bug?

#

Oh wait, it's closed

amber nexus Feb 17, 2021, 6:53 AM

#

No, that's just a PR suggesting other things to add to the cache

wide vector Feb 17, 2021, 6:53 AM

#

Gotcha

amber nexus Feb 17, 2021, 6:54 AM

#

And I believe you can add whatever you want to the cache

safe hedge Feb 17, 2021, 6:54 AM

#

>>> "hell" + "o" is "hello"
<stdin>:1: SyntaxWarning: "is" with a literal. Did you mean "=="?
True

#

And this: http://guilload.com/python-string-interning/

wide vector Feb 17, 2021, 6:55 AM

#

>>> "hell" + "@" is "hell@"
<stdin>:1: SyntaxWarning: "is" with a literal. Did you mean "=="?
True

#

wow

#

string interning, new to me!

raven ridge Feb 17, 2021, 6:57 AM

#

The key point here is that, if you're trying to understand is and id(), don't use them on immutable objects.

safe hedge Feb 17, 2021, 6:57 AM

#

The relevant part as to the original query appears to be:

The function all_name_chars rules out strings that are not composed of ascii letters, digits or underscores, i.e. strings looking like identifiers:

#

Which refers to function:

#define NAME_CHARS \
    "0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ_abcdefghijklmnopqrstuvwxyz"

/* all_name_chars(s): true iff all chars in s are valid NAME_CHARS */

static int
all_name_chars(unsigned char *s)
{
    static char ok_name_char[256];
    static unsigned char *name_chars = (unsigned char *)NAME_CHARS;

    if (ok_name_char[*name_chars] == 0) {
        unsigned char *p;
        for (p = name_chars; *p; p++)
            ok_name_char[*p] = 1;
    }
    while (*s) {
        if (ok_name_char[*s++] == 0)
            return 0;
    }
    return 1;
}

raven ridge Feb 17, 2021, 6:59 AM

#

With mutable objects, every instance has its own state, so it's relevant whether two objects with the same value are or are not references to the same object. With immutable objects, two objects with the same value are completely interchangeable, and the interpreter can, at its discretion, optimize away objects by making them refer to previously created objects.

#

The answer to the "why only alphanumerics plus underscore" is that the CPython authors thought those strings would be the most likely to get reused, so they're worth caching. Other strings are deemed less likely to get reused, or more expensive to cache, so they don't cache them.

hexed aspen Feb 17, 2021, 7:02 AM

#

Are there any PEP guidelines (or other principles) on whether you should make your class subscriptable or not?

raven ridge Feb 17, 2021, 7:02 AM

#

But the entire existence of this cache is an implementation detail. Using is or id() on immutable objects doesn't necessarily give you meaningful results, because the interpreter is allowed to cheat.

wide vector Feb 17, 2021, 7:04 AM

#

raven ridge The answer to the "why only alphanumerics plus underscore" is that the CPython a...

Ahh, interesting design decision!

raven ridge Feb 17, 2021, 7:04 AM

#

hexed aspen Are there any PEP guidelines (or other principles) on whether you should make yo...

Hm. If it's collection-like and you want to expose access to the items in the collection, you implement __getitem__, otherwise you don't

terse pivot Feb 17, 2021, 7:06 AM

#

variables are refrenced

hexed aspen Feb 17, 2021, 7:07 AM

#

Say I have a class that parses text into ~12 meaningful attributes, any or all of which you might export to a spreadsheet, etc. But conceivably you could add attributes, depending on the project you wanted to use it for. Do you have an opinion on whether it’s wise there?

raven ridge Feb 17, 2021, 7:08 AM

#

It's not. That's not collection-like

#

Instead, expose them as named attributes on the object

hexed aspen Feb 17, 2021, 7:09 AM

#

Cool. That’s how I implemented it, actually. I have a couple methods to dump the chosen attribute values to a list or dict.

#

I just learned how to implement subscripting in custom classes; just wasn’t 100% sure on when you would want to vs. not. Thanks!

raven ridge Feb 17, 2021, 7:12 AM

#

You should use it for things that are container-like: subclasses of dict or list or tuple or set, or things that behave like dicts or lists or tuples or sets, etc.

#

Basically, collections of stuff where the user can insert or remove elements in the fly, and where the primary purpose of the object is to contain those items.

hexed aspen Feb 17, 2021, 7:15 AM

#

Yeah, that makes perfect sense.

shrewd dune Feb 17, 2021, 7:28 AM

#

safe hedge ```python >>> "hell" + "o" is "hello" <stdin>:1: SyntaxWarning: "is" with a lite...

unrelated, but that + is being optimized away in the bytecode

#

it's going to be the same as 'hello' is 'hello'

safe hedge Feb 17, 2021, 7:29 AM

#

I mean that's kind of what that shows no?

shrewd dune Feb 17, 2021, 7:30 AM

#

well my point is that it's like when you do x = 4999999 + 1

raven ridge Feb 17, 2021, 7:30 AM

#

That's optimized before the code is ever executed.

shrewd dune Feb 17, 2021, 7:30 AM

#

during runtime that's going to be 5000000

raven ridge Feb 17, 2021, 7:30 AM

#

At no point is there a str object that contains "hell"

unkempt rock Feb 17, 2021, 7:30 AM

#

why is this code not accepting any input value from the user?

#

class metropolis:
def init(self,Mcode,MName,MPop,Area,PopDens):
self.Mcode=Mcode
self.MName=MName
self.MPop=MPop
self.Area=Area
self.PopDens=PopDens
def Calden(self):
d=PopDens/Area
print(d)
def Enter(self):
return(
int(input("Enter the area code")),
input("Enter the name"),
int(input("Enter the population")),
float(input("Enter the area coverage")),
float(input("Enter the pop density")),
)
print(metropolis.Calden())
def Viewall(self):
print(metropolis.init())

#

anyone knows?

raven ridge Feb 17, 2021, 7:31 AM

#

This is not a help channel, it's a channel for discussion about Python language concepts and implementation.

#

Check out #❓｜how-to-get-help

safe hedge Feb 17, 2021, 7:32 AM

#

You can see how it's different if you do this right:

>>> hell = 'hell'
>>> o = 'o'
>>> a = 'hello'
>>> hell + o is a
False

terse pivot Feb 17, 2021, 7:33 AM

#

i guess only immutables are refrenced

raven ridge Feb 17, 2021, 7:33 AM

#

When you do "hell" + "o", that's optimized away when the code is compiled, which is different from the optimizations that we've been looking at for immutable objects which happen when the code is being interpreted.

#

Some simple optimizations are performed at compile time, and constant folding is one of them.

#

https://bugs.python.org/issue1346238 shows where it was added.

safe hedge Feb 17, 2021, 7:36 AM

#

safe hedge And this: http://guilload.com/python-string-interning/

Indeed it's referenced in the article I linked

raven ridge Feb 17, 2021, 7:37 AM

#

Er, https://bugs.python.org/issue29469 actually

raven ridge Feb 17, 2021, 7:37 AM

#

safe hedge Indeed it's referenced in the article I linked

Ah, I didn't notice that. It's a separate topic from string interning, so I wouldn't have expected it to be there, but fair enough

safe hedge Feb 17, 2021, 7:38 AM

#

They explain it's why strings longer 20 are not interned

#

Or rather, why computed strings longer than 20 are not interned

#

i.e. a = 'a' * 21

unkempt rock Feb 17, 2021, 7:39 AM

#

longer than 20 characters?

In [4]: a = 'a' * 21

In [5]: 'a' * 21 is a
<>:1: SyntaxWarning: "is" with a literal. Did you mean "=="?
<ipython-input-5-870f341d8bb7>:1: SyntaxWarning: "is" with a literal. Did you mean "=="?
  'a' * 21 is a
Out[5]: True

#

Last I recall it was 4096

#

In [6]: a = 'a' * 4097

In [8]: 'a' * 4097 is a
Out[8]: False

In [9]: a = 'a' * 4096

In [10]: 'a' * 4096 is a
<>:1: SyntaxWarning: "is" with a literal. Did you mean "=="?
<ipython-input-10-0f9df0e048c2>:1: SyntaxWarning: "is" with a literal. Did you mean "=="?
  'a' * 4096 is a
Out[10]: True

safe hedge Feb 17, 2021, 7:41 AM

#

I mean that's what it was in that article. It's an implementation detail so maybe it changed since 2014...

#

Or maybe the article simplified down the number?

raven ridge Feb 17, 2021, 7:41 AM

#

IIRC, 32 and 64 bit Python have different limits.

safe hedge Feb 17, 2021, 7:41 AM

#

Would make sense

raven ridge Feb 17, 2021, 7:41 AM

#

And that article was written for Python 2, so it's at least a bit out of date

shrewd dune Feb 17, 2021, 7:41 AM

#

It doesn't fit on my screen

📎 Screenshot_2021-02-17-04-41-37-108_com.termux.jpg

#

>>> dis("'' * 4096")
  1           0 LOAD_CONST               0 ('')
              2 RETURN_VALUE
>>> dis("'' * 4097")
  1           0 LOAD_CONST               0 ('')
              2 RETURN_VALUE
>>> dis("'' * 4400")
  1           0 LOAD_CONST               0 ('')
              2 RETURN_VALUE
>>> dis("'' * 99999")
  1           0 LOAD_CONST               0 ('')
              2 RETURN_VALUE
>>>```smart

safe hedge Feb 17, 2021, 7:44 AM

#

haha

raven ridge Feb 17, 2021, 7:45 AM

#

I think that's constant propagation in play again

shrewd dune Feb 17, 2021, 7:45 AM

#

it's doing good

raven ridge Feb 17, 2021, 7:45 AM

#

I think it's evaluating it to empty string before even deciding whether to intern it.

#

I know string interning is a useful optimization from a performance point of view, but it's really annoying from a teaching point of view. It causes an awful lot of confusion about identity. People get taught "is" or "id()", and they try it out themselves, and it immediately does something unexpected by telling them that two objects they created at different times are actually the same object

fresh cargo Feb 17, 2021, 7:47 AM

#

Another rather interesting Python string optimization

>>> hello = "h"
>>> hello += "e"
>>> print(hello, id(hello))
he 1797357577712
>>> hello += "l"
>>> print(hello, id(hello))
hel 1797357577712
>>> hello += "l"
>>> print(hello, id(hello))
hell 1797357577712
>>> hello += "o"
>>> print(hello, id(hello))
hello 1797357577712

bronze acorn Feb 17, 2021, 7:48 AM

#

I have discussed this 1 day ago in a other server

safe hedge Feb 17, 2021, 7:48 AM

#

Didn't this get discussed yesterday

#

In this channel I think

shrewd dune Feb 17, 2021, 7:48 AM

#

fresh cargo Another rather interesting Python string optimization ```py >>> hello = "h" >>> ...

does that happen outside the REPL?

bronze acorn Feb 17, 2021, 7:48 AM

#

Id is different

raven ridge Feb 17, 2021, 7:48 AM

#

And that only works on a += b, not on a = a + b

bronze acorn Feb 17, 2021, 7:48 AM

#

You just can't see it I think

fresh cargo Feb 17, 2021, 7:48 AM

#

bronze acorn I have discussed this 1 day ago in a other server

Oops I probably missed it

bronze acorn Feb 17, 2021, 7:49 AM

#

Something to do with garbage collection

#

Memory location (id) would change after runtime

#

Afaik

raven ridge Feb 17, 2021, 7:49 AM

#

shrewd dune does that happen outside the REPL?

Yep, it does. The interpreter knows that there's only one reference to the object, and that it's about to be lost if it creates a new object to hold the new string, so it cheats and modifies the old one instead of destroying it and making a new one

#

!e ```py
x = "he"
print(id(x))
x += "l"
print(id(x))
y = x
x += "lo"
print(id(x))

fallen slateBOT Feb 17, 2021, 7:53 AM

#

@raven ridge :white_check_mark: Your eval job has completed with return code 0.

001 | 140298723851376
002 | 140298723877040
003 | 140298723877104

shrewd dune Feb 17, 2021, 7:53 AM

#

mmm

raven ridge Feb 17, 2021, 7:54 AM

#

Well that didn't do what I expected...

safe hedge Feb 17, 2021, 7:54 AM

#

Yeah I just tried that. I swear it was different yesterday hahah

bronze acorn Feb 17, 2021, 7:54 AM

#

Yeah

safe hedge Feb 17, 2021, 7:54 AM

#

I thought it was same id, same id, different id

bronze acorn Feb 17, 2021, 7:54 AM

#

So garbage collection can change id after you run the program sometimes

safe hedge Feb 17, 2021, 7:54 AM

#

But I got the same result as above haha

shrewd dune Feb 17, 2021, 7:54 AM

#

I asked because sometimes the REPL does some things differently

#

I had an "issue" once (~~not really I was just wasting my time~~) because the REPL would close a file descriptor in some esoteric code I was writing

#

because it was getting rid of some variable

bronze acorn Feb 17, 2021, 7:57 AM

#

fallen slate <@451976922361102357> :white_check_mark: Your eval job has completed with return...

The garbage collection time may vary

raven ridge Feb 17, 2021, 7:58 AM

#

Not really... At least, not with CPython.

bronze acorn Feb 17, 2021, 7:59 AM

#

At least that how I understand it BingShrug

unkempt rock Feb 17, 2021, 7:59 AM

#

"Up until version 3.7, Python used peephole optimization, and all strings longer than 20 characters were not interned. However, now it uses the AST optimizer, and (most) strings up to 4096 characters are interned."

This was the reason btw

safe hedge Feb 17, 2021, 8:00 AM

#

Yup makes sense

bronze acorn Feb 17, 2021, 8:00 AM

#

unkempt rock "Up until version 3.7, Python used peephole optimization, and all strings longer...

Interned?

shrewd dune Feb 17, 2021, 8:00 AM

#

oh yeah, we should try emojis

#

gg

safe hedge Feb 17, 2021, 8:00 AM

#

They were not

#

I tried

#

Because they aren't in the subset of characters in the cache

shrewd dune Feb 17, 2021, 8:00 AM

#

~~is the plural of emoji "emoji" or "emojis"?~~

safe hedge Feb 17, 2021, 8:01 AM

#

>>> a = '😂'
>>> b = '😂'
>>> a is b 
False

bronze acorn Feb 17, 2021, 8:01 AM

#

safe hedge ```py >>> a = '😂' >>> b = '😂' >>> a is b False ```

Cute

safe hedge Feb 17, 2021, 8:01 AM

#

Emojis are clearly unique personalities

bronze acorn Feb 17, 2021, 8:02 AM

#

Pytohn

raven ridge Feb 17, 2021, 8:02 AM

#

bronze acorn At least that how I understand it <:BingShrug:583791581497393162>

CPython has 2 different things called "garbage collection". Objects have a reference count, and most objects are destroyed when their reference count reaches 0. Some objects will never have their reference count reach 0, because object x holds a reference to object y, and object y holds a reference to object x. This is called a reference cycle. So Python has a second cycle collecting garbage collector that discovers dead objects whose reference counts can never reach 0 and destroys them.

shrewd dune Feb 17, 2021, 8:02 AM

#

\😂

bronze acorn Feb 17, 2021, 8:03 AM

#

raven ridge CPython has 2 different things called "garbage collection". Objects have a refer...

So how does this explain Id being different or not

raven ridge Feb 17, 2021, 8:03 AM

#

In all the examples we've been doing above, there are no reference cycles in play, so the cycle collecting garbage collector - the one that runs at arbitrary times - never comes into play

bronze acorn Feb 17, 2021, 8:04 AM

#

Yes, that understandable, no reference cycles, because there is no cycling in assigning objects to each other, but sometimes it show same ID sometimes not

#

How is this related to reference cycling

raven ridge Feb 17, 2021, 8:04 AM

#

I'm not sure what's causing that, but I'm certain it's not the GC.

raven ridge Feb 17, 2021, 8:05 AM

#

raven ridge !e ```py x = "he" print(id(x)) x += "l" print(id(x)) y = x x += "lo" print(id(x)...

If someone's at a computer, can they try running ^ this and see if they get the same results?

bronze acorn Feb 17, 2021, 8:05 AM

#

Only other answer I know, is it has to do with variable being mutable or something

shrewd dune Feb 17, 2021, 8:06 AM

#

safe hedge ```py >>> a = '😂' >>> b = '😂' >>> a is b False ```

I meant this because it said "most strings" but I guess it's only about interning, yeah, I thought it meant other optimizations

📎 Screenshot_2021-02-17-05-04-51-244_com.termux.jpg

bronze acorn Feb 17, 2021, 8:06 AM

#

Return the “identity” of an object. This is an integer which is guaranteed to be unique and constant for this object during its lifetime. Two objects with non-overlapping lifetimes may have the same id() value.

#

https://docs.python.org/3/library/functions.html#id

raven ridge Feb 17, 2021, 8:07 AM

#

Yep, and that's true. Two objects that are alive at different times can have the same id

#

But that's unrelated to the strings being updated in place stuff we were analyzing

bronze acorn Feb 17, 2021, 8:08 AM

#

https://stackoverflow.com/questions/62453270/why-do-different-strings-have-the-same-id-in-python

Stack Overflow

Why do different strings have the same ID in Python?

It is stated that strings are immutable objects, and when we make changes in that variable it actually creates a new string object.

So I wanted to test this phenomenon with this piece of code:

#

^ source

#

But still

#

It require some research

raven ridge Feb 17, 2021, 8:09 AM

#

!e ```py
x = "!"
print(id(x))
x += "!"
print(id(x))
x += "!"
print(id(x))
x += "!"
print(id(x))
x += "!"
print(id(x))
y = x
x += "!"
print(id(x))

fallen slateBOT Feb 17, 2021, 8:09 AM

#

@raven ridge :white_check_mark: Your eval job has completed with return code 0.

001 | 140059159624816
002 | 140059159658608
003 | 140059159658608
004 | 140059159658608
005 | 140059159658608
006 | 140059159658672

raven ridge Feb 17, 2021, 8:10 AM

#

There we go

safe hedge Feb 17, 2021, 8:10 AM

#

Yeah it's because we used single strings right

#

Which were coming from the cache

raven ridge Feb 17, 2021, 8:10 AM

#

Right.

#

It was due to interning again, heh.

safe hedge Feb 17, 2021, 8:10 AM

#

FUN.
TIMES.

bronze acorn Feb 17, 2021, 8:11 AM

#

https://www.tutorialspoint.com/How-can-we-change-the-id-of-an-immutable-string-in-Python#:~:text=The same strings have the,() value not being reused.
Interning I guess

How can we change the id of an immutable string in Python?

Strings in Python are immutable, that means that once a string is created, it can't be changed. When you create a string, and if you create same string and ...

raven ridge Feb 17, 2021, 8:11 AM

#

That shows the interpreter repeatedly modifying a string rather than creating a new one.

bronze acorn Feb 17, 2021, 8:11 AM

#

Oh God this link

shrewd dune Feb 17, 2021, 8:11 AM

#

interning is also confusing when you want to test efficiency qq

bronze acorn Feb 17, 2021, 8:12 AM

#

Yup

#

Interning

#

Though it changes with adding special characters

safe hedge Feb 17, 2021, 8:13 AM

#

It's an implementation detail though so you shouldn't really rely on it for any optimization

raven ridge Feb 17, 2021, 8:13 AM

#

bronze acorn Though it changes with adding special characters

Yep, that's why I switched to "!" in order to show off strings being modified

bronze acorn Feb 17, 2021, 8:13 AM

#

Hmmm

#

Pytohn var

raven ridge Feb 17, 2021, 8:14 AM

#

And the initial one is still not modified, because it's still interned (since it's a single character string, I think)

#

But yeah, very much implementation detail.

#

It's really unfortunate that everyone trying to learn is and id() bumps up against this implementation detail

safe hedge Feb 17, 2021, 8:16 AM

#

>>> x = '!'
>>> idx = id(x)
>>> print(idx)
4317569200
>>> x += '!'
>>> print(id(x))
4318584112
>>> x += '!'
>>> print(id(x))
4318584112
>>> idx == id('!')
True

#

Yeah it's unchanged

shrewd dune Feb 17, 2021, 8:16 AM

#

I'm almost off to bed but I remembered something

#

well I'm in my bed actually

#

>>> timeit('"holi"', number=100_000_000)
0.9777427129999978
>>> timeit('f"holi"', number=100_000_000)
1.4292464520000294 ```

bronze acorn Feb 17, 2021, 8:17 AM

#

safe hedge ```py >>> a = '😂' >>> b = '😂' >>> a is b False ```

Doesn't return False to me

shrewd dune Feb 17, 2021, 8:17 AM

#

this is an old snippet of mine

safe hedge Feb 17, 2021, 8:17 AM

#

bronze acorn Doesn't return False to me

I mean...

bronze acorn Feb 17, 2021, 8:17 AM

#

They are the same

#

So i was just wondering why would it return False

raven ridge Feb 17, 2021, 8:18 AM

#

shrewd dune I'm almost off to bed but I remembered something

Well, that seems reasonable enough. F strings do more work, it shouldn't be surprising that they're slower

shrewd dune Feb 17, 2021, 8:18 AM

#

>>> dis('"holi"')
  1           0 LOAD_CONST               0 ('holi')
              2 RETURN_VALUE
>>> dis('f"holi"')
  1           0 LOAD_CONST               0 ('holi')
              2 RETURN_VALUE ```

bronze acorn Feb 17, 2021, 8:18 AM

#

Where are you testing your code btw @shrewd dune

shrewd dune Feb 17, 2021, 8:19 AM

#

mmm but does it generate the byte code every time, or is it related to timeit?

raven ridge Feb 17, 2021, 8:19 AM

#

Wait, same bytecode and it's still 50% slower?

#

That's fascinating...

shrewd dune Feb 17, 2021, 8:19 AM

#

bronze acorn Where are you testing your code btw <@427194066216681474>

that was on my PC, months ago

bronze acorn Feb 17, 2021, 8:20 AM

#

LOAD_CONST and stuffs

shrewd dune Feb 17, 2021, 8:21 AM

#

raven ridge Wait, same bytecode and it's still 50% slower?

pypy showed similar times tho! (between the 2 cases, f-strings vs regular, there wasn't a faster one) but pypy was really wacky too, the JIT is weird. Sometimes it's much faster or much slower

#

during the exact same test

raven ridge Feb 17, 2021, 8:22 AM

#

I'm less knowledgeable about pypy, so something something JIT warm-up something.

#

🤷

shrewd dune Feb 17, 2021, 8:22 AM

#

what I shared is from CPython, still

#

ok good night everyone

undone hare Feb 17, 2021, 8:26 AM

#

https://github.com/smontanaro/python-0.9.1

GitHub

smontanaro/python-0.9.1

Upload and changes to Python 0.9.1 release (from 1991!) so that it would compile - smontanaro/python-0.9.1

#

That’s cool

safe hedge Feb 17, 2021, 8:31 AM

#

bronze acorn So i was just wondering why would it return False

Because they aren't the same memory space because they aren't in the set of cached strings?

bronze acorn Feb 17, 2021, 8:32 AM

#

Mhm

#

Anyone has looked at pattern matching (python 3.10)

safe hedge Feb 17, 2021, 8:33 AM

#

So the emoji thing returns true on the !e snekbox but false on my mac

#

Maybe down to the PEG parser in 3.9? Or maybe just a mac thing?

bronze acorn Feb 17, 2021, 8:33 AM

#

safe hedge So the emoji thing returns true on the `!e` snekbox but false on my mac

What about linux

safe hedge Feb 17, 2021, 8:33 AM

#

Well I assume snekbox is running on Linux

#

But I'm running 3.8.2 on my mac

#

vs 3.9 on !e

undone hare Feb 17, 2021, 8:37 AM

#

Snekbox is running 3.9.1 on GCC 8.3.0

#

on a debian image iirc

bronze acorn Feb 17, 2021, 8:38 AM

#

What is snekbox

undone hare Feb 17, 2021, 8:38 AM

#

Yep, debian buster

#

It is a webservice we use to sandbox the !eval command

bronze acorn Feb 17, 2021, 8:38 AM

#

Oh amusing.

tidal marten Feb 17, 2021, 8:52 AM

#

Oh wow, this actually works...
if (a := 12 + 3) == 15: print("OK", a)

#

It's kind of ugly tho

undone hare Feb 17, 2021, 8:53 AM

#

Why did you think it wouldn’t work?

tidal marten Feb 17, 2021, 8:54 AM

#

I just saw some articles mentioning the PEP, but I didn't know whether it's actually implemented.

#

Good to know

slim fiber Feb 17, 2021, 10:36 AM

#

Is there a library that can work with IPhone? E.g simulating swipe left/swipe right on the home screen

prime vale Feb 17, 2021, 10:42 AM

#

Can someone please help me out, I want to get started to write a research paper

slim fiber Feb 17, 2021, 10:47 AM

#

@prime vale Whats your rsrch paper on? How proficient are you w Python

prime vale Feb 17, 2021, 10:48 AM

#

slim fiber <@664398831386886154> Whats your rsrch paper on? How proficient are you w Python

I am an python intermediate guy

slim fiber Feb 17, 2021, 10:48 AM

#

How do you need help getting started?

#

Do you want someone to give you ideas? Write the paper for you? Hold your hand while you piss?

#

You gotta be more specific

prime vale Feb 17, 2021, 10:50 AM

#

I don't know how to get started

#

so, I want to narrow down a topic so that I can start a research paper

prime vale Feb 17, 2021, 10:50 AM

#

slim fiber How do you need help getting started?

.

unkempt roost Feb 17, 2021, 10:55 AM

#

prime vale so, I want to narrow down a topic so that I can start a research paper

You can search OpenCV with python. It provides face recognition or face detection, its used in Car’s lane keeping support. Very interesting topi

prime vale Feb 17, 2021, 10:56 AM

#

unkempt roost You can search OpenCV with python. It provides face recognition or face detectio...

@unkempt roost I have used open CV to make a project, so I want to know how can I proceed to write a research paper

unkempt roost Feb 17, 2021, 10:57 AM

#

You can find research abstracts online

gleaming rover Feb 17, 2021, 11:42 AM

#

prime vale Can someone please help me out, I want to get started to write a research paper

this is a channel for discussion of the language and probably isn't appropriate for your question...

#

also, I note that you have posted the same question in multiple channels

#

please don't.

weary garden Feb 17, 2021, 1:24 PM

#

to me a "primitive type" is a type that directly maps to the platform e.g. IEEE 754 floating point types usable by FPU or fixed width integer types usable directly by CPU.

#

I think adding syntactic sugar in the form of "methods" to these primitive types doesn't change them from being primitives.

wide shuttle Feb 17, 2021, 1:27 PM

#

Normally, a primitive is something that can't be broken down further in a simpler data type

#

Deep down in Python's objects, you may find such a primitive, but it's not the object you're interacting with directly

weary garden Feb 17, 2021, 1:27 PM

#

depends

wide shuttle Feb 17, 2021, 1:28 PM

#

Could you explain how it depends?

weary garden Feb 17, 2021, 1:28 PM

#

it depends on the implementation

#

my implementation will have optimizations so fixed width integers are preferred

wide shuttle Feb 17, 2021, 1:29 PM

#

Are you now talking about a language that is not Python?

weary garden Feb 17, 2021, 1:29 PM

#

I am talking about my implementation of Python

wide shuttle Feb 17, 2021, 1:29 PM

#

If you change how the objects fundamentally work, your implementation would not be a full implementation of the Python language standard

#

Which makes it "not Python" to me

weary garden Feb 17, 2021, 1:30 PM

#

no, my implementation will be a full implementation of the Python language; I don't care how existing implementations such as CPython works: as long as observable behavior is the same, i.e. the "as if" rule.

wide shuttle Feb 17, 2021, 1:31 PM

#

So, your floats and ints get wrapped in objects that expose an interface with methods and so on?

weary garden Feb 17, 2021, 1:32 PM

#

a primitve int will be transformed to a multi-precision int if such a transormation is deemed necessary in order for the Python program being compiled to remain valid.

wide shuttle Feb 17, 2021, 1:32 PM

#

That's not what I was asking

weary garden Feb 17, 2021, 1:32 PM

#

and "methods" will not require a CPython like object representation of the primitive in order to still work

wide shuttle Feb 17, 2021, 1:32 PM

#

In Python, you have instances of the class/type int, which are the individual int objects you work with

#

Those int objects give you access to the int methods defined in the class

weary garden Feb 17, 2021, 1:33 PM

#

it will appear to the Python program that int is still an object even though it isn't under the hood

wide shuttle Feb 17, 2021, 1:34 PM

#

Right, so what the programmer will experience is something compound that the compiler/interpreter takes care of

#

A primitive that also provides an interface as if it were an object

#

You're just not calling it that

weary garden Feb 17, 2021, 1:34 PM

#

it is a requirement that all existing Python 3.x programs remain valid when compiled with my implementation.

wide shuttle Feb 17, 2021, 1:35 PM

#

Yes, sure, I just don't really think that what you're exposing is something I'd consider to be a true primitive

#

Python objects may use such primitives internally as well

weary garden Feb 17, 2021, 1:35 PM

#

you mean CPython?

wide shuttle Feb 17, 2021, 1:36 PM

#

It's just that the element you interface with in your application is something that compounds both the value as well as the methods

#

No, your implementation

#

Which sounds awfully like objects to me, just emulated in a different way

weary garden Feb 17, 2021, 1:36 PM

#

I have yet to design the object semantic concept, it will be interesting to see what comes out in the wash

wide shuttle Feb 17, 2021, 1:37 PM

#

The way in which the language standard is defined, with objects that hold data and give access to methods, is what matters to me

#

Not how you fake it in your implementation

weary garden Feb 17, 2021, 1:37 PM

#

given my Python compiler doesn't know anything about Python

wide shuttle Feb 17, 2021, 1:37 PM

#

From my perspective, the abstract perspective of the user, it's not (just) a primitive

weary garden Feb 17, 2021, 1:38 PM

#

indeed, hence my mention of the "as if" rule

#

as long as observable behavior is the same

wide shuttle Feb 17, 2021, 1:38 PM

#

So, it's just a semantics game then. It's not a true primitive, from the perspective of the end user and the language definition, but you like to call it a primitive

weary garden Feb 17, 2021, 1:39 PM

#

it will be stored as the primitive type in the byte code

#

but that is a lower level of abstraction obviously

wide shuttle Feb 17, 2021, 1:39 PM

#

Still sounds like an implementation detail to me

weary garden Feb 17, 2021, 1:39 PM

#

indeed it is!

wide shuttle Feb 17, 2021, 1:40 PM

#

If you're talking about Python, it's still not a primitive type

weary garden Feb 17, 2021, 1:40 PM

#

which makes sense given I am creating an implementation! I need such details 🙂

wide shuttle Feb 17, 2021, 1:40 PM

#

It's still a compound type that consists of data and behavior

weary garden Feb 17, 2021, 1:40 PM

#

not a compound type, a type with meta data

wide shuttle Feb 17, 2021, 1:40 PM

#

Hence the discussion about if Python would contain primitives that happened earlier

weary garden Feb 17, 2021, 1:42 PM

#

my Python implementation contains no knowledge of Python itself

wide shuttle Feb 17, 2021, 1:42 PM

#

I don't really think that matters at all

#

From the perspective of Python, the language, the abstract concept, a float is not just a primitive

#

Whether or not you separate your logic in pieces and make sure another part of the implementation makes sure that you emulate Python's object behavior as specified in the language reference doesn't really matter

#

You're just putting parts of your implementation in different places, only look at a subset of what makes a "float" a "float" in python

#

and suddenly claim the entire thing is a primitive

weary garden Feb 17, 2021, 1:44 PM

#

in my implementation a Python float will be a generic semantic concept that maps to a native floating point type

wide shuttle Feb 17, 2021, 1:44 PM

#

which does not make a lot of sense to me

#

It also maps to behavior; the float methods

#

So, the Python float itself will not be your primitive

#

it uses a primitive internally, but that's not special

weary garden Feb 17, 2021, 1:45 PM

#

I think you need to read about extension methods to see where I am coming from. https://en.wikipedia.org/wiki/Extension_method

Extension method

In object-oriented computer programming, an extension method is a method added to an object after the original object was compiled. The modified object is often a class, a prototype or a type. Extension methods are features of some object-oriented programming languages. There is no syntactic difference between calling an extension method and c...

wide shuttle Feb 17, 2021, 1:46 PM

#

I don't really see how this changes the conceptualisation of floats on the level of Python

weary garden Feb 17, 2021, 1:46 PM

#

because my implementation doesn't know about Python.

wide shuttle Feb 17, 2021, 1:46 PM

#

And indeed it doesn't have to know about Python

#

When I'm writing in the Python language, I do

#

And when I use a Python float, it comes with both the value and the methods

#

However you implement it

#

That's up to you

weary garden Feb 17, 2021, 1:47 PM

#

such knowledge is transformed into something else during compilation, an implementation detail, as you say.

wide shuttle Feb 17, 2021, 1:47 PM

#

Anyway, if you feel better about it if you conceptualise it for yourself as being a "true primitive", please do

#

but I won't really say that your implementation makes it so it's a true primitive on the level of Python

weary garden Feb 17, 2021, 1:48 PM

#

key thing is multi-precision integers are slower that native integers so I will have an optimization that allows both to be used

wide shuttle Feb 17, 2021, 1:48 PM

#

Sounds cool

#

I'm sure a lot of people will be interested in it if you really manage to pull it off

weary garden Feb 17, 2021, 1:51 PM

#

project is https://neos.dev if you are interested.

neos ⚛ Universal Compiler and Bytecode JIT

neos Universal Compiler and Bytecode JIT

#

for example integer literals can be stored directly as a native integer type otherwise there will be an extra level of indirection to either a native integer type or a multi-precision type that can transform at will based on the outcome of expression evaluation.

#

(one way transform)

wide shuttle Feb 17, 2021, 2:04 PM

#

Sounds like you've got a plan for the implementation

#

It still means that I, as a Python programmer, don't deal with primitives directly

desert peak Feb 17, 2021, 2:04 PM

#

speaking of integers

weary garden Feb 17, 2021, 2:04 PM

#

I just thought about it two minutes ago 🙂

desert peak Feb 17, 2021, 2:04 PM

#

how does Python accomplish its "infinite" integer sizes?

#

whenever I try to do bitshifts or other clever things, I always seem to hit the C 32-bit int limits

#

is Python dynamically allocating for integers?

weary garden Feb 17, 2021, 2:05 PM

#

it likely uses multiple C 32-bit ints

feral cedar Feb 17, 2021, 2:16 PM

#

i believe it begins to store them akin to how it stores strings

#

i may be wrong

spark magnet Feb 17, 2021, 2:54 PM

#

desert peak is Python dynamically allocating for integers?

Yes, CPython ints dynamically grow as needed. is that what you are asking?

spark magnet Feb 17, 2021, 2:55 PM

#

desert peak whenever I try to do bitshifts or other clever things, I always seem to hit the ...

Where do you hit those limits?

#

language-agnostic integers sound very do-able. Language-agnostic classes do not 🙂

pliant tusk Feb 17, 2021, 3:09 PM

#

@weary garden python integers use an array of unsigned c ints specified by its ob_size field in the struct. Python floats already do map directly to native floats on the C level

#

python floats look like this in mem -> [ob_refcnt: c_ssize_t, ob_base: PyObject*, ob_fval: double]

#

in python2 integers mapped as you described, but that at runtime conversion caused slowdowns

#

and also caused the integers to have a finite size. Integers now have an infinite size and are on average much faster

grave jolt Feb 17, 2021, 3:32 PM

#

I'm not sure how you would implement integers as 'true primitives' given that they are subject to garbage collection

#

Well, they can't participate in reference cycles, but they still need to store the reference count

#

It's written in Python, and it's open-source https://github.com/python-discord/bot
However, questions about the server itself are better suited for #community-meta

spark magnet Feb 17, 2021, 3:55 PM

#

pliant tusk and also caused the integers to have a finite size. Integers now have an infinit...

Python 2 had unbounded ints (called longs, but you didn't need to care about the distinction). Conversion was automatic.

pliant tusk Feb 17, 2021, 3:56 PM

#

ah i inferred that they were just 64bit c long values cause they were called longs

spark magnet Feb 17, 2021, 3:58 PM

#

no, python 3's ints are exactly like python 2's longs.

visual shadow Feb 17, 2021, 3:59 PM

#

i will say, it is still amazing to me how we've created abstractions that are so easy to grasp when behind the scenes there's so much that goes into making them work

gleaming rover Feb 17, 2021, 4:02 PM

#

visual shadow i will say, it is still amazing to me how we've created abstractions that are so...

meanwhile in Haskell: String = [Char]

#

🥴

visual shadow Feb 17, 2021, 4:05 PM

#

im not familiar with haskell at all, am i correct in interpreting that as saying Strings are arrays of characters in haskell?

brave badger Feb 17, 2021, 4:06 PM

#

That's correct, yep

uncut sage Feb 17, 2021, 4:06 PM

#

When we create a process pool, is there any way for the processes in the pool to independently send messages back to the parent process? That is, apart from whatever comes back by way of apply or what have you?

grave jolt Feb 17, 2021, 4:08 PM

#

visual shadow im not familiar with haskell at all, am i correct in interpreting that as saying...

Well, it's a linked list of characters.
String is mostly used for low-performance things, and for real-world apps other string types are used

feral cedar Feb 17, 2021, 4:08 PM

#

there are more than one string type?

spark magnet Feb 17, 2021, 4:08 PM

#

uncut sage When we create a process pool, is there any way for the processes in the pool to...

You could pass a Queue to them, and they could put messages into the queue.

feral cedar Feb 17, 2021, 4:09 PM

#

i know rust has like 3 (?)

visual shadow Feb 17, 2021, 4:09 PM

#

does "bytes" count as a "string type"?

uncut sage Feb 17, 2021, 4:09 PM

#

spark magnet You could pass a Queue to them, and they could put messages into the queue.

that's true! I could always apply a set_up_queue function

visual shadow Feb 17, 2021, 4:09 PM

#

(in python context)

spark magnet Feb 17, 2021, 4:10 PM

#

yes, bytes has a lot of stringy methods on it

raven ridge Feb 17, 2021, 4:13 PM

#

bytes was called str in Python 2, in fact.

brave badger Feb 17, 2021, 4:14 PM

#

feral cedar there are more than one string type?

Text for Unicode and ByteString for performance, each coming with their own lazy and strict versions pydis_off_topeek

raven ridge Feb 17, 2021, 4:14 PM

#

The biggest breaking change in Python 3 was changing str from byte strings to Unicode strings

visual shadow Feb 17, 2021, 4:16 PM

#

yeah having read up on the differences between py2 and py3 while trying to understand unicode (oh look, someone here actually wrote what i read :P) it was a very important change

feral cedar Feb 17, 2021, 4:18 PM

#

there was a pretty good talk by nedbat 👀

raven ridge Feb 17, 2021, 4:19 PM

#

Yeah... I think it was a change for the better, because getting people to actually start using Unicode strings for themselves wasn't gonna happen when string literals remained arrays of bytes by default. But it also was where >90% of the Python 3 porting effort went, I'd say. Strings are everywhere in a program.

#

Python 2 also allowed Unicode strings to be silently converted to byte strings, and vice versa - unless they contain any non-ASCII codepoints / non-ASCII byte values. Which was a never ending source of bugs for things that worked when the developer tested them, but broke when real users used them with international text or filenames

spark magnet Feb 17, 2021, 4:48 PM

#

feral cedar there was a pretty good talk by nedbat 👀

https://bit.ly/unipain

Pragmatic Unicode

Unicode can be confusing. Here is a “just the facts I need” presentation about how to handle it correctly.

feral cedar Feb 17, 2021, 4:50 PM

#

it was very good

#

there's one called :heart_bean:

#

although i guess this is #community-meta now

cloud compass Feb 17, 2021, 4:55 PM

#

📎 705160575054905354.png

wide vector Feb 17, 2021, 7:10 PM

#

fresh cargo Another rather interesting Python string optimization ```py >>> hello = "h" >>> ...

Interesting, so appending a string will maintain its id... unless:

>>> hello = "h"
>>> goodbye = hello
>>> print(hello, id(hello))
h 2178279704944
>>> print(goodbye, id(goodbye))
h 2178279704944
>>> hello += "e"
>>> print(hello, id(hello))
he 2178288336560
>>> print(goodbye, id(goodbye))
h 2178279704944

wide vector Feb 17, 2021, 7:12 PM

#

raven ridge And that only works on `a += b`, not on `a = a + b`

why? The two operations are essentially identical

spark magnet Feb 17, 2021, 7:13 PM

#

It can be hard to tell if you are preserving the id, or just getting a new string made at the same address.

wide vector Feb 17, 2021, 7:13 PM

#

sorry for replying to old messages, I'm catching up with a convo from last night...

spark magnet Feb 17, 2021, 7:15 PM

#

@wide vector also, that example doesn't work for me.

#

(the original one showing the same id)

#

or maybe it does?

wide vector Feb 17, 2021, 7:16 PM

#

The one I just sent?

spark magnet Feb 17, 2021, 7:17 PM

#

Sorry, it does work for me.

wide vector Feb 17, 2021, 7:17 PM

#

It should, since hello and goodbye share the same reference until hello is modified

#

gotcha

#

Just read up from last night, super interesting how Python optimizes this stuff...

spark magnet Feb 17, 2021, 7:21 PM

#

wide vector why? The two operations are essentially identical

They are different because the "a+b" in "a = a + b" doesn't know about the "a=" part.

wide vector Feb 17, 2021, 7:23 PM

#

ah, just a different AST I guess

spark magnet Feb 17, 2021, 7:29 PM

#

yes, definitely a different AST, which produce different operations.

swift imp Feb 17, 2021, 7:37 PM

#

Why does every ast guide show how to make subclasses using the ast classes? Why do I have to subclass at all?

spark magnet Feb 17, 2021, 7:41 PM

#

link?

#

you definitely don't have to subclass

weary garden Feb 17, 2021, 7:53 PM

#

how often does Python language syntax change?

#

I assume most minor releases of CPython don't include syntax changes?

astral gazelle Feb 17, 2021, 7:58 PM

#

#❓｜how-to-get-help

spark magnet Feb 17, 2021, 7:59 PM

#

weary garden I assume most minor releases of CPython don't include syntax changes?

If minor means 3.4, 3.5, 3.6, then most of them have syntax changes

grave jolt Feb 17, 2021, 7:59 PM

#

this

#

3.5 introduced async/await, 3.6 introduced f-strings, 3.7 made async/await hard keywords, 3.8 introduced :=, 3.9 introduced relaxed decorator rules, 3.10 introduces patma and parenthesized with

weary garden Feb 17, 2021, 8:03 PM

#

I assume none of those changes were breaking changes?

grave jolt Feb 17, 2021, 8:03 PM

#

weary garden I assume none of those changes were breaking changes?

async/await becoming hard keywords is a breaking change

#

in 3.6 you could do async = 42, in 3.7 you can't

#

in general, Python doesn't follow semver

weary garden Feb 17, 2021, 8:04 PM

#

semver?

grave jolt Feb 17, 2021, 8:04 PM

#

semantic versioning

#

https://semver.org/

Semantic Versioning

Semantic Versioning 2.0.0

Semantic Versioning spec and website

unkempt rock Feb 17, 2021, 8:18 PM

#

g

limpid forum Feb 17, 2021, 8:28 PM

#

wide vector Interesting, so appending a string will maintain its id... unless: ```python >>>...

as for that "maintain its id"... I remember during one Python workshop I attended, we realised it also behaves differently different if you execute a saved code or if you write it in an interactive console. we had some exercise about id and one person did the other thing than the rest and they got different results 😄 I quickly tested both versions (console vs script) and I got the same difference.
but I don't remember what we were doing - it was over a year ago, late autumn 2019

spark magnet Feb 17, 2021, 8:30 PM

#

This is part of the subtlety: how the code gets compiled can change how objects are allocated and shared. This is why you almost never use "is".

weary garden Feb 17, 2021, 8:45 PM

#

"is"?

spark magnet Feb 17, 2021, 8:46 PM

#

weary garden "is"?

There's an operator in Python called is.

weary garden Feb 17, 2021, 8:47 PM

#

ah, to determine dynamic type?

spark magnet Feb 17, 2021, 8:47 PM

#

No, to check if two values are the same object.

weary garden Feb 17, 2021, 8:47 PM

#

I see.

#

I can't wait to start work on the implementation!

#

First goal: Python "Hello world!" by the end of the month.

#

only achievable if I can get neonumeral sorted by end of this week and libffi integrated

deep monolith Feb 17, 2021, 10:17 PM

#

anyone up for object oriented skill practical test ?

weary garden Feb 17, 2021, 10:18 PM

#

when using Python in interactive mode what is the significance of the three chevrons? ">>>"?

vale flax Feb 17, 2021, 10:18 PM

#

deep monolith anyone up for object oriented skill practical test ?

do your own homework

deep monolith Feb 17, 2021, 10:19 PM

#

dude its not a homework. i can do homework by my self 😋

gleaming rover Feb 17, 2021, 10:47 PM

#

deep monolith dude its not a homework. i can do homework by my self 😋

great!

gleaming rover Feb 17, 2021, 10:47 PM

#

spark magnet No, to check if two values are the same object.

why is is even a thing

#

like 99% of the time reference equality is only ever used for None

#

just wondering if there were any other concerns

unkempt rock Feb 17, 2021, 10:50 PM

#

Hey, im french and i've make a texte about some rules and i need someone for help and correct me, bc idk if all my sentences are english. If you want help me pls DM me !

red solar Feb 17, 2021, 10:56 PM

#

I'm a little confused by python3's float - the standard says it's an IEEE 754 floating point - C++ double and JS Number should be the same type

#

-12.0 % 5.0 in C++ and JS gives me -2.0

#

but in python3 it gives me 3.0?

#

and IEEE 754 fully defines floating point remainder as a standard operation, it's not just a recommended function

grave jolt Feb 17, 2021, 11:00 PM

#

weary garden when using Python in interactive mode what is the significance of the three chev...

To differntiate between the output and the input, I guess?

#

>>> 2 + 2
4
>>>

#

just like a prompt in the terminal

#

although that also shows the current directory by default

spark magnet Feb 17, 2021, 11:01 PM

#

gleaming rover why is `is` even a thing

It definitely shouldn't have been a keyword. sys.same_object(x, y) would have been fine.

grave jolt Feb 17, 2021, 11:02 PM

#

Or just comparing ids. It should work unless you're comparing immediately created objects (like id([]) == id([]))

#

and, well, I'd expect id to be in sys, but we had this discussion here a bit over a billion times

spark magnet Feb 17, 2021, 11:06 PM

#

yup, same old same old

feral cedar Feb 17, 2021, 11:11 PM

#

red solar but in python3 it gives me `3.0`?

if you do the math by hand you'll find that the answer is 3

red solar Feb 17, 2021, 11:11 PM

#

for an integer, yes

#

for an IEEE 754 float, no

feral cedar Feb 17, 2021, 11:11 PM

#

is there a difference?

red solar Feb 17, 2021, 11:12 PM

#

well... yes? one's positive, one's negative (sure they're the same value % n)

grave jolt Feb 17, 2021, 11:13 PM

#

@red solar https://en.wikipedia.org/wiki/Modulo_operation#In_programming_languages

#

wait, no, wrong link

#

no, it's right, I think

#

remainder is not the same as modulo, and different programming languages just choose different rules for modulo

red solar Feb 17, 2021, 11:16 PM

#

I would agree with you, except the documentation says that in python3 they're the same

#

x % y remainder of x / y

#

https://docs.python.org/3/library/stdtypes.html from the operations table

#

fmod(c/c++): Computes the floating-point remainder of the division operation x/y.

grave jolt Feb 17, 2021, 11:19 PM

#

I guess 'remainder' was used for familiarity?..

#

well, fmod operates differently from Python's %

red solar Feb 17, 2021, 11:20 PM

#

hmmm - i should look at the implementation

#

cuz i'm probably missing something

#

    mod = fmod(vx, wx);
    if (mod) {
        /* ensure the remainder has the same sign as the denominator */
        if ((wx < 0) != (mod < 0)) {
            mod += wx;
        }
    }

#

they literally use fmod, and the sign bit isn't documented as far as i can tell

grave jolt Feb 17, 2021, 11:27 PM

#

@red solar I meant that math.fmod has a different sign behaviour than % on floats.

#

i.e. math.fmod is the same as c/c++'s fmod

#

or I'm not sure

#

https://docs.python.org/3/library/math.html#math.fmod not guaranteed to be the same, it says

red solar Feb 17, 2021, 11:29 PM

#

Hmmm... ok it's nice that it says it somewhere, but ideally it should have a note of that in the operations table for floats

#

wonder how hard it is to submit a patch for the docs

flat gazelle Feb 17, 2021, 11:31 PM

#

It is because % on int must behave the same way, and the representative of the modulo class is conventionally the smallest positive member

gleaming rover Feb 17, 2021, 11:38 PM

#

grave jolt and, well, I'd expect `id` to be in `sys`, but we had this discussion here a bit...

something something unitTest

grave jolt Feb 17, 2021, 11:39 PM

#

gleaming rover something something `unitTest`

Yep, I've seen that. I've actually grepped all uses of id in CPython (that's how few there are).
Not having assertSameObject is a flaw on the unittest side, I'd say.

#

because it's a relatively common case in the stdlib

gleaming rover Feb 17, 2021, 11:39 PM

#

wait

grave jolt Feb 17, 2021, 11:39 PM

#

and they end up doing assertEqual(id(a), id(b))

gleaming rover Feb 17, 2021, 11:39 PM

#

oh

#

I was thinking assertTrue(a is b)

grave jolt Feb 17, 2021, 11:40 PM

#

then you don't get nice logs showing the difference

#

(which pytest does by default (using magic (yes, I do nest parentheses sometimes)) when you do assert a == b or assert a is b)

boreal umbra Feb 17, 2021, 11:43 PM

#

There's no assertIs?

grave jolt Feb 17, 2021, 11:44 PM

#

oh, there is

#

(no pun intended)

#

I don't know then

boreal umbra Feb 17, 2021, 11:46 PM

#

Also pytest good unittest bad

feral cedar Feb 17, 2021, 11:55 PM

#

if pytest is so good

#

why isn't it pyTest

spark magnet Feb 17, 2021, 11:57 PM

#

it used to be py.test

fallen turret Feb 18, 2021, 12:32 AM

#

what about nose2?

#

although, let's be honest, at least we aren't trying for anything like Chai.js

The less I see of those tests, the happier I'll be.

#

var expect = chai.expect;

expect(a).to.equal(b);

grave jolt Feb 18, 2021, 12:35 AM

#

expect.number(2).when.being.added.to(2).to.be.equal.to(4)

fallen turret Feb 18, 2021, 12:36 AM

#

driver.get('http://chaijs.com/');
expect('nav h1').dom.to.contain.text('Chai');
expect('#node .button').dom.to.have.style('float', 'left');

are you kidding me?

#

I mean, it makes sense, but holy cow do we really need to type that much?

spark magnet Feb 18, 2021, 12:41 AM

#

i'm not sure there's a reason to use nose2 over pytest. pytest is very powerful, and very actively maintained.

gleaming rover Feb 18, 2021, 12:42 AM

#

the only problem I have with pytest is (AFAIK) you can't use fixtures in test parameters

#

which is a bit 😔 but not a dealbreaker

raven ridge Feb 18, 2021, 2:26 AM

#

weary garden when using Python in interactive mode what is the significance of the three chev...

It's just a prompt - and the prompt changes to ... for continuation lines.

safe hedge Feb 18, 2021, 3:16 AM

#

You can also set it to whatever you want really using sys.ps1 right?

main crow Feb 18, 2021, 3:31 AM

#

hello

raven ridge Feb 18, 2021, 4:11 AM

#

safe hedge You can also set it to whatever you want really using `sys.ps1` right?

Apparently!

bronze acorn Feb 18, 2021, 4:15 AM

#

pypy is amazing

#

expect for the fact that it is not up-to-dated with python itself

#

latest version is 3.7, while python itself is 3.9

raven ridge Feb 18, 2021, 4:17 AM

#

It's nearly impossible for it to stay completely up to date with CPython. They can't start adding features introduced by a new version of CPython until after CPython decides what new features they're adding

#

The fact that it's only 15 months behind is pretty damn good, all things considered.

bronze acorn Feb 18, 2021, 4:21 AM

#

why it is not that popular though ?

raven ridge Feb 18, 2021, 4:23 AM

#

It's faster than CPython at running Python code, but slower than CPython at running C extension modules. Lots of important, performance sensitive things are written as C extension modules, so being slower at the things that need to be fast isn't great.

visual shadow Feb 18, 2021, 5:42 PM

#

as long as it adheres to the topic of the room

#

take a look at the channel description and pinned messages. tl;dr is this room is specifically for conversations about the python language itself

weary garden Feb 18, 2021, 9:39 PM

#

being able to start work on my Python implementation this weekend is starting to look promising as I have nearly sorted my multi-precision arithmetic library out: https://i.imgur.com/iiq3ONo.png \o/

Imgur

limpid marten Feb 18, 2021, 10:05 PM

#

@weary garden you're creating your own implementation?

weary garden Feb 18, 2021, 10:07 PM

#

yes

limpid marten Feb 18, 2021, 10:34 PM

#

Are you using the already existing parser?

#

Or are you re-doing everything?

weary garden Feb 18, 2021, 10:34 PM

#

I am creating it from scratch

limpid marten Feb 18, 2021, 10:34 PM

#

Sounds awesome

weary garden Feb 18, 2021, 10:35 PM

#

project website: https://neos.dev

neos ⚛ Universal Compiler and Bytecode JIT

neos Universal Compiler and Bytecode JIT

fallen slateBOT Feb 18, 2021, 11:36 PM

#

Hey @light condor!

Uh-oh! It looks like your message got zapped by our spam filter. We currently don't allow .txt attachments, so here are some tips to help you travel safely:

• If you attempted to send a message longer than 2000 characters, try shortening your message to fit within the character limit or use a pasting service (see below)

• If you tried to show someone your code, you can use codeblocks
(run !code-blocks in #bot-commands for more information) or use a pasting service like:

https://paste.pythondiscord.com

light condor Feb 18, 2021, 11:37 PM

#

!code-blocks

#

https://onlinegdb.com/9VO3uijHi

GDB online Debugger

Online GDB is online ide with compiler and debugger for C/C++. Code, Compiler, Run, Debug Share code nippets.

#

I cant figure out how to make this not the question!

#

pls help!

spark magnet Feb 18, 2021, 11:40 PM

#

@light condor what do you mean? "how to make this not the question"?

#

@light condor oh, you are asking in #general. it should be there

light condor Feb 18, 2021, 11:41 PM

#

sorry I mean i cant figure out how to make it not repeat the question if u get it wrong

fallen slateBOT Feb 18, 2021, 11:53 PM

#

Hey @eternal solstice!

Uh-oh! It looks like your message got zapped by our spam filter. We currently don't allow .txt attachments, so here are some tips to help you travel safely:

• If you attempted to send a message longer than 2000 characters, try shortening your message to fit within the character limit or use a pasting service (see below)

• If you tried to show someone your code, you can use codeblocks
(run !code-blocks in #bot-commands for more information) or use a pasting service like:

https://paste.pythondiscord.com

weary garden Feb 19, 2021, 12:21 AM

#

down to just 18 failing tests! \o/ my large signed multiply (base 16) works! https://i.imgur.com/i4heqTk.png all signed integer tests are passing; just need to sort out unsigned integers and what negative numbers mean for unsigned

Imgur

spark magnet Feb 19, 2021, 12:43 AM

#

I'm used to test output that's quiet for pass and loud for fail, so it's easier to tell what's going on.

weary garden Feb 19, 2021, 1:05 AM

#

big signed integer base 10 multiplication passes: https://imgur.com/a/ftuKT0L

Imgur

#

@spark magnet that will be the case when I move to gtest; for now I want to see everything

gleaming rover Feb 19, 2021, 1:07 AM

#

it's a great morning

#

to whine about Python's lambdas

grave jolt Feb 19, 2021, 1:07 AM

#

@weary garden You won't use your own testing framework? 🙂

gleaming rover Feb 19, 2021, 1:08 AM

#

okay, I don't actually know how the parser works, but in theory, would it be possible to have => lambdas?

weary garden Feb 19, 2021, 1:08 AM

#

no as that doesn't interest me

grave jolt Feb 19, 2021, 1:08 AM

#

gleaming rover okay, I don't actually know how the parser works, but *in theory*, would it be p...

I guess yes, that shouldn't break anything.

gleaming rover Feb 19, 2021, 1:08 AM

#

grave jolt I guess yes, that shouldn't break anything.

thank you

#

I'll expect the PR by close of business tomorrow

#

oh wait, it's Saturday

grave jolt Feb 19, 2021, 1:08 AM

#

😐

gleaming rover Feb 19, 2021, 1:09 AM

#

5 PM then

#

🥴

grave jolt Feb 19, 2021, 1:09 AM

#

@gleaming rover It's friday

gleaming rover Feb 19, 2021, 1:10 AM

#

grave jolt <@!171929073063297024> It's friday

I meant it'd be Saturday "tomorrow"

grave jolt Feb 19, 2021, 1:10 AM

#

no

#

For me it's 4am, Friday

gleaming rover Feb 19, 2021, 1:10 AM

#

yeah, I mean

grave jolt Feb 19, 2021, 1:10 AM

#

ah yes okay

gleaming rover Feb 19, 2021, 1:10 AM

#

gleaming rover I'll expect the PR by close of business tomorrow

when I said this the "tomorrow" would have been Saturday

grave jolt Feb 19, 2021, 1:10 AM

#

gotcha

raven ridge Feb 19, 2021, 1:24 AM

#

There's a suggestion on python-ideas right now to allow def in place of lambda

#

I think that'd be an improvement.

feral cedar Feb 19, 2021, 1:25 AM

#

hmm

#

that would be nice

raven ridge Feb 19, 2021, 1:25 AM

#

lst.sort(def k: k[0]) is a slight improvement over the current status quo.

feral cedar Feb 19, 2021, 1:25 AM

#

just replace that : with a -> and remove the def and you have nice looking lambdas !

gleaming rover Feb 19, 2021, 1:25 AM

#

🙂

raven ridge Feb 19, 2021, 1:25 AM

#

There's a lot more opposition to that.

gleaming rover Feb 19, 2021, 1:26 AM

#

😦

feral cedar Feb 19, 2021, 1:26 AM

#

😦

#

replace the colon with an arrow at least

gleaming rover Feb 19, 2021, 1:26 AM

#

the colon is so unintuitive

#

honestly I think it's worse than lambda for me

raven ridge Feb 19, 2021, 1:27 AM

#

https://mail.python.org/archives/list/python-ideas@python.org/thread/QQUS35S4IOV3CR2G3MZNT3RZ2F6TBE4G/ is the thread

feral cedar Feb 19, 2021, 1:27 AM

#

the arrow makes much more sense than the colon does

#

changing it to def makes it less annoying than lambda

raven ridge Feb 19, 2021, 1:29 AM

#

the best argument in that thread is "If Python didn't have anonymous functions and we were adding them for the first time now, there's no way we'd use lambda"

#

I think that's very, very true. And I think, if Python didn't have anonymous functions at all today and we were adding them for the first time, allowing def without a function name seems like a way we might add it.

grave jolt Feb 19, 2021, 1:32 AM

#

yeah

#

lambda seems out of place

#

I mean, it makes sense, because it's a lambda

#

but it feels like writing kleisli arrow into the async category f(): instead of async def f():

feral cedar Feb 19, 2021, 1:33 AM

#

anonymous_function x: x + 3

gleaming rover Feb 19, 2021, 1:34 AM

#

raven ridge the best argument in that thread is "If Python didn't have anonymous functions a...

if Python didn't have logging and we were adding it for the first time now...

#

I'm taking donations 🙂

grave jolt Feb 19, 2021, 1:35 AM

#

Python probably wouldn't have lambdas at all, I would guess?...

astral gazelle Feb 19, 2021, 1:35 AM

#

λ x : x[0]

#

Merge pls

gleaming rover Feb 19, 2021, 1:35 AM

#

like Haskell: \

grave jolt Feb 19, 2021, 1:35 AM

#

grave jolt Python probably wouldn't have lambdas at all, I would guess?...

Or rather-- if lambdas were proposed, they would cause controversy larger than patma

#

You're joking, but agda actually allows you to use λ

raven ridge Feb 19, 2021, 1:39 AM

#

the Python logging module isn't just problematic because of the camelcase - it's architecture, and the fact that it's implemented in Python, make it pretty slow

#

which is the motivation for https://github.com/brmmm3/fastlogging

grave jolt Feb 19, 2021, 1:48 AM

#

raven ridge which is the motivation for <https://github.com/brmmm3/fastlogging>

This doesn't seem like a good way to install it 🤔

#

or anything

raven ridge Feb 19, 2021, 1:52 AM

#

the latter is fine, right? It's basically what pip install . would do.

grave jolt Feb 19, 2021, 1:52 AM

#

right, but that's just for development

#

not if you want to use it as a dependency

raven ridge Feb 19, 2021, 1:53 AM

#

well, sure - there's pip install fastlogging for that, heh

#

it is in PyPI

#

though the PyPI description reuses the github readme, heh

#

https://github.com/brmmm3/fastlogging/blob/master/setup.py#L105 - lol

unkempt rock Feb 19, 2021, 2:18 AM

#

Why does Python have so many ways to format strings? You can use .format(), f-strings, etc. It's kinda contradict with "There should be one - and preferably one obvious way to do it" rule for me.

feral cedar Feb 19, 2021, 2:18 AM

#

legacy code

raven ridge Feb 19, 2021, 2:18 AM

#

Because it's an important thing that almost every program needs to do, and people kept coming up with better ways to do it.

#

it's not as though only f-strings would exist if there were only one way to do it - instead, only the old % style formatting would exist, and people wouldn't be any happier.

unkempt rock Feb 19, 2021, 2:19 AM

#

Oh

spark magnet Feb 19, 2021, 2:20 AM

#

@unkempt rock Python is 30 years old. new ideas come up.

unkempt rock Feb 19, 2021, 2:20 AM

#

Ok

#

I've been wondering that for quite a bit of time

#

Thanks yall for the clarification. Really appreciate it!

grave jolt Feb 19, 2021, 2:24 AM

#

unkempt rock Why does Python have so many ways to format strings? You can use `.format()`, f-...

!zen only one

fallen slateBOT Feb 19, 2021, 2:24 AM

#

The Zen of Python (line 12):

There should be one-- and preferably only one --obvious way to do it.

grave jolt Feb 19, 2021, 2:25 AM

#

There's a continuation:

#

!zen 13

fallen slateBOT Feb 19, 2021, 2:25 AM

#

The Zen of Python (line 13):

Although that way may not be obvious at first unless you're Dutch.

grave jolt Feb 19, 2021, 2:25 AM

#

So I guess nobody is Dutch enough to get string formatting right.

unkempt rock Feb 19, 2021, 2:26 AM

#

Yeah. Maybe except for Guido himself xD

grave jolt Feb 19, 2021, 2:27 AM

#

I'm not sure if the only-one-way is always the best approach

#

Sometimes it's like a distribution: there's one most-common way, and there are other, more niche ways.

#

Most of the times you want f-strings. More rarely, you'll want .format because you can dynamically choose the template. In very specific circumstances you'll need a different thing altogether -- you don't want to format URLs or SQL queries yourself; and sometimes you might want to use a templating engine.

sacred yew Feb 19, 2021, 2:29 AM

#

grave jolt Most of the times you want f-strings. More rarely, you'll want `.format` because...

when would you use template strings https://docs.python.org/3/library/string.html#template-strings 😛

grave jolt Feb 19, 2021, 2:29 AM

#

I actually used that once

unkempt rock Feb 19, 2021, 2:30 AM

#

Yeah
It depends on the situation to choose what to use, doesn't it?

unkempt rock Feb 19, 2021, 2:30 AM

#

sacred yew when would you use template strings https://docs.python.org/3/library/string.htm...

What is that 👀

grave jolt Feb 19, 2021, 2:30 AM

#

I guess the one-way mentality is about not having two competing ways, where one doesn't supersede another.

limpid marten Feb 19, 2021, 2:30 AM

#

gleaming rover to whine about Python's lambdas

Python lambdas do be quite lame :(

grave jolt Feb 19, 2021, 2:31 AM

#

sacred yew when would you use template strings https://docs.python.org/3/library/string.htm...

I used it for substituting stuff when generating docs: https://github.com/decorator-factory/python-fnl/blob/master/fnl/docs/__main__.py#L16
basically, it's an ad-hoc documentation generator for my toy markup language.

#

and the documentation is written in the language itself.

#

Kind of like a bootstrapping compiler.

sacred yew Feb 19, 2021, 2:32 AM

#

hmm why templates over str.format?

grave jolt Feb 19, 2021, 2:33 AM

#

sacred yew hmm why templates over str.format?

Because {...} is part of valid CSS. Although later I moved it to a separate file.

#

so I guess now it's just inertia lol

raven ridge Feb 19, 2021, 2:34 AM

#

template strings are safer in the presence of untrusted user input. That's a good reason to use them.

grave jolt Feb 19, 2021, 2:34 AM

#

Why are they safer?

sacred yew Feb 19, 2021, 2:36 AM

#

str.format lets you access attrs, so there could be unwanted leaks?

raven ridge Feb 19, 2021, 2:39 AM

#

it's safe to use untrusted user input as a template string, but isn't safe to use it as a format string. The latter leads to something called a format string injection attack.

unkempt rock Feb 19, 2021, 3:52 AM

#

what is best to use?

#

%, concatenation, .format(), or f-strings?

feral cedar Feb 19, 2021, 3:54 AM

#

generally format() and fstrings are better than % and concatenating

raven ridge Feb 19, 2021, 3:57 AM

#

f-strings should be preferred where they can be used.

unkempt rock Feb 19, 2021, 4:00 AM

#

raven ridge f-strings should be preferred where they can be used.

ok

solar delta Feb 19, 2021, 4:08 AM

#

hi guys! any1 home?

#

I am trying to deploy a django app but im having trouble 😦

#

gunicorn works when i use it inline with bash command: gunicorn -b /path/to/projectname/projectname.sock projectname.wsgi:application

#

but not when running on systemd service

#

also the systemd service says its running fine 😦

gleaming rover Feb 19, 2021, 4:10 AM

#

solar delta hi guys! any1 home?

this is a channel for discussion of the language.

#

try #web-development or #❓｜how-to-get-help

solar delta Feb 19, 2021, 4:11 AM

#

gunicorn is actually a python specific deployment thing but sure.

gleaming rover Feb 19, 2021, 4:11 AM

#

solar delta gunicorn is actually a python specific deployment thing but sure.

discussion of the language itself

#

i.e. where it's going, the pros/cons of its syntactic choices, differences between various implementations, etc.

limpid marten Feb 19, 2021, 5:31 AM

#

I was investigating using the socketserver package, and noticed something a bit odd
https://docs.python.org/3/library/socketserver.html#socketserver.BaseRequestHandler.handle

for datagram services, self.request is a pair of string and socket.
I don't quite get why it's a string and not bytes?

#

Ah, the documentation is wrong, it is actually bytes.

#

https://github.com/python/cpython/blob/b5711c940f70af89f2b4cf081a3fcd83924f3ae7/Lib/socketserver.py#L527

raven ridge Feb 19, 2021, 5:50 AM

#

limpid marten Ah, the documentation is wrong, it is actually bytes.

Probably missed in the Python 3 transition, when str was renamed to bytes. You should submit a PR to fix it!

limpid marten Feb 19, 2021, 5:50 AM

#

Great idea!

deft spruce Feb 19, 2021, 7:48 AM

#

might be a dumb question but

#

are functions even useful

#

im learning about them rn and idk what I would use them for

#

😂😂

#

hopefully functions aren’t too important later on😅

#

i always have trouble with them

raven ridge Feb 19, 2021, 8:08 AM

#

deft spruce hopefully functions aren’t too important later on😅

They're very important. As you get further into programming, you want to make bigger and better programs. You'll learn all of the major building blocks in only a few months, and after that everything is about how to combine those blocks together to make bigger and better and more interesting and useful programs. Functions are an incredible important tool for packaging up some little piece of functionality and wrapping a bow on it and giving it a name, so that you can reuse it elsewhere, and build future things that rely on that functionality without needing to copy and paste it or rewrite it.

deft spruce Feb 19, 2021, 8:11 AM

#

Oh wow

#

I understand, I just need to practice, but parameters, returns, and calling functions just don’t match up with my brain

raven ridge Feb 19, 2021, 8:12 AM

#

You'll get it. Like everything else, it takes time and practice to sink in.

deft spruce Feb 19, 2021, 8:12 AM

#

Yeah

#

I’ve tried learning it 3x over

#

Idk why but its the only thing I struggle with

#

Hopefully I can get it right by tomorrow

raven ridge Feb 19, 2021, 8:16 AM

#

Trying to build a program without functions is like trying to build a house without boards. It's possible, you can do everything yourself with some trees, an axe, and a saw, but it's much harder, and the end product won't be as good.

deft spruce Feb 19, 2021, 8:16 AM

#

I see

raven ridge Feb 19, 2021, 8:16 AM

#

The boards are like functions that someone has put in the time to get nice and reusable, and leveraging them lets you do more, better, faster than you could do without them.

deft spruce Feb 19, 2021, 8:21 AM

#

Makes sense a little

prime vine Feb 19, 2021, 9:01 AM

#

What are the main issues on Python ? Like a memory leak in Nodejs

supple venture Feb 19, 2021, 10:10 AM

#

@prime vine Depends on what you're using Python for and what you're comparing it to.

quaint mango Feb 19, 2021, 10:26 AM

#

deft spruce i always have trouble with them

For anything but the absolute simplest code, functions are essential. You would have used functions already, for example len() to get the length of a list.

grave jolt Feb 19, 2021, 12:04 PM

#

@deft spruce Functions are probably the single most useful tool you'll ever learn in programming.

cosmic edge Feb 19, 2021, 4:29 PM

#

is there any library to differentiate qr code engraved on a metal surface?

sacred yew Feb 19, 2021, 5:53 PM

#

this isn't a help channel #❓｜how-to-get-help @cosmic edge

prime vine Feb 20, 2021, 3:35 AM

#

supple venture <@!443380382633558027> Depends on what you're using Python for and what you're c...

For example a simple flask api vs express api.

#

And memory leak in Nodejs is kind of a general issue. It appears in every type of applications.

#

I wish I knew memory leaks in nodejs before prod. It's really hard to debug those things and it always appears in prod.

#

I am trying to learn Python, so I am preparing myself for something I should know before going all out.

silk pawn Feb 20, 2021, 3:42 AM

#

does the python interpreter do constant folding for string concatenation and if so, does anyone know how it does the folding? I was reading an interesting blog post on the accidentally quadratic blog on tumblr that was by someone who fixed constant folding of string concatenation taking O(n²) space instead of O(n) space in C# and VB.net and I wondered if Python was doing that also:
https://accidentallyquadratic.tumblr.com/post/187336866277/accidentally-quadratic-constant-folding

#

(asked here because I believe it's relevant to python internals)

prime vine Feb 20, 2021, 3:46 AM

#

silk pawn does the python interpreter do constant folding for string concatenation and if ...

Nice. But how did he implement the Ropes technique?

silk pawn Feb 20, 2021, 3:47 AM

#

not sure tbh

#

if C# is open source then one could browse through the old PRs and figure it out though

#

I would if I wasn't on my phone

prime vine Feb 20, 2021, 3:51 AM

#

https://www.google.com/amp/s/www.geeksforgeeks.org/ropes-data-structure-fast-string-concatenation/amp/

GeeksforGeeks

Ropes Data Structure (Fast String Concatenation) - GeeksforGeeks

A Computer Science portal for geeks. It contains well written, well thought and well explained computer science and programming articles, quizzes and practice/competitive programming/company interview Questions.

silk pawn Feb 20, 2021, 3:52 AM

#

I'd think the part of Python that deals with string concatenation is written in C, but idk

prime vine Feb 20, 2021, 3:53 AM

#

There are cons which not worth the pros.

silk pawn Feb 20, 2021, 3:53 AM

#

like?

#

ah ok I got to that point in the article

prime vine Feb 20, 2021, 3:53 AM

#

Look at the link.

silk pawn Feb 20, 2021, 3:54 AM

#

pretty sure the pros outweigh the cons in this case

prime vine Feb 20, 2021, 3:54 AM

#

There's a Python example. Wow

silk pawn Feb 20, 2021, 3:55 AM

#

only for normal string concatenation, not for ropes

#

only c++ for ropes it seems

prime vine Feb 20, 2021, 3:56 AM

#

I see

#

Maybe you can implement his c++ pseudo code to Python

silk pawn Feb 20, 2021, 3:56 AM

#

maybe, but I have a hunch that I'd need to implement that c++ code in normal C

#

would need to check the source code for this though

prime vine Feb 20, 2021, 4:03 AM

#

With this implementation, you are giving away cpu over memory usage.

#

Right?

#

What's your use case for that?

silk pawn Feb 20, 2021, 4:04 AM

#

It doesn't seem to be that much memory, and this would be done when making the bytecode, which we generally want done quickly iirc

prime vine Feb 20, 2021, 4:05 AM

#

I see.

silk pawn Feb 20, 2021, 4:05 AM

#

if you want to profile it I'd be happy to help analyze the results

prime vine Feb 20, 2021, 4:05 AM

#

But are you planning to implement it to your source code or the compiler itself?

silk pawn Feb 20, 2021, 4:09 AM

#

there's no python compiler per se, just an interpreter (not totally sure if I'm using the terms right feel free to correct me)
I was planning to implement it so that it'd be used by python to make the bytecode and stuff, so definitely not my source code but rather in the python repository

#

but all of that is moot if a core dev or someone who knows better how this string concatenation stuff works says it's not worth implementing

#

it seems it's best for larger strings that are modified more frequently anyway, so there are definitely rooms for different use cases

amber nexus Feb 20, 2021, 4:26 AM

#

Has there ever been a suggestion to specify the type of the variable a for-loop iteration is given to?

#

So that you can do conversions for example with less work and enforce the type

#

for variable: int in some_string_of_numbers:
    ...
```For example

#

it would attempt to convert the value to an int before assigning it to the variable

#

to prevent having to do something like ```py
for variable in some_string_of_numbers:
variable = int(variable)

native flame Feb 20, 2021, 4:29 AM

#

what about

for variable in map(int, some_string_of_numbers):

amber nexus Feb 20, 2021, 4:31 AM

#

Fair, it would probably be more syntactic sugar than anything else

prime vine Feb 20, 2021, 5:00 AM

#

It's interpreter my bad.

#

But yes, this technique has been there for a long time

#

Someone could have Merged it if it's worth implementing.

raven ridge Feb 20, 2021, 5:48 AM

#

silk pawn does the python interpreter do constant folding for string concatenation and if ...

It does constant folding through the AST optimizer. https://github.com/python/cpython/pull/2858

supple venture Feb 20, 2021, 8:26 AM

#

prime vine And memory leak in Nodejs is kind of a general issue. It appears in every type o...

I'm not an expert on nodejs, but as far as I'm aware JS doesn't have any way of implementing RAII patterns. And I'd claim that without those, you just cannot do resource management in a sensible way. C++ has constructors and destructors, python has context managers & with-blocks to deal with that.

gleaming rover Feb 20, 2021, 8:38 AM

#

supple venture I'm not an expert on nodejs, but as far as I'm aware JS doesn't have any way of ...

default to try-finally?

supple venture Feb 20, 2021, 9:59 AM

#

Try-finally is the only option JS has for management, but it simply isn't in the same ballpark as RAII when you are looking for encapsulation and composition of management logic.

grave jolt Feb 20, 2021, 10:08 AM

#

@supple venture tbh, try/finally or with statements get nested and messy pretty quickly if you have several resources.

#

Like, a connection pool, a connection and a cursor

supple venture Feb 20, 2021, 10:10 AM

#

Thats where ExitStack comes in.

#

Or move semantics in cpp

grave jolt Feb 20, 2021, 10:13 AM

#

I guess you could do

class ExitStack:
    def __init__(self, *callbacks):
        self.callbacks = callbacks

    def __del__(self):
        for cb in self.callbacks:
            cb()

and it would work in CPython, but it's a hack

supple venture Feb 20, 2021, 10:16 AM

#

I meant contextlib.ExitStack and contextlib.AsyncExitStack. Those are the core python solutions for situations where there is a dynamic set of resources you need to manage. Exactly for those you mentioned: connections, cursors, etc.

#

The point I was trying to make is: Languages like Python and C++ have first class concepts for defining managed resources, and for dealing with them as a consumer. JS doesn't. Thus I'd bluntly claim that @prime vine won't see the same level of memory leaks in python compared to nodejs.

grave jolt Feb 20, 2021, 10:39 AM

#

supple venture I meant `contextlib.ExitStack` and `contextlib.AsyncExitStack`. Those are the co...

huh, I didn't know these things existed, thanks

weary garden Feb 20, 2021, 3:27 PM

#

Test cases for my arbitrary precision arithmetic library floating point implementation are correctly failing as my test result has a higher precision than the machine floating point type (double): that's a win! \o/ https://i.imgur.com/J9iCMxp.png

silk pawn Feb 20, 2021, 4:52 PM

#

raven ridge It does constant folding through the AST optimizer. <https://github.com/python/c...

thanks!

raven ridge Feb 20, 2021, 5:34 PM

#

weary garden Test cases for my arbitrary precision arithmetic library floating point implemen...

How does it work? Is it binary floating point or decimal floating point? If binary, is the user in control of how many mantissa bits and how many exponent bits are used, rather than always using the 53/11 of an IEEE 754 binary64?

weary garden Feb 20, 2021, 5:35 PM

#

@raven ridge it is binary floating point with support for dynamic mantissa size

raven ridge Feb 20, 2021, 5:37 PM

#

Does the user choose the mantissa size, or is it automatically adjusted?

weary garden Feb 20, 2021, 5:37 PM

#

both

#

it is a template; you can specify a fixed size mantissa or it can be dynamic based on argument precision for operations

raven ridge Feb 20, 2021, 5:38 PM

#

That seems rather strange to me. That gives you two ways of adjusting the precision: either moving the floating point, or adding or subtracting significand bits. How do you automatically decide when to do which?

weary garden Feb 20, 2021, 5:39 PM

#

for an operation I decide on the appropriate mantissa size based on the precision of arguments unless a fixed size mantissa is specified.

#

it is implemented in terms of 64-bit words; I checked it works with an IEEE 754 128-bit implementation although I need to source some unit tests for high precision tests

raven ridge Feb 20, 2021, 5:46 PM

#

I don't think that really answers my question... I can understand how you can choose in advance a number of mantissa bits so the operation doesn't lose precision, but then it seems like what you'd have is a fixed point type, not floating. I'm not seeing how you choose between these two different ways of adjusting the precision - floating the point vs adding more mantissa bits.

weary garden Feb 20, 2021, 5:47 PM

#

there aren't two different ways

#

if a fixed size mantissa is specified then that is simply a hard limit on how much the mantissa can grow during an operation; there is only one algorithm though

raven ridge Feb 20, 2021, 5:51 PM

#

Ok, sure. But if the mantissa is allowed to grow then you have two choices of how to handle multiplying by two: you can make the mantissa one bit longer and set the extra bit to zero, or you can increment the exponent by 1. When do you do each of those?

weary garden Feb 20, 2021, 5:52 PM

#

mantissas only grow by word-size (64 bits)

#

i.e. mantissa size is a multiple of 64 bits

#

obviously I also have a separate exponent and mantissa normalisation similar to how IEEE 754 works

raven ridge Feb 20, 2021, 5:55 PM

#

Ok, so if you're going to multiply by 2**64, you have two choices for how to represent that: increasing the exponent, or adding another word of mantissa bits set to 64 zeroes. How do you choose which to do?

weary garden Feb 20, 2021, 5:57 PM

#

increasing precision of mantissa does not change the exponent

#

I guess it could do, might be something to think about during optimization phase which I will start in earnest once all my tests pass

raven ridge Feb 20, 2021, 5:59 PM

#

It would if you added new LSBs, or shifted after adding the bits. i guess you're instead adding new MSBs and setting them to 0?

weary garden Feb 20, 2021, 6:00 PM

#

no, the exponent doesn't change if you add LSBs, it would if you add MSBs which might be an optimization to do (see previous comment)

#

but I tend to renormalize after every operation anyway

#

also, this is still a work-in-progress so may still have bugs and isn't yet fast enough

#

currently using karatsuba for multiplication.

#

i also have repetend support in the mantissa

raven ridge Feb 20, 2021, 6:04 PM

#

the number grows if you add LSBs - adding a new least-significant zero is multiplying by two, just like how in base 10 you can multiply by 10 by adding a 0 at the end of a number

weary garden Feb 20, 2021, 6:05 PM

#

no, adding LSBs simply makes the number more precise, it doesn't change the exponent

raven ridge Feb 20, 2021, 6:05 PM

#

it doesn't change the exponent, but it makes a bigger number

weary garden Feb 20, 2021, 6:05 PM

#

no it don't

raven ridge Feb 20, 2021, 6:05 PM

#

unless you also modify the exponent

weary garden Feb 20, 2021, 6:06 PM

#

1.xxxxxxxxxxxxxxxxxxxyz // y is an LSB, adding z doesn't make the exponent bigger

#

(for a normalized mantissa)

raven ridge Feb 20, 2021, 6:08 PM

#

taking this to base 10, if you've got the number 50 already, and you're representing that as 5 * 10e0, you can multiply that by 10 to get 500 in two different ways. You can either add a digit to the end of the significand, making it 50 * 10e0, or you can increase the exponent, making it 5 * 10e1.

weary garden Feb 20, 2021, 6:10 PM

#

that isn't what we are talking about, we are talking about changing precision for a non-fixed size mantissa.

raven ridge Feb 20, 2021, 6:11 PM

#

well, it's what I'm trying to talk about, heh - you've got two different ways to represent a change to a different power-of-two as the multiplier - you can choose to modify the exponent, or you can choose to modify the mantissa (possibly by adding extra LSBs set to 0, in the case of multiplication by a power of two)

#

and I'm trying to understand when each of those approaches is taken - when an operation results in the exponent changing, and when it results in the exponent staying the same but the mantissa growing more or less precise.

weary garden Feb 20, 2021, 6:14 PM

#

those two things are orthogonal. adding extra fractional places to the end of a mantissa to support a higher precision result of the subsequent operation is unrelated to normal normal/sub-normal exponent based binary floating point math.

arctic peak Feb 20, 2021, 6:16 PM

#

godlygeek you work with?

raven ridge Feb 20, 2021, 6:16 PM

#

Work with who/what?

limpid marten Feb 20, 2021, 6:56 PM

#

I believe there are tools that exist that simplify floating point expressions to minimize floating point error, but I think in general if you'd want to always guarantee correct representation you'd have to have some analysis on the error propagation on your floating point operations?

mystic tangle Feb 20, 2021, 6:57 PM

#

Hello, guys. I want to ask about FORTRAN. Somebody knows how to embed code from fortran to python ? and is it make a sence? Just i have discipline related with fortran and computional math where we are solves equation via that lang... Also I heard that such lang can adopted to GPU that to make pararell computations)

limpid marten Feb 20, 2021, 6:57 PM

#

mystic tangle Hello, guys. I want to ask about FORTRAN. Somebody knows how to embed code from ...

Yes this is certainly possible with C FFI.

sacred yew Feb 20, 2021, 6:59 PM

#

numpy has f2py

#

sounds like what you want?

mystic tangle Feb 20, 2021, 7:01 PM

#

Thanks) also me interested how faster fortran for CUDA than CuPy? did somebody comparisons?

limpid marten Feb 20, 2021, 8:22 PM

#

Is there something particular about the C API that changes between versions that makes things that rely on C extensions not work between varying versions of Python 3.*?

#

I've noticed that there are some packages that work for a specific Python version, say 3.6, and don't work for versions 3.7+.

spark magnet Feb 20, 2021, 8:32 PM

#

Do you have some examples? What doesn't work?

limpid marten Feb 20, 2021, 8:34 PM

#

I think tensorflow is a popular example for this.
https://stackoverflow.com/questions/52584907/how-to-downgrade-python-from-3-7-to-3-6

raven ridge Feb 20, 2021, 8:39 PM

#

It's not really that they don't work on the new version, it's that the C API doesn't make ABI stability guarantees across minor versions, so they need to be recompiled for each new version. The same source code works, but it needs to be compiled again, and the maintainer of the package hasn't yet done that, or hasn't yet uploaded the new build to PyPI

#

there is now a limited version of the C API that makes ABI stability guarantees, so if an extension chooses to use only that subset it's possible to avoid needing to compile again for each new version.

limpid marten Feb 20, 2021, 8:41 PM

#

Ah that is neat, thank you for explaining it to me.

spark magnet Feb 20, 2021, 8:44 PM

#

@raven ridge i don't understand why these projects don't build wheels. It's not hard: https://pypi.org/project/coverage/#files

PyPI

coverage

Code coverage measurement for Python

raven ridge Feb 20, 2021, 8:44 PM

#

limpid marten I believe there are tools that exist that simplify floating point expressions to...

https://docs.python.org/3/library/math.html#math.fsum is an example of an iterated sum algorithm that tries to minimize floating point error.

limpid marten Feb 20, 2021, 8:46 PM

#

Wow! I didn't know the math library had this.

raven ridge Feb 20, 2021, 8:47 PM

#

spark magnet <@!451976922361102357> i don't understand why these projects don't build wheels....

I think lots of projects don't automate it, and upload them by hand. Which requires manual work from the maintainer on each new release. And is often dependent on them actually testing with the new version, and that requires that all of their dependencies have been packaged for the new version, recursively...

spark magnet Feb 20, 2021, 8:48 PM

#

@raven ridge i should hope tensorflow has the resources to automate this. seriously, it's not that hard.

raven ridge Feb 20, 2021, 8:49 PM

#

Yeah. I'm surprised at how long it takes before the ecosystem stabilizes after a new interpreter version is released, but it's not so surprising considering that it requires a lot of different actors who don't work in concert.

raven ridge Feb 20, 2021, 9:12 PM

#

more things starting to use the limited ABI would help.

tidal marten Feb 21, 2021, 4:35 AM

#

If I build a package using pep517, do I need to ship both the wheel and tar.gz or only the tar.gz is necessary?

raven ridge Feb 21, 2021, 5:17 AM

#

tidal marten If I build a package using pep517, do I need to ship both the wheel and tar.gz o...

Only the .tar.gz is strictly required, but it's preferable to ship the .whl as well. They're faster for your users to install, and safer in that they don't run untrusted code at install time

unkempt rock Feb 21, 2021, 7:21 AM

#

#python-discussion message why?

bronze acorn Feb 21, 2021, 7:29 AM

#

anyone read this book? does it give solid idea about ML and how to use it

#

it is specifically for python

severe hinge Feb 21, 2021, 10:09 AM

#

bronze acorn anyone read this book? does it give solid idea about ML and how to use it

No

bronze acorn Feb 21, 2021, 10:11 AM

#

no what?

#

you read it or no?

sacred yew Feb 21, 2021, 10:43 AM

#

wrong channel

#

#data-science-and-ml ?

severe hinge Feb 21, 2021, 10:57 AM

#

I don't read this book

granite bough Feb 21, 2021, 12:51 PM

#

Any one here good with web scraping, I need some help

visual shadow Feb 21, 2021, 1:55 PM

#

wrong room, this is about advanced discussion on the python language itself. try #❓｜how-to-get-help

strange heath Feb 21, 2021, 5:31 PM

#

bronze acorn anyone read this book? does it give solid idea about ML and how to use it

#data-science-and-ml

eternal root Feb 21, 2021, 6:54 PM

#

You guys think I can achieve something like this

#

Let's say I have a API that uploads a video to a servee

#

I also have a python script that checks if a file is uploaded or some change occured

#

Then it takes the video file that is uploaded

#

Does something let's say with FFMpeg

#

And upload it to the server again

#

And delete the old file

#

I know I need to know somethings about asynchronous programming

#

But is there a other way

#

I am making something like youtube

#

Now am in backend

undone hare Feb 21, 2021, 7:04 PM

#

Hello @eternal root, you should ask in #web-development

eternal root Feb 21, 2021, 7:05 PM

#

I don't think it's related to web development. It's more like server side processing

undone hare Feb 21, 2021, 7:06 PM

#

Hmm I think that would fit in the channel topic

strange heath Feb 21, 2021, 7:09 PM

#

It sounds like #web-development

#

this is more discussion of syntax in python

radiant garden Feb 22, 2021, 12:44 AM

#

I've had a look at pep 646 (variadic generics), and one thing struck as interesting.

Could the type unpack syntax Generic[*Ts] also be used to define fixed-length tuples generic over N? So something like tuple[*[T]*N] for a homogenous tuple of length N.

From my toying around with PyRight's typing_extensions, it looks like Unpack[Ts] (3.9 syntax compatible type unpacking) only supports TypeVarTuple, and not any unpackable sequence.

weary garden Feb 22, 2021, 1:58 AM

#

I am thinking I can get away with a fixed sized exponent.

spark magnet Feb 22, 2021, 2:10 AM

#

we're looking forward to hearing about the Python implementation

strange heath Feb 22, 2021, 2:11 AM

#

me looking at this even tho i don't even know advanced python

#

what is #internals-and-peps even for

astral gazelle Feb 22, 2021, 2:12 AM

#

Its for the python language itself, not necessarily about advanced programming concepts

strange heath Feb 22, 2021, 2:12 AM

#

uhhhhh ok

raven ridge Feb 22, 2021, 2:14 AM

#

How the language works, why they chose one feature over another feature, the differences between different implementations of Python, etc.

strange heath Feb 22, 2021, 2:15 AM

#

okie

#

i'm never gonna use this channel anyways so

weary garden Feb 22, 2021, 2:16 AM

#

@spark magnet I can't start on the Python implementation until I have sorted out my multi-precision math library.

strange heath Feb 22, 2021, 2:18 AM

#

your what now

#

forget i asked lmao

weary garden Feb 22, 2021, 2:20 AM

#

I am making a Python implementation

lone grove Feb 22, 2021, 2:26 AM

#

raven ridge How the language works, why they chose one feature over another feature, the dif...

Why not language-meta as a name?

raven ridge Feb 22, 2021, 2:29 AM

#

We tried a lot of different names, and there's not much appetite for trying more, but you can suggest one in #community-meta if you'd like

lone grove Feb 22, 2021, 2:30 AM

#

Oki

terse pivot Feb 22, 2021, 5:44 AM

#

@eternal root u will need something more og a VCS graph to do that, that only replaces if u get that new data. FFMPEG can read URL resources.

oak cedar Feb 22, 2021, 9:55 AM

#

Unlimited cloud storage using py,c and fvid

unkempt rock Feb 22, 2021, 2:11 PM

#

how to make requirements.txt from a ,py

#

pip freeze gives all modules name