haughty mountain Nov 20, 2022, 9:54 PM

#

but yes, the cyan arrows I drew were the two elements that decided what the value would be

fiery cosmos Nov 20, 2022, 9:55 PM

#

alright i think i have some pseudocode i can follow for an input set, now i wonder how to construct the set returner

#

i'll try to write some pseudo

opal oriole Nov 20, 2022, 9:57 PM

#

Btw, try not redefining built-ins like sum. Use a different name.

haughty mountain Nov 20, 2022, 9:57 PM

#

fiery cosmos Nov 20, 2022, 9:58 PM

#

opal oriole Btw, try not redefining built-ins like sum. Use a different name.

its pseudocode, won't matter. but i got you

opal oriole Nov 20, 2022, 9:58 PM

#

!e ```py
print(sum([1, 2, 3]))
sum = 0
print(sum([1, 2, 3]))

halcyon plankBOT Nov 20, 2022, 9:58 PM

#

@opal oriole :x: Your 3.11 eval job has completed with return code 1.

001 | 6
002 | Traceback (most recent call last):
003 |   File "<string>", line 3, in <module>
004 | TypeError: 'int' object is not callable

opal oriole Nov 20, 2022, 9:58 PM

#

Or you might run into this.

#

Python lets you do this, but I don't recommend it, it's like redefining things in C with macros.

haughty mountain Nov 20, 2022, 10:00 PM

#

#define private public

fiery cosmos Nov 20, 2022, 10:01 PM

#

can i use the same for j in range loop near the end to construct the sets?

haughty mountain Nov 20, 2022, 10:01 PM

#

other abuse of the preprocessor that I kinda helped with https://codeforces.com/blog/entry/77480

opal oriole Nov 20, 2022, 10:01 PM

#

fiery cosmos can i use the same for j in range loop near the end to construct the sets?

The var in the for is local to the for, so it's not being reused.

#

So one j in a loop is the not the same j in another, unless they are nested.

haughty mountain Nov 20, 2022, 10:02 PM

#

opal oriole The var in the for is local to the for, so it's not being reused.

that's not exactly true for python

opal oriole Nov 20, 2022, 10:02 PM

#

haughty mountain that's not exactly true for python

Yeah I think you can get away with it outside right?

haughty mountain Nov 20, 2022, 10:02 PM

#

python scoping is weird

opal oriole Nov 20, 2022, 10:03 PM

#

It's just not done normally, because that would be strange.

haughty mountain Nov 20, 2022, 10:03 PM

#

yes, but you can totally do it

opal oriole Nov 20, 2022, 10:03 PM

#

Like var i = 0 in a for in javascript?

haughty mountain Nov 20, 2022, 10:04 PM

#

I much prefer more restrictive scoping

#

it avoids dumb mistakes

opal oriole Nov 20, 2022, 10:04 PM

#

Yeah, kind of one of the whole selling points of structured programming.

haughty mountain Nov 20, 2022, 10:04 PM

#

fiery cosmos can i use the same for j in range loop near the end to construct the sets?

you could maybe use it to find the value you need to backtrack

#

by modifying it a bit

fiery cosmos Nov 20, 2022, 10:05 PM

#

maybe i start:

def set_getr(dp):
    for j in range(dp):
```?

haughty mountain Nov 20, 2022, 10:05 PM

#

the existing loop is a min while you want an argmin

#

so you can re-create the partition from the sum

opal oriole Nov 20, 2022, 10:07 PM

#

You can think of indices as holding the structure, while the values are well, the values.

#

And sometimes you want the structure, not the values.

#

Or both.

fiery cosmos Nov 20, 2022, 10:07 PM

#

def set_getr(dp,s):
    for j in range(s):
      y = argmin(dp[j][s])

haughty mountain Nov 20, 2022, 10:07 PM

#

dp tables in particular tend to hold a lot of information in the indexing 😛

fiery cosmos Nov 20, 2022, 10:07 PM

#

how far off am i

opal oriole Nov 20, 2022, 10:07 PM

#

And argmin, max, etc, deals with indices (while regular min is for values).

haughty mountain Nov 20, 2022, 10:08 PM

#

fiery cosmos how far off am i

very?

fiery cosmos Nov 20, 2022, 10:08 PM

#

figured

haughty mountain Nov 20, 2022, 10:08 PM

#

considering I don't know what you're trying to do there

fiery cosmos Nov 20, 2022, 10:08 PM

#

lmao ok

#

let me try again

haughty mountain Nov 20, 2022, 10:08 PM

#

you know what we mean by argmin, right?

fiery cosmos Nov 20, 2022, 10:09 PM

#

the argument which gave rise to the minimum

haughty mountain Nov 20, 2022, 10:09 PM

#

right

fiery cosmos Nov 20, 2022, 10:09 PM

#

the min diff in the existing loop?

opal oriole Nov 20, 2022, 10:10 PM

#

Think of a list / table as a function which maps index to value, so what would an argmin give (what is the argument?)?

fiery cosmos Nov 20, 2022, 10:10 PM

#

should i be trying to augment the existing for j loop or writing a new function

haughty mountain Nov 20, 2022, 10:10 PM

#

what index produced the minimum diff

#

also, I feel like that loop has a bug

opal oriole Nov 20, 2022, 10:11 PM

#

gtg, gl

haughty mountain Nov 20, 2022, 10:11 PM

#

oh wait

#

nvm

#

they only look at indices <= total_sum/2

#

but that's actually fine

fiery cosmos Nov 20, 2022, 10:12 PM

#

you can run it in python if you so desire

haughty mountain Nov 20, 2022, 10:13 PM

#

since it's mirrored

#

if one partition has sum total_sum/2 - x the other has sum total_sum/2 + x

#

so you actually only need to check one direction

#

why would I run the code? I'm trying to check that the logic is right

#

running examples can make things look correct unless you actually happen to try some edge case

fiery cosmos Nov 20, 2022, 10:14 PM

#

right right

#

ok yeah the pseudocode i have now won't work at all. i'm not doing +=1 in the way we did for the arrays above anywhere

#

and apparently they don't either anywhere either..

#

they just write True or False throughout the table

haughty mountain Nov 20, 2022, 10:22 PM

#

we weren't doing += 1

fiery cosmos Nov 20, 2022, 10:22 PM

#

no just assigning 1 as T

#

so im assuming their T/F is our 1/0, so everything is the same but i have the first for loop make the whole first column 1 instead of T

#

i got rid of the second loop, the for j in range where they set things equal to False

#

will that break things?

haughty mountain Nov 20, 2022, 10:24 PM

#

#

this isn't adding

#

it's an or

haughty mountain Nov 20, 2022, 10:24 PM

#

fiery cosmos i got rid of the second loop, the for j in range where they set things equal to ...

not if you default to False

fiery cosmos Nov 20, 2022, 10:24 PM

#

im defaulting to 0, as thats the way the table is instantiated

#

the only difference is that i set all the first column to be 1 as in your example using their existing loop

#

i'll be able to prove it works when i walk through it with examples but i haven't figured out how to return the subsets yet

haughty mountain Nov 20, 2022, 10:26 PM

#

actually, why do they do that?

#

you don't need to set dp[...][0] to trues

#

you only need to set dp[0][0] to true

fiery cosmos Nov 20, 2022, 10:26 PM

#

Initialize top row, except dp[0][0],

# as false. With 0 elements, no other
    # sum except 0 is possible
    for j in range(1, su + 1):
        dp[0][j] = False

haughty mountain Nov 20, 2022, 10:26 PM

#

and then the loop should handle the rest

#

oh, their loop is dumb

fiery cosmos Nov 20, 2022, 10:27 PM

#

im guessing you are saying a non-logical loop, rather than a dumb impl

haughty mountain Nov 20, 2022, 10:27 PM

#

it should go from zero

for j in range(0, su + 1):

#

that way you don't need to special case the whole first column like they do

#

i.e. default everything to 0, dp[0][0] = True and you're good to go

#

the rule we apply works fine for zero, it's not a special case like they make it look like

fiery cosmos Nov 20, 2022, 10:29 PM

#

im using dp[0][0] = 1, its ok?

haughty mountain Nov 20, 2022, 10:30 PM

#

right

#

1/True/whatever

fiery cosmos Nov 20, 2022, 10:30 PM

#

that means i don't need the loop initializing the first column to 1's..

haughty mountain Nov 20, 2022, 10:30 PM

#

as long as you fix the loop that's the one index you need to set

fiery cosmos Nov 20, 2022, 11:06 PM

#

so i've put it into pythontutor to see what's happening but theirs has 1 indexing in the j loop

wide gale Nov 21, 2022, 12:37 AM

#

i just dont know what the original uncompressed string is, since the source code only has the serialized form

#

thanks this is helpful. good to know I was sort of in the right direction because I planning on making classes for the pokemon and poke data. im still going to try and more or less do it on my own and check for references because I want to try figuring out myself. kind of like the 'look' 'cover' 'write' check' process.

haughty mountain Nov 21, 2022, 12:45 AM

#

you could at least grab my class that wraps their data and converts some of it to less awful formats

#

like a byte string rather than an array of strings representing bytes

#

the overall js code is quite bad

wide gale Nov 21, 2022, 1:02 AM

#

i didnt even know bytes was a data type

#

i still dont understand 1.how he compressedit and 2. how do i get the original un serialized data? And for the sake of understanding how did he serialise it to begin with (I know he uses a look up table, but from where)

wide gale Nov 21, 2022, 1:23 AM

#

Are there any tutorials that would help me understand this encoding decoding? because this stuff is kind of new to me and making my head hurt hahah. encoding decoding file formats sounds like it could be a really useful skill for me as a 3d artist

lean lake Nov 21, 2022, 2:06 AM

#

hello. would anyone know how to get from a pandas dataframe with 2 columns (x,y) as coordinates, and turn those into some kind of random walk?

#

#

the goal is to maximize profit. maximum of 24 hours to do the heist. we have to calculate travel time too. and we need to get back to the choppah before we run out of time. idk where to start with this.

#

i was thinking using networkx to generate a graph. i thought x,y coords could be seen as nodes. it's just not working. been googling for a few hours

topaz surge Nov 21, 2022, 7:31 AM

#

I have tree given in parent array representation. How can I inorder traverse it?

haughty mountain Nov 21, 2022, 9:06 AM

#

lean lake i was thinking using networkx to generate a graph. i thought x,y coords could be...

x,y coords would determine distances between nodes which would be weights on edges in a graph

fiery cosmos Nov 21, 2022, 9:57 AM

#

hi guys i have 2 txt files, textfile1 is 500k lines and textfile2 is 3m lines i want to compare textfile1 with textfile2 and copy the entry from textfile2 into a new textfile called output. What would be the fastest way/algo to do this ?

agile sundial Nov 21, 2022, 1:29 PM

#

fiery cosmos hi guys i have 2 txt files, textfile1 is 500k lines and textfile2 is 3m lines i ...

wdym "compare"

lean lake Nov 21, 2022, 1:36 PM

#

haughty mountain x,y coords would determine distances between nodes which would be weights on edg...

Yeah, I thought of doing this manually. But there's 10000 nodes. They're all connected too. I can't wrap my head around this problem

#

How do I go from the pandas dataframe to a code that checks the most lucrative path from node to node. But all that from a pandas dataframe 😬

#

I was able to use math.dist to check the distance between 2 nodes

haughty mountain Nov 21, 2022, 2:25 PM

#

lol, for 100000 nodes you're absolutely screwed trying to find an optimal solution

#

I expect this kind of problem to be NP-hard

lean lake Nov 21, 2022, 2:26 PM

#

haughty mountain I expect this kind of problem to be NP-hard

Yup agreed

#

Doesn't need to be perfect, but I'd like to give it a try at least lol

#

We have until Thursday.. I'm so stressed lmao

haughty mountain Nov 21, 2022, 2:27 PM

#

this feels like the kind of problem I would see in google hashcode

#

i.e. a large NP-hard problem that people try to optimize

#

also, it's 10k and not 100k, right?

#

at least that's your table size

lean lake Nov 21, 2022, 2:29 PM

#

Yeah 10k

#

My bad. On phone at work

haughty mountain Nov 21, 2022, 2:30 PM

#

(10k)^2 edges is borderline managable

lean lake Nov 21, 2022, 2:30 PM

#

With the full time job that leaves me 3-4 hours a day to actually do some work on the problem. Not optimal

lean lake Nov 21, 2022, 2:31 PM

#

haughty mountain (10k)^2 edges is borderline managable

That's like 100m ? Oof lol

haughty mountain Nov 21, 2022, 2:32 PM

#

maybe too much for python

#

but in a less memory wasting language we're talking less than a GB of memory

lean lake Nov 21, 2022, 2:35 PM

#

One of the rule is that the code has to run in less than 3 minutes on an average laptop

#

😬

agile sundial Nov 21, 2022, 2:36 PM

#

3 mins is p lenient, even for python

haughty mountain Nov 21, 2022, 2:54 PM

#

agile sundial 3 mins is p lenient, even for python

depends, I had a task where your typical reading from file took close to a minute 😛

#

like a GB of input data iirc

#

(then solving the problem took only a few seconds, because decent algos)

#

so a dumb attempt at a solution would be to just greedily pick the "best" node

#

where best is up to you

#

a reasonable heuristic would be something like profit divided by time spent

#

with a restriction that you only allow to pick nodes where you can make it back in time

#

if this was google hashcode that would be my first thing to try

#

O(n²) ish

#

but for n=10^4 I would consider using something other than python

#

or at least using pypy

lean lake Nov 21, 2022, 3:13 PM

#

haughty mountain a reasonable heuristic would be something like profit divided by time spent

That's what I thought selecting neighbors with highest ratio of money/time

#

And always moving towards 0,0 as we do the random walk

#

The problem is that I googled for 6 hours yesterday, trying to find snippets of code to get me started. Can't find anything that uses a pandas dataframe to create relations between the nodes I'd create with networkx. I managed to visualize a simple scatter plot with matplotlib tho. But it was useless

haughty mountain Nov 21, 2022, 3:40 PM

#

I would just write the graph structure myself, but that's just me

#

granted, since you want all connections you could use numpy

#

!e

import numpy as np
x = np.array([[1, 2, 7, 4]])
y = np.array([[-3, 2, 1, 4]])
print(np.sqrt((x - x.T)**2 + (y - y.T)**2))

halcyon plankBOT Nov 21, 2022, 3:49 PM

#

@haughty mountain :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | [[0.         5.09901951 7.21110255 7.61577311]
002 |  [5.09901951 0.         5.09901951 2.82842712]
003 |  [7.21110255 5.09901951 0.         4.24264069]
004 |  [7.61577311 2.82842712 4.24264069 0.        ]]

haughty mountain Nov 21, 2022, 3:50 PM

#

that also helps with space issues, since numpy is actually pretty good about memory usage since it doesn't use python collections

#

so what I computed there is the distance between all pairs

#

which is essentially an adjacency matrix

lean lake Nov 21, 2022, 3:52 PM

#

Interesting, I'd have to iterate over the rows in the dataframe and plug in the coords in numpy

haughty mountain Nov 21, 2022, 3:53 PM

#

would you? I expect pandas would play well with numpy

lean lake Nov 21, 2022, 3:53 PM

#

Yeah, pandas is built on top of numpy, now that I think of it

haughty mountain Nov 21, 2022, 3:54 PM

#

docs says .to_numpy on a dataframe

hot latch Nov 21, 2022, 4:23 PM

#

I'm practicing hackerrank and having some trouble finishing this solution

#

#

do any of you have any ideas for optimization for the algorithm?

#

of course brute force is easy

#

and I dn't think simply adding a list so we don't have to recheck certain strings

haughty mountain Nov 21, 2022, 4:31 PM

#

compute the True/False answer for each element, do a prefix sum, query sum in range in O(1) per query

hot latch Nov 21, 2022, 4:36 PM

#

haughty mountain compute the True/False answer for each element, do a prefix sum, query sum in ra...

do you mind typing a solution, I do not understand what you mean, sorry

#

or some pseudocode

haughty mountain Nov 21, 2022, 4:37 PM

#

the first part shouldmbe obvious, just compute the answer for each element

hot latch Nov 21, 2022, 4:37 PM

#

yup I got that

haughty mountain Nov 21, 2022, 4:37 PM

#

search for "prefix sum" then

#

that's the key term you need to look into

hot latch Nov 21, 2022, 4:39 PM

#

def prefix_sums(A):
2 n = len(A)
3 P = [0] * (n + 1)
4 for k in xrange(1, n + 1):
5 P[k] = P[k - 1] + A[k - 1]
6 return P

#

so something like this

#

and then what do you mean by the last part

#

query sum in range in O(1)

lean lake Nov 21, 2022, 6:16 PM

#

haughty mountain !e ```py import numpy as np x = np.array([[1, 2, 7, 4]]) y = np.array([[-3, 2, 1...

You think this would compute relatively fast for 10000 coords?

So like df[x_coords].to_numpy and same for y_coords. Store that in variables and plug it into the hypotenuse formula? I know math.dist does that. Numpy is probably faster tho

lean lake Nov 21, 2022, 6:45 PM

#

With that adj matrix graph for networkx would work

#

Just need to access IDs

#

And make a list of IDs as output somehow

agile sundial Nov 21, 2022, 6:47 PM

#

calm down with the gifs..

lean lake Nov 21, 2022, 6:48 PM

#

Should ban imo. Not the place for this crap.

haughty mountain Nov 21, 2022, 6:52 PM

#

<@&831776746206265384>

haughty mountain Nov 21, 2022, 6:53 PM

#

lean lake You think this would compute relatively fast for 10000 coords? So like df[x_coo...

I wouldn't even put this in networkx, I would expect the size to grow a bunch if you do

#

granted idk what their internal format is

#

you have an adjacency matrix, so you have a graph description already

lean lake Nov 21, 2022, 6:54 PM

#

haughty mountain you have an adjacency matrix, so you have a graph description already

Good point

haughty mountain Nov 21, 2022, 6:55 PM

#

and the indices correspond to rows in your date

lean lake Nov 21, 2022, 6:55 PM

#

I guess I'm lacking confidence without the training wheels 😂

fathom whale Nov 21, 2022, 6:56 PM

#

!ban 502883843825598475 not the place for those gifs

halcyon plankBOT Nov 21, 2022, 6:56 PM

#

:incoming_envelope: :ok_hand: applied ban to @lethal stirrup permanently.

haughty mountain Nov 21, 2022, 6:57 PM

#

and as for performance, it's probably isn't going to be great, though maybe you can vectorize most operations with numpy

#

rows of your matrix gives you the distance

#

you will also have vectors with things like the money, time, ...

#

which you can combine with the distance to compute a cost

#

which should be easily vectorizable with numpy

#

i.e. not python slowness

#

(basically do as little as you can in pure python)

brittle moat Nov 21, 2022, 7:02 PM

#

hey guys, can someone explain how in this topological sort algo, how it's traversing back to the nodes which had their vertices appended? My graph is in this order {'A': ['C'], 'C': ['E'], 'E': ['H', 'F'], 'B': ['C', 'D'], 'D': ['F'], 'F': ['G']}). After it indexes G from F, it goes back to F, then back to E, then back to C, etc. Where is the code that's doing that? I understand all other bits of the algo, except that part

haughty mountain Nov 21, 2022, 7:02 PM

#

hot latch query sum in range in O(1)

that's what a prefix sum allows you to do

brittle moat Nov 21, 2022, 7:02 PM

#

from collections import defaultdict #dict for graph
"""
1. If a vertex depends on CurrentVertex -> Go to that vertex and then come back to current vertex
2. Push current vertex to stack
"""
class Graph:
    def __init__(self, numVertices):
        self.graph = defaultdict(list)

    def addEdge(self, vertex, edge):
        #adding vertices and their edges
        self.graph[vertex].append(edge) #adding edge to vertex: A: C, B:C
        
        #print("added edges", self.graph) #adding edges and tehir vertices to graph
        print("starting graph", self.graph)
    def topologicalSortUtil(self, currentVertex, visited, stack):
        visited.append(currentVertex) #add all unvisited vertices to visited initially from first topological sort call. 
        
        #Add A, C, E H, F, G
        
            #after all the vertices have been added from the first function - edge elements gets appended
        
        print("visited elements", visited) #should be 

        # #finding edges of these vertices
        for i in self.graph[currentVertex]: #looping through edges
            print("when currentVertex is indexed", i) 
            if i not in visited: 
                print("not in visited", i) #A, C, E, H, F, G
                self.topologicalSortUtil(i, visited, stack) 
                
        print("item about to enter stack", currentVertex)
        stack.insert(0, currentVertex) 
        print("Stack", stack) 

    def topologicalSort(self):
        visited = []
        stack = []

        for k in list(self.graph): 
            if k not in visited: 
                print("not yet visited ", k)
                self.topologicalSortUtil(k, visited, stack)

        #print("finished stack", stack)

graph = Graph(8)

graph.addEdge("A", "C")
graph.addEdge("C", "E")
graph.addEdge("E", "H")
graph.addEdge("E", "F") 
graph.addEdge("B", "C")
graph.addEdge("B", "D")
graph.addEdge("D", "F")
graph.addEdge("F", "G")

graph.topologicalSort()```

fiery cosmos Nov 21, 2022, 7:09 PM

#

dang i definitely just missed something funny.. what sort of gifs were they spamming? also, why are mathematicians preoccupied with things like this:

#

lean lake Nov 21, 2022, 7:12 PM

#

Is this 3d representation of something 11d? I'd see why they are if so

fiery cosmos Nov 21, 2022, 7:15 PM

#

let me get back to where i found it

#

its a stereographic projection of a dodecaplex

#

#

https://en.wikipedia.org/wiki/Regular_dodecahedron

hybrid epoch Nov 21, 2022, 7:23 PM

#

Also known as a three-manifold
http://www.gang.umass.edu/~kusner/other/3mfd.html

fiery cosmos Nov 21, 2022, 7:26 PM

#

oh. is this the genesis of your username

#

factorial time is greater than polynomial time yeah?

#

n! > n^3?

hybrid epoch Nov 21, 2022, 7:28 PM

#

Yup

lean lake Nov 21, 2022, 7:29 PM

#

R3 is real numbers 3d?

#

I have so much to learn yet so little time 😅

hybrid epoch Nov 21, 2022, 9:02 PM

#

I think of R^n as n-dimensional euclidian space
So if you're talking about a real number, then you're referring to an element in R (or R^1), which is a 1-dimensional euclidian space

lean lake Nov 21, 2022, 9:49 PM

#

hybrid epoch I think of R^n as n-dimensional euclidian space So if you're talking about a re...

Makes sense

#

Just got back home. Will be working on the workshop for school. Hopefully I get somewhere today

lean lake Nov 21, 2022, 10:36 PM

#

haughty mountain !e ```py import numpy as np x = np.array([[1, 2, 7, 4]]) y = np.array([[-3, 2, 1...

output for this is flat. not quite sure why.. hmm .reshape?

df10x = df10.x_coordinate.to_numpy()
df10y = df10.y_coordinate.to_numpy()

df10adj = np.sqrt((df10x - df10x.T)**2 + (df10y - df10y.T)**2)

haughty mountain Nov 21, 2022, 10:37 PM

#

you'll need them as matrices yeah

#

rather than a vector

lean lake Nov 21, 2022, 10:38 PM

#

hmm

#

so reshape(10000,1)

haughty mountain Nov 21, 2022, 10:39 PM

#

np.asmatrix would do the work

lean lake Nov 21, 2022, 10:40 PM

#

sorry im kinda lost

#

just came back from work lol. long day.

haughty mountain Nov 21, 2022, 10:40 PM

#

df10x = np.asmatrix(...)

lean lake Nov 21, 2022, 10:40 PM

#

so, the pandas series gotta be turned asmatrix

#

hmm, i get a lot of nan

#

matrix([[        nan,         nan,         nan, ...,         nan,
                 nan,         nan],
        [        nan,         nan,         nan, ..., 27.29493721,
                 nan,         nan],
        [        nan,         nan,         nan, ...,         nan,
                 nan,         nan],
        ...,
        [        nan, 27.29493721,         nan, ...,         nan,
                 nan, 33.6148582 ],
        [        nan,         nan,         nan, ...,         nan,
                 nan,         nan],
        [        nan,         nan,         nan, ..., 33.6148582 ,
                 nan,         nan]])

#

df10x = df10.x_coordinate.to_numpy()
df10y = df10.y_coordinate.to_numpy()

df10x = np.asmatrix(df10x)
df10y = np.asmatrix(df10y)

df10adj = np.sqrt((df10x - df10x.T)**2 + (df10y - df10y.T)**2)
df10adj

haughty mountain Nov 21, 2022, 10:44 PM

#

oh wait, I see what's going on

lean lake Nov 21, 2022, 10:44 PM

#

i also reduced the df to 50 elements, just for the prototyping part

haughty mountain Nov 21, 2022, 10:45 PM

#

**2 for matrix is a matrix multiplication

#

maybe there is a better way to turn the array 2d...

lean lake Nov 21, 2022, 10:49 PM

#

haughty mountain maybe there is a better way to turn the array 2d...

df10x = df10.x_coordinate.to_numpy()
df10y = df10.y_coordinate.to_numpy()

nx = len(df10x)
ny = len(df10y)

df10x = np.reshape(df10x, (nx,1))
df10y = np.reshape(df10y, (ny,1))

df10adj = np.sqrt((df10x - df10x.T)**2 + (df10y - df10y.T)**2)
df10adj

#

output

array([[ 0.        ,  4.06879027,  4.98027366, ...,  8.46070511,
         7.68653398,  6.14089225],
       [ 4.06879027,  0.        ,  5.82129779, ..., 11.52977429,
         8.24517308,  2.33589784],
       [ 4.98027366,  5.82129779,  0.        , ...,  6.60271112,
         2.73392075,  6.21347304],
       ...,
       [ 8.46070511, 11.52977429,  6.60271112, ...,  0.        ,
         6.55712635, 12.58602939],
       [ 7.68653398,  8.24517308,  2.73392075, ...,  6.55712635,
         0.        ,  8.08790142],
       [ 6.14089225,  2.33589784,  6.21347304, ..., 12.58602939,
         8.08790142,  0.        ]])

#

#

does that sound right?

haughty mountain Nov 21, 2022, 10:50 PM

#

actually, there is also meshgrid which might be cleaner

#

x_mat, y_mat = np.meshgrid(x_vec, y_vec)

#

and then compute things with the x and y matrices

lean lake Nov 21, 2022, 10:53 PM

#

you're right

#

way cleaner

#

df10x = df10.x_coordinate.to_numpy()
df10y = df10.y_coordinate.to_numpy()

x_mat, y_mat = np.meshgrid(df10x, df10y)
df10adj = np.sqrt((x_mat - x_mat.T)**2 + (y_mat - y_mat.T)**2)
df10adj

smoky tapir Nov 21, 2022, 10:53 PM

#

Is it possible to generate nodes in a loop doubly linked lists

haughty mountain Nov 21, 2022, 10:53 PM

#

actually, would that even do the right thing...

#

I think not

#

you probably want the x - x.T

#

meshgrid doesn't quite do the same thing

lean lake Nov 21, 2022, 10:55 PM

#

both output looks the same, i think

haughty mountain Nov 21, 2022, 10:56 PM

#

they shouldn't pithink

#

the equivalent operation would be something like

x1,x2 = np.meshgrid(x, x)
x_dist = x1 - x2
y1,y2 = np.meshgrid(y, y)
y_dist = y1 - y2
np.sqrt(x_dist**2 + y_dist**2)

#

which is...not great

#

why do you even get a vector from pandas?

#

oh, is the thing you're converting not a dataframe?

#

but a column, or something?

haughty mountain Nov 21, 2022, 11:02 PM

#

lean lake ```py df10x = df10.x_coordinate.to_numpy() df10y = df10.y_coordinate.to_numpy() ...

I'd say this is probably clean enough

lean lake Nov 21, 2022, 11:03 PM

#

yeah df[x_coordinate] and the y equivalent, they are Series in pandas i think

haughty mountain Nov 21, 2022, 11:05 PM

#

apparently

series.reset_index().to_numpy()
```would work

#

reset_index turns it into a dataframe

#

so you should get a 2d array

lean lake Nov 21, 2022, 11:06 PM

#

oh wow look at the mess i made

lean lake Nov 21, 2022, 11:07 PM

#

haughty mountain reset_index turns it into a dataframe

oh interesting, let me check that one out

lean lake Nov 21, 2022, 11:08 PM

#

haughty mountain so you should get a 2d array

looks like the shape i get is 500,2

#

but

ValueError                                Traceback (most recent call last)
Cell In [56], line 10
      2 df10y = df10.y_coordinate.reset_index().to_numpy()
      4 # nx = len(df10x)
      5 # ny = len(df10y)
      6 
      7 # df10x = np.reshape(df10x, (nx, 1))
      8 # df10y = np.reshape(df10y, (ny, 1))
---> 10 df10adj = np.sqrt((df10x - df10x.T)**2 + (df10y - df10y.T)**2)
     11 df10adj

ValueError: operands could not be broadcast together with shapes (500,2) (2,500)

#

df10x = df10.x_coordinate.reset_index().to_numpy()
df10y = df10.y_coordinate.reset_index().to_numpy()

df10adj = np.sqrt((df10x - df10x.T)**2 + (df10y - df10y.T)**2)
df10adj

#

oh the index is thrown in

#

ok i cheated

#

df10x = df10.x_coordinate.reset_index().to_numpy()[:,1:]
df10y = df10.y_coordinate.reset_index().to_numpy()[:,1:]

#

added a slice at the end lol

#

im not sure how this works but i got node IDs too

lean lake Nov 22, 2022, 1:11 AM

#

so i somehow managed to make dijkstra work on my adjacency matrix. can i use it to compute the "optimal" path to steal from the banks

haughty mountain Nov 22, 2022, 1:51 AM

#

not really?

#

it's not a shortest path problem

#

did you try just implementing the greedy choices?

#

it's probably the best you can (easily) do

#

it feels like a constrained version of the longest path problem

#

and the longest path problem is NP-hard

short condor Nov 22, 2022, 2:24 PM

#

Is this the right place to ask about trees? I'm unsure if I can ask it in #help channels because it is not really Python-exclusive.

#

Please delete if this is not the right place, but how do I know where to stop "tree-ing" the 1's in a Fibonnacci tree? I am watching a tree introduction and this is how they make a fibonnaci tree with root 3

/
1 2
/\ /
0 1 1 1
/
0 1

#

How do I know when to stop? If I can divide the 1's into [0,1], then why is a Fibonnaci tree not an inifite sequence of 0's and 1's

#

i edited the tree

#

so if i follow your 4th condition, then the correct tree is ...

#

-----3
/
1 2
/\ /
0 1 1 1
/\ /
0 1 0 1

#

in his code, if i am understanding it correctly, when the node key is one (or zero) and it is a root then it is a tree (a leaf) also

fiery cosmos Nov 22, 2022, 6:42 PM

#

is it multithreaded or linear algorithms that are N/A to python?

fiery cosmos Nov 22, 2022, 7:02 PM

#

can someone explain the vertex cover problem to me

#

i don't get what they're after

ionic mantle Nov 22, 2022, 7:13 PM

#

what other algorithms could i use for this

#

fiery cosmos Nov 22, 2022, 7:18 PM

#

i don't mean algos with O(n) runtime

#

i mean linear programming

#

ok thanks

fiery cosmos Nov 22, 2022, 7:21 PM

#

fiery cosmos can someone explain the vertex cover problem to me

bump

#

it is min by problem definition, but yes, that. i need to first understand what a vertex cover is. i am reading the wikipedia now

#

oh i get it. every edge has at least one endpoint node in the vertex cover:

#

#

#

vertex covers and min vertex covers

#

top and bottom

#

right

fiery cosmos Nov 22, 2022, 8:13 PM

#

https://en.wikipedia.org/wiki/Clique_problem

#

#

3-CNF >=P clique

half storm Nov 22, 2022, 8:18 PM

#

What's cnf and what's p?

fiery cosmos Nov 22, 2022, 8:18 PM

#

CNF = conjunctive normal form. the P is polynomial time reducible

#

NP problems are determined for hardness by reductions from known NP problems

half storm Nov 22, 2022, 8:18 PM

#

Oh thanks 🙂

fiery cosmos Nov 22, 2022, 8:19 PM

#

the original NP being circuit satisfiability

half storm Nov 22, 2022, 8:19 PM

#

Try to understand why it's greater

fiery cosmos Nov 22, 2022, 8:19 PM

#

i may have written it in the wrong direction

haughty mountain Nov 22, 2022, 8:19 PM

#

the typical name is 3SAT

half storm Nov 22, 2022, 8:20 PM

#

Nah I get what you want to say, no worry

haughty mountain Nov 22, 2022, 8:20 PM

#

3CNF is the canonical form

half storm Nov 22, 2022, 8:20 PM

#

haughty mountain 3CNF is the canonical form

3sat?

fiery cosmos Nov 22, 2022, 8:20 PM

#

circuit sat is separate from 3cnf sat

#

haughty mountain Nov 22, 2022, 8:22 PM

#

who talked about circuit sat?

half storm Nov 22, 2022, 8:23 PM

#

Think me by, accident 😦

fiery cosmos Nov 22, 2022, 8:23 PM

#

i was saying circuit sat is the original NP problem from which others are derived

haughty mountain Nov 22, 2022, 8:23 PM

#

oh, I was complaining about you calling the problem 3CNF

half storm Nov 22, 2022, 8:24 PM

#

Because it's 45 cnf

haughty mountain Nov 22, 2022, 8:24 PM

#

3-SAT is the usual name

fiery cosmos Nov 22, 2022, 8:24 PM

#

#

just listing off wiki stuff

haughty mountain Nov 22, 2022, 8:24 PM

#

https://en.wikipedia.org/wiki/Boolean_satisfiability_problem#3-satisfiability

Boolean satisfiability problem

In logic and computer science, the Boolean satisfiability problem (sometimes called propositional satisfiability problem and abbreviated SATISFIABILITY, SAT or B-SAT) is the problem of determining if there exists an interpretation that satisfies a given Boolean formula. In other words, it asks whether the variables of a given Boolean formula can...

half storm Nov 22, 2022, 8:25 PM

#

Nobody was against you dude xS

haughty mountain Nov 22, 2022, 8:25 PM

#

fiery cosmos

that says 3-CNF Satisfiability

#

which is another name for it

fiery cosmos Nov 22, 2022, 8:26 PM

#

another name for 3SAT. got it

haughty mountain Nov 22, 2022, 8:27 PM

#

the SAT part is the actual problem

#

is this thing satisfiable

#

the 3-CNF is a restriction on the allowed input

half storm Nov 22, 2022, 8:27 PM

#

Tried to make a truth table?

#

That makes it quite clear I think

#

If I did not miss anything

haughty mountain Nov 22, 2022, 8:29 PM

#

fiery cosmos i was saying circuit sat is the original NP problem from which others are derive...

(and that's one way of deriving it, you could really start with any problem)

#

the NP complete problems are fun in that if you can solve one, you can solve all

fiery cosmos Nov 22, 2022, 8:30 PM

#

how about the problem of figuring out what's wrong with my mother-in-law, definitely NP complete

half storm Nov 22, 2022, 8:30 PM

#

haughty mountain (and that's one way of deriving it, you could really start with any problem)

And it's the first way to understand in logic, as you have the bollean rules

#

And de Morgan of course

fiery cosmos Nov 22, 2022, 8:31 PM

#

de Morgan was a slouch

half storm Nov 22, 2022, 8:33 PM

#

fiery cosmos de Morgan was a slouch

Why! 😮 just know his laws for logic

fiery cosmos Nov 22, 2022, 8:33 PM

#

i'm totally joking. as was i for the mother in law comment. i dont have a mother in law

fiery cosmos Nov 22, 2022, 8:34 PM

#

fiery cosmos how about the problem of figuring out what's wrong with my mother-in-law, defini...

c'mon this was pretty good

#

now i know how @haughty mountain felt when nobody appreciated the pi hexadecimal reconstruction 😦

half storm Nov 22, 2022, 8:35 PM

#

OK, I think I miss ome concept to understand this joke :/

fiery cosmos Nov 22, 2022, 8:35 PM

#

which joke

half storm Nov 22, 2022, 8:35 PM

#

😢

haughty mountain Nov 22, 2022, 8:42 PM

#

fiery cosmos how about the problem of figuring out what's wrong with my mother-in-law, defini...

is your mother-in-law 3-SAT? ||because she's near impossible to satisfy||

fiery cosmos Nov 22, 2022, 8:43 PM

#

lmao there we go

haughty mountain Nov 22, 2022, 8:43 PM

#

the lowest of humor

fiery cosmos Nov 22, 2022, 8:43 PM

#

the nichest

haughty mountain Nov 22, 2022, 8:44 PM

#

it's a hard thing to cover

fiery cosmos Nov 22, 2022, 8:44 PM

#

ahhhhh hahahaha

wide gale Nov 22, 2022, 11:04 PM

#

haughty mountain you could at least grab my class that wraps their data and converts some of it t...

I know it's been a couple days but, want to know how uncompressed Pokemon data was serialized with a look up table.

signal path Nov 22, 2022, 11:29 PM

#

would this be an appropriate channel to ask for an explanation for why one solution might be better than another?

agile sundial Nov 22, 2022, 11:57 PM

#

probably

haughty mountain Nov 23, 2022, 1:04 AM

#

wide gale I know it's been a couple days but, want to know how uncompressed Pokemon data ...

idk what their serialization process is, but the deserialization logic is simple enough

wide gale Nov 23, 2022, 3:55 AM

#

haughty mountain idk what their serialization process is, but the deserialization logic is simple...

in your example was the raw_pkmn_data you imported made into a class? because I tried without classes and got an error at line parsed = dict(map(parse_name, raw_pkmn_data.pkmns.split('|'))) saying that raw_pkm_data doesnt have any pkmns attribute which makes sense because the raw data was just two strings in the original

Screen_Shot_2022-11-23_at_2.51.15_pm.png

wide gale Nov 23, 2022, 4:18 AM

#

i know why this doesnt work but how would that pkmdata class work in your one since your acessing raw_pkm _data attributes

Screen_Shot_2022-11-23_at_3.16.50_pm.png

haughty mountain Nov 23, 2022, 8:46 AM

#

wide gale in your example was the raw_pkmn_data you imported made into a class? because I ...

oh, the pkmns, eggs, and types are from the js file

#

https://github.com/LegendaryPKMN/ivcalc/blob/master/javacalc.html#L533-L534

halcyon plankBOT Nov 23, 2022, 8:52 AM

#

javacalc.html lines 533 to 534

types = ['Normal','Fighting','Flying','Poison','Ground','Rock','Bug','Ghost','Steel','???','Fire','Water','Grass','Electric','Psychic','Ice','Dragon','Dark'];
eggs = ['???','Monster','Water1','Bug','Flying','Ground','Fairy','Plant','Humanshape','Water3','Mineral','Indeterminate','Water2','Ditto','Dragon','No Eggs'];```

haughty mountain Nov 23, 2022, 8:52 AM

#

https://github.com/LegendaryPKMN/ivcalc/blob/master/javacalc.html#L497

GitHub

ivcalc/javacalc.html at master · LegendaryPKMN/ivcalc

LegendaryPKMN.net’s Pokémon Individual Value & Stat Calculator. - ivcalc/javacalc.html at master · LegendaryPKMN/ivcalc

#

and of course the other lines below that

wide gale Nov 23, 2022, 9:21 AM

#

haughty mountain oh, the pkmns, eggs, and types are from the js file

thanks, for pointing that out. though I don't think that answers my original question. like all that is necessary data but it doesn't address my original problem. how does raw_pkm_data have a pkms attribute if there all various strings/ lists (not a custom datatype)

haughty mountain Nov 23, 2022, 9:51 AM

#

wide gale thanks, for pointing that out. though I don't think that answers my original qu...

it is just a string

#

like, raw_pkmn_data.py is just

#

their data

#

(ignore the list[...] typehint from my editor)

#

the actual code I have just puts their data in a nicer form

#

I just put all their raw data in a separate module

wide gale Nov 23, 2022, 11:21 AM

#

Oh I see, all good. Thanks for explaining!

#

I'm so used .attribute being a class /data type thing I forgot you can just do that with global variables in python

inner yacht Nov 23, 2022, 10:14 PM

#

Assume you have a set of jobs, each taking from time x to time y.
You can only do one at once and each must be completed their entire time.
How would you maximize the amount of time taken? (Not all jobs have to be completed, just maximize the total time)

fiery cosmos Nov 23, 2022, 11:46 PM

#

log(log n) ∈ o(log n) is true, right pithink (that's little-o, not big)

haughty mountain Nov 24, 2022, 12:22 AM

#

yes

#

for any ε these exist some n0 such that for n >= n0

log n/log log n ≤ ε

marsh birch Nov 24, 2022, 12:24 AM

#

#

Im confused is there another way I can run code what does the lines mean?

haughty mountain Nov 24, 2022, 12:24 AM

#

haughty mountain for any ε these exist some n0 such that for n >= n0 log n/log log n ≤ ε

more informally, the fraction tends to zero

haughty mountain Nov 24, 2022, 12:26 AM

#

marsh birch

your answer is wrong, it will print an empty line

#

at the beginning

marsh birch Nov 24, 2022, 12:26 AM

#

Yes I know

#

But Idk how

#

Like how do I even find it out?

lament totem Nov 24, 2022, 12:27 AM

#

Because for j in range(0): does not even enter the code block indented below it

#

So it just executes print() and continues to the next i

marsh birch Nov 24, 2022, 12:38 AM

#

Ok thank you

wide gale Nov 24, 2022, 8:15 AM

#

haughty mountain like, raw_pkmn_data.py is just

sorry to bother you again. ive tried studying/ playing around with your code but I don't understand it. (specifically, the parts where you're extracting the serialized data) the things that are throwing me off (they go hand in hand). 1 your use of the byte datatype is wholly unfamiliar to me. so when you're manipulating those bytes variables i dont know what your doing 2. i don't know how you deserialized it. normally I could probably figure it out by myself but because it revolves around an unfamiliar datatype(bytes) i dont really know what to do

#

I admitt to being a beginner . my intentions doing this was to just to write my own simplified version. id have the data in regular data (lists of strings, or int arrays ect.) and use that to form the basis of everything. If knew it involved everything else i wouldnt have done it because its too advanced for my level.

#

even if it's bloated as hell id like to start with the base stats for every pokemon as a normal list of an array of integers in the raw data and work from there. How would you just get all the base stats in national dex order ? Or is it mangled in such a way that makes that difficult

haughty mountain Nov 24, 2022, 9:45 AM

#

wide gale sorry to bother you again. ive tried studying/ playing around with your code but...

the bytes datatype isn't advanced, it's just a sequence of values in range 0-255

#

you could use a list instead

#

Their pk is based on a string of hexadevimal values separated by comma, which they split to have a list of strings. Every time they read something from pk they need to do a conversion from the hex to int, I say just do the conversion once since at the end of the day what's needed is the byte values.

I use bytes.fromhex but you could also do [int(value, 16) for value in pk]

#

the pokemon stats (and some other stuff) is represented as 12 characters in the pkmn string, so raw_data grabs the relevant 12 bytes

#

basically it's just 12 characters for each pokemon in dex order in pkmn

#

these characters are mapped to indices using mn, and the actual byte values are in pk

#

(it's a dumb encoding, idk why they do it)

#

in any case, my raw_data function deals with this nonsense and just gives you 12 values

#

12 bytes

#

the first 6 are just the base stats

#

the next 2 are primary/secondary type

#

the next 2 are egg groups

#

the last 2 encode the ev yield

#

the format is dumb, but believe me when I say my code to deal with it is a lot better than the js code that does this...

wide gale Nov 24, 2022, 10:24 AM

#

Oh I believe you hahaha. It's night for me and I'm mentally tired so I'll try tomorrow morning but this clears it up. Thanks!

haughty mountain Nov 24, 2022, 11:05 AM

#

as an example, the second set of 12 characters (index 1) is '2ρρ**21()B )'

#

it's the data for bulbasaur

#

lets look at the first char '2' which should correspond to hp

reef swallow Nov 24, 2022, 11:06 AM

#

hi, how is going?

haughty mountain Nov 24, 2022, 11:07 AM

#

we see what index it has in mn

#               1111111111
#     01234567890123456789
mn = ' !"#$&()*+,-./01234567...'

#

index 16

reef swallow Nov 24, 2022, 11:08 AM

#

why 12 chars, not 16?

haughty mountain Nov 24, 2022, 11:08 AM

#

we look up the value at index 16 in pk

reef swallow Nov 24, 2022, 11:08 AM

#

or 12 is just random?

haughty mountain Nov 24, 2022, 11:10 AM

#

haughty mountain we look up the value at index 16 in `pk`

which is 2D or in decimal 45

#

which is indeed bulbasaur's hp stat

reef swallow Nov 24, 2022, 11:12 AM

#

what is in pk?

haughty mountain Nov 24, 2022, 11:13 AM

#

haughty mountain https://github.com/LegendaryPKMN/ivcalc/blob/master/javacalc.html#L497

you can look at the data here if you're interested

#

it's a bad format, but that's what they are using

haughty mountain Nov 24, 2022, 11:21 AM

#

reef swallow why 12 chars, not 16?

12 is enough for the data they are storing

haughty mountain Nov 24, 2022, 11:22 AM

#

haughty mountain 12 bytes

this

fiery cosmos Nov 24, 2022, 5:50 PM

#

Hi all, I saw on Reddit that I’ll be better working with data sets if I’m comfortable with eigenvectors and eigenvalues.. what are those and how will that allow me to comprehend a given dataset better

fiery cosmos Nov 24, 2022, 6:11 PM

#

remind me what is a doubly nested loop i can use to do an operation on all pairs of a list?

#

i think its something like:

for i in range(n):
  for i+1 in range(n)

agile sundial Nov 24, 2022, 6:14 PM

#

fiery cosmos Hi all, I saw on Reddit that I’ll be better working with data sets if I’m comfor...

seems kinda random..idk how reasonable that advice is. but an eigenvector of a matrix is a vector that when multiplied by that matrix is only scaled by a constant factor

#

that constant factor is the eigenvalue for that eigenvector

agile sundial Nov 24, 2022, 6:15 PM

#

fiery cosmos i think its something like: ```py for i in range(n): for i+1 in range(n) ```

you're probably thinking of

for i in range(n):
  for j in range(i + 1, n):
    ...

fiery cosmos Nov 24, 2022, 6:16 PM

#

agile sundial seems kinda random..idk how reasonable that advice is. but an eigenvector of a m...

idk if it helps but for context the data would be in bioinformatics so like gene expression data

agile sundial Nov 24, 2022, 6:17 PM

#

probably, yeah

agile sundial Nov 24, 2022, 6:17 PM

#

fiery cosmos idk if it helps but for context the data would be in bioinformatics so like gene...

i don't know anything about that ¯_(ツ)_/¯

fiery cosmos Nov 24, 2022, 6:17 PM

#

but what i am reading is a list of lists where the first value of each list is the name and the second is the pairwise comparison data. so i'll modify

fiery cosmos Nov 24, 2022, 6:18 PM

#

agile sundial i don't know anything about that ¯\_(ツ)_/¯

fair enough

#

gene expression data is really straightforward, you have a bunch of genes and their lvl of expression is the number of mRNA molecules that were counted. so you'll have:

geneA  1200
geneB  100
geneC  0
geneD  17
geneE  51

etc

haughty mountain Nov 24, 2022, 6:50 PM

#

conceptually eigenvalues and eigenvectors are easy

#

say you have some matrix M

#

then vectors v and constants λ such that
M v = λ v
are the eigenvectors and eigenvalues of the matrix

#

i.e., what are the vectors for which the linear transformation M is effecticely just multiplying by a constant

fiery cosmos Nov 24, 2022, 7:00 PM

#

😵‍💫

#

sounds like i'll need to do some reading to understand

#

struggling with my recent algo. cannot paste here i'll dm

haughty mountain Nov 24, 2022, 7:01 PM

#

basically in which directions does the data expand/contract

#

if you have a eigenvalue > 1 in some direction, vectors pointing in that direction will grow larger

#

< 1 and they would become smaller

fiery cosmos Nov 24, 2022, 7:03 PM

#

haughty mountain i.e., what are the vectors for which the linear transformation M is effecticely ...

what is the linear transformation M

haughty mountain Nov 24, 2022, 7:04 PM

#

e.g. a matrix

#

a matrix performs a linear transformatiom

fiery cosmos Nov 24, 2022, 7:04 PM

#

oh

#

i'm thinking of a matrix to store data, not as a functional structure

#

perhaps inaccurately so

#

above: i can just add a bunch of append statements yeah

haughty mountain Nov 24, 2022, 7:05 PM

#

I would totally use yield there

fiery cosmos Nov 24, 2022, 7:05 PM

#

and build a string

haughty mountain Nov 24, 2022, 7:07 PM

#

the function name is now totally misleading, but still

def print_lcs(b, string_a, i, j):
    if i == 0 or j == 0:
        return
    if b[i-1][j-1] == 3:
        yield from print_lcs(b, string_a, i-1, j-1)
        yield string_a[i-1]
    elif b[i-1][j-1] == 2:
        yield from print_lcs(b, string_a,i-1,j)
    else:
        yield from print_lcs(b,string_a,i,j-1)

fiery cosmos Nov 24, 2022, 7:07 PM

#

right i see what u mean. ok ill try

haughty mountain Nov 24, 2022, 7:08 PM

#

then something like ''.join(print_lcs(...))

fiery cosmos Nov 24, 2022, 7:09 PM

#

so it's working great, just every char gets its own newline, which i do not want

#

data in out is a list of lists where element zero is string name and element [0][1] is string to compare

haughty mountain Nov 24, 2022, 7:10 PM

#

better to build a string and then printing it, doing some hacky prints in functions is generally a bad idea

fiery cosmos Nov 24, 2022, 7:10 PM

#

i agree

#

but uhh.. do i need to use yield?

#

i can just make a string and add a bunch of append()s?

haughty mountain Nov 24, 2022, 7:11 PM

#

doing that recursively feels annoying

#

yield is the neater tool here

fiery cosmos Nov 24, 2022, 7:11 PM

#

ok so i'll add those yield statements and then have each line wrapped in an append

#

?

haughty mountain Nov 24, 2022, 7:12 PM

#

wdym?

fiery cosmos Nov 24, 2022, 7:12 PM

#

or with yield i don't need append

haughty mountain Nov 24, 2022, 7:12 PM

#

exactly

fiery cosmos Nov 24, 2022, 7:12 PM

#

it'll just tack onto a string?

#

do i do like string += yield xyz

haughty mountain Nov 24, 2022, 7:13 PM

#

it returns the values you yield one by one

fiery cosmos Nov 24, 2022, 7:13 PM

#

how do i yield them into a string

haughty mountain Nov 24, 2022, 7:14 PM

#

!e

def f():
  yield "a"
  yield "b"
  yield "c"

print(f())
print(list(f()))
print("".join(f()))

halcyon plankBOT Nov 24, 2022, 7:14 PM

#

@haughty mountain :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | <generator object f at 0x7f531971c880>
002 | ['a', 'b', 'c']
003 | abc

fiery cosmos Nov 24, 2022, 7:15 PM

#

🤔

#

ah beautiful

#

👌

#

much perfection

haughty mountain Nov 24, 2022, 7:20 PM

#

generator functions are great for building sequences like this

fiery cosmos Nov 24, 2022, 7:20 PM

#

what other generators are common aside from yielf

#

yield

haughty mountain Nov 24, 2022, 7:21 PM

#

yield is what makes a generator function

fiery cosmos Nov 24, 2022, 7:21 PM

#

oh oh you use it to confer generator functionality to any function you're writing i got it

haughty mountain Nov 24, 2022, 7:24 PM

#

yielding also makes the function lazy

#

!e

def squares():
  i = 0
  while True:
    yield i**2
    i += 1

for sq in squares():
  if sq > 90:
    break
  print(sq)

halcyon plankBOT Nov 24, 2022, 7:26 PM

#

@haughty mountain :white_check_mark: Your 3.11 eval job has completed with return code 0.

haughty mountain Nov 24, 2022, 7:27 PM

#

that generator function generates all the squares 😄

#

(if you ask for them)

fiery cosmos Nov 24, 2022, 7:27 PM

#

oh wow it'd just keep on going

haughty mountain Nov 24, 2022, 7:27 PM

#

e.g. list(squares()) would be a bad idea

fiery cosmos Nov 24, 2022, 7:28 PM

#

lol

#

i think the next thing i want to learn is using more than a single processor of my cpu while computing, or even use the GPU

haughty mountain Nov 24, 2022, 7:35 PM

#

parallelism in python? probably not

fiery cosmos Nov 24, 2022, 7:36 PM

#

oof

agile sundial Nov 24, 2022, 7:39 PM

#

using the GPU is (somewhat) straightforward. lots of modules for that

fiery cosmos Nov 24, 2022, 7:42 PM

#

thats cool. i guess i dont really have any applications for that quite yet. although the strassen's matrix multiplier was really slow. could have been better there

vocal gorge Nov 24, 2022, 7:42 PM

#

~~i sure wonder what language you could do it in~~ 🦀

haughty mountain Nov 24, 2022, 7:44 PM

#

||C ||

opal oriole Nov 24, 2022, 8:37 PM

#

fiery cosmos i'm thinking of a matrix to store data, not as a functional structure

Try thinking of numbers not as "how much" but as actions, e.g. +3 -> add three or move three to the right, *3 -> scale up / stretch. Then consider M again as a more fancy number, you can multiply it with other stuff. So like with normal numbers, it does a transformation / action.

#

(data <-> code (data is code and code is data))

opal oriole Nov 24, 2022, 8:42 PM

#

fiery cosmos i think the next thing i want to learn is using more than a single processor of ...

The easiest way in Python is via Numba.

fiery cosmos Nov 24, 2022, 8:44 PM

#

ok thanks @opal oriole

haughty mountain Nov 24, 2022, 8:45 PM

#

you can't do any fancier stuff though

#

I remember in an HPC class we had a thread delegating work to worker threads and another thread writing to file as results came back

vocal gorge Nov 24, 2022, 8:46 PM

#

oh, that's actually efficient?

#

i always wonder if such designs are a good idea whenever I'm writing multithreaded anything

opal oriole Nov 24, 2022, 8:48 PM

#

The threading Numba provides is not for things like IO, only numeric stuff, and it does so naively. But the performance gain to coding effort ratio is great.

haughty mountain Nov 24, 2022, 8:49 PM

#

actually I think there wasn't even active logic to delegate work, we set up a list of tasks protected by a mutex, and spun up a bunch of worker threads to take on tasks

#

and then the main thread became the writer thread

opal oriole Nov 24, 2022, 8:50 PM

#

Optimal multithreading is kind of crazy on modern machines, it's rarely done.

lament totem Nov 24, 2022, 8:50 PM

#

haughty mountain I remember in an HPC class we had a thread delegating work to worker threads and...

It's called master-slave paradigm iirc, but I think that name is not allowed anymore 😛

vocal gorge Nov 24, 2022, 8:50 PM

#

haughty mountain actually I think there wasn't even active logic to delegate work, we set up a li...

not a fancy job-stealing queue? :p

haughty mountain Nov 24, 2022, 8:50 PM

#

vocal gorge oh, that's actually efficient?

but yes, as long as you can make threads work mostly independently you can have great speedups

maiden urchin Nov 24, 2022, 8:51 PM

#

wrong channel

haughty mountain Nov 24, 2022, 8:51 PM

#

granted, our single threaded code beat our professor's multithreaded target time 😛

opal oriole Nov 24, 2022, 8:52 PM

#

I recommend always trying to squeeze more out of single core first, because modern cores are so fast (for less complexity).

haughty mountain Nov 24, 2022, 8:52 PM

#

but our stuff also scaled well with threads

#

we were actually bottlenecked by fprintf

vocal gorge Nov 24, 2022, 8:52 PM

#

haughty mountain granted, our single threaded code beat our professor's multithreaded target time...

good old https://www.frankmcsherry.org/graph/scalability/cost/2015/01/15/COST.html

Scalability! But at what COST?

Michael Isard, Derek Murray, and I recently sent in a HotOS submission (it’s not blind, so no harm talking about it, we think). The subject is hinted at from...

opal oriole Nov 24, 2022, 8:52 PM

#

haughty mountain we were actually bottlenecked by fprintf

*Which funnily enough Python does faster due to buffering stuff.

fiery cosmos Nov 24, 2022, 8:53 PM

#

i love how i have absolutely no idea what y'all are talking about

haughty mountain Nov 24, 2022, 8:53 PM

#

so we switched to mmap and manually printing integers

#

and then we were finally kinda limited by hardware speeds 😛

fiery cosmos Nov 24, 2022, 8:54 PM

#

i think im going to have to learn AWS at some point. that's industry standard if the company doesn't have their own comp. cluster

haughty mountain Nov 24, 2022, 8:55 PM

#

there are others

fiery cosmos Nov 24, 2022, 8:55 PM

#

it'd be cooler to build out a cluster and do all in-house computation 😛

#

thats what they did at an academic lab i was a part of. but when i was in industry they used AWS

haughty mountain Nov 24, 2022, 8:56 PM

#

let me just map-reduce a few petabytes of data with these 10k machines

opal oriole Nov 24, 2022, 8:57 PM

#

fiery cosmos it'd be cooler to build out a cluster and do all in-house computation 😛

The timing on that project is a bit rough as getting chips for such clusters shot up in price in the last couple of years. Raspberry PI for $300...

fiery cosmos Nov 24, 2022, 8:58 PM

#

yeah i'd be interested which hardware was used to create the cluster

#

i just got a raspberry pi for a project for super cheap.. it was like $20 USD

haughty mountain Nov 24, 2022, 9:00 PM

#

(I saw discussion at work about some compute cluster having actually running into issues of using 64 bit integers for addressable storage)

fiery cosmos Nov 24, 2022, 9:00 PM

#

translate pls

opal oriole Nov 24, 2022, 9:01 PM

#

haughty mountain (I saw discussion at work about some compute cluster having actually running int...

Yeah that is happening more and more now. "Big Data" / everyone doing data science.

haughty mountain Nov 24, 2022, 9:01 PM

#

32 bit numbers can address ~4GB of data, 64 bits could address...a lot more

#

how much is it again?

#

4EiB?

fiery cosmos Nov 24, 2022, 9:02 PM

#

and ppl are still running out of space

#

is the fact you were pointing out

opal oriole Nov 24, 2022, 9:02 PM

#

16 I think.

haughty mountain Nov 24, 2022, 9:03 PM

#

opal oriole 16 I think.

correct

You have: 2**64 bytes
You want: EiB
    * 16
    / 0.0625

haughty mountain Nov 24, 2022, 9:03 PM

#

fiery cosmos and ppl are still running out of space

yes, someone was questioning why utilities dealing with byte sizes was using 128 bit integers

#

and got the response that 64 bits actually started causing issues

fiery cosmos Nov 24, 2022, 9:04 PM

#

👀

haughty mountain Nov 24, 2022, 9:04 PM

#

fiery cosmos and ppl are still running out of space

for reference, 16 EiB is 16777216 TiB

fiery cosmos Nov 24, 2022, 9:05 PM

#

TiB?

haughty mountain Nov 24, 2022, 9:05 PM

#

Tibibytes, just to be pedantic about it being the base 2 version

fiery cosmos Nov 24, 2022, 9:06 PM

#

wth are tibibytes

haughty mountain Nov 24, 2022, 9:06 PM

#

18446744 TB if you prefer that

haughty mountain Nov 24, 2022, 9:06 PM

#

fiery cosmos wth are tibibytes

so computers like base 2

#

our prefixes like kilo, mega, ... don't

fiery cosmos Nov 24, 2022, 9:06 PM

#

like binary?

haughty mountain Nov 24, 2022, 9:07 PM

#

e.g. 1kb = 1000bytes

#

which is not that nice for computing

#

so it's common to use 1kib = 1024bytes

#

kibibytes

fiery cosmos Nov 24, 2022, 9:07 PM

#

ohh got it

opal oriole Nov 24, 2022, 9:07 PM

#

kb used to mean 1024, but they changed it because it confused consumers when they saw the numbers on the boxes.

haughty mountain Nov 24, 2022, 9:08 PM

#

yeah...

fiery cosmos Nov 24, 2022, 9:08 PM

#

well yeah

#

1kb in my world is 1k base pairs

#

anyone know how to create a histogram from an image?

haughty mountain Nov 24, 2022, 9:08 PM

#

harddrive manufacturers love the base 10 version

#

because they can claim higher numbers

opal oriole Nov 24, 2022, 9:08 PM

#

And so depending on who you ask, they will tell you that kb is still 1024, if they are being stubborn or depending on context (e.g. a kernel's code).

fiery cosmos Nov 24, 2022, 9:09 PM

#

i have all SSDs in my PC 🙂

haughty mountain Nov 24, 2022, 9:09 PM

#

opal oriole And so depending on who you ask, they will tell you that kb is still 1024, if th...

or you are the pedantic nerd in the room who uses the base 2 prefixes

fiery cosmos Nov 24, 2022, 9:09 PM

#

i agree it should be changed, you cannot claim kilo is anything other than 1000 of something

opal oriole Nov 24, 2022, 9:09 PM

#

(Like with math, just define it beforehand, then continue)

fiery cosmos Nov 24, 2022, 9:10 PM

#

kilogram, kilodalton, kilometer

haughty mountain Nov 24, 2022, 9:10 PM

#

kilogram being the SI unit is fun

fiery cosmos Nov 24, 2022, 9:10 PM

#

i usually work in terms of like micromolar or nM (millions or billions of a mol / L)

#

μM = micromolar

#

although that world will be long forgotten if i continue the computational route

#

sry, not on topic

opal oriole Nov 24, 2022, 9:13 PM

#

Unless you start doing simulations or other tools for that.

#

In which DS&A come up way more than other kinds of programming.

fiery cosmos Nov 24, 2022, 9:15 PM

#

i doubt i'll go into chemical informatics but i suppose it's possible. i've had 4 semesters of chemistry and i was really good at organic which a lot of people flounder at

haughty mountain Nov 24, 2022, 9:17 PM

#

let me see if I can find some old stuff from my HPC course

#

the threading task I was talking about was about generating Newton fractals

#

#

we learned that writing the complex math by hand was 2x faster :^)

#

and terrible to write and look at

#

looking at the current site for the course (which has changed over time) running our code single threaded code is about at the limit for 10 threads

#

https://hpc.raum-brothers.eu/chalmers.html

fiery cosmos Nov 24, 2022, 9:32 PM

#

haughty mountain

woah

haughty mountain Nov 24, 2022, 9:32 PM

#

actually, maybe I can find the original constraints we had

fiery cosmos Nov 24, 2022, 9:33 PM

#

how was Gothenburg

haughty mountain Nov 24, 2022, 9:34 PM

#

I liked it

fiery cosmos Nov 24, 2022, 9:34 PM

#

i was asking about why mathematicians are so preoccupied with shapes like above the other day

haughty mountain Nov 24, 2022, 9:35 PM

#

ah, found the old thing http://www.math.chalmers.se/Math/Grundutb/CTH/tma881/1617/assignments.html#optimization

#

#

I'm re-running our old code on my a tad more modern hardware

#

for fun

#

-rw-r--r-- 1 algmyr algmyr 7.2G Nov 24 22:39 /tmp/newton_attractors_x7.ppm
-rw-r--r-- 1 algmyr algmyr 8.5G Nov 24 22:39 /tmp/newton_convergence_x7.ppm

#

the file sizes for the 50k lines are a bit chunky

fiery cosmos Nov 24, 2022, 9:44 PM

#

what does this code do?

#

oh newton fractals

haughty mountain Nov 24, 2022, 9:44 PM

#

it generates image files for the fractals yeah

#

50k x 50k pixels

#

which is kinda ridiculous

fiery cosmos Nov 24, 2022, 9:44 PM

#

woah

haughty mountain Nov 24, 2022, 9:45 PM

#

the 1 thread version is almost finished

fiery cosmos Nov 24, 2022, 9:45 PM

#

The Newton fractal is a boundary set in the complex plane which is characterized by Newton's method applied to a fixed polynomial p(Z) ∈ ℂ[Z] or transcendental function. It is the Julia set of the meromorphic function z ↦ z − p(z)p′(z) which is given by Newton's method.

#

https://tenor.com/view/nope-gif-23756243

Tenor

haughty mountain Nov 24, 2022, 9:46 PM

#

lol, these times are great

              1000    50000
 1  thread  0.130s   307.7s
10 threads  0.021s   40.91s

vocal gorge Nov 24, 2022, 9:46 PM

#

fiery cosmos The Newton fractal is a boundary set in the complex plane which is characterized...

https://www.youtube.com/watch?v=-RdOwhmqP5s

YouTube

3Blue1Brown

From Newton’s method to Newton’s fractal (which Newton knew nothing...

Who knew root-finding could be so complicated?
Next part: https://youtu.be/LqbZpur38nw
Special thanks to the following supporters: https://3b1b.co/lessons/newtons-fractal#thanks
An equally valuable form of support is to simply share the videos.

Interactive for this video:
https://www.3blue1brown.com/lessons/newtons-fractal

...

▶ Play video

haughty mountain Nov 24, 2022, 9:47 PM

#

our aim for all tasks was basically to beat the target time with one thread

#

and I think we succeeded basically all the time

#

Here is the short report we wrote, we used some cute math tricks
http://algmyr.se/upload/newton.pdf

#

(it's not a long report)

fiery cosmos Nov 24, 2022, 9:56 PM

#

mathematical cunningness and laziness

#

lmao

#

can you really put stuff like that in your academic work

haughty mountain Nov 24, 2022, 9:57 PM

#

this is just a hand-in, not like an article 😛

#

(and this was probably written at some point in the middle of the night)

fiery cosmos Nov 24, 2022, 9:57 PM

#

yeah im seeing spelling errors

#

what are race conditions

haughty mountain Nov 24, 2022, 9:58 PM

#

what did we misspell?

fiery cosmos Nov 24, 2022, 9:58 PM

#

haughty mountain Nov 24, 2022, 9:58 PM

#

ah

#

race conditions is when two (or more) threads is working on the same data at the same time

vocal gorge Nov 24, 2022, 9:59 PM

#

z should be z_n in this formula

haughty mountain Nov 24, 2022, 9:59 PM

#

which can lead to...issues

#

shush

#

yes

fiery cosmos Nov 24, 2022, 9:59 PM

#

lol

vocal gorge Nov 24, 2022, 9:59 PM

#

lemme just critique the spelling of your years-old work real quick 😛

fiery cosmos Nov 24, 2022, 9:59 PM

#

what is mutex

haughty mountain Nov 24, 2022, 10:00 PM

#

I think the convergence trick was quite neat

#

cheaper, and requires no knowledge about the exact answer

fiery cosmos Nov 24, 2022, 10:00 PM

#

my spelling comment was more to confirm "yes, i can tell it was written in the middle of the night as you have just stated" 😛

vocal gorge Nov 24, 2022, 10:01 PM

#

hate ∆ as a variable name tbh, I keep reading "laplacian, wtf??"

haughty mountain Nov 24, 2022, 10:01 PM

#

fiery cosmos what is mutex

mutex stands for mutual exclusion

#

basically, only one thing can hold the mutex at once

fiery cosmos Nov 24, 2022, 10:01 PM

#

"Worker works workingly"

#

haha

haughty mountain Nov 24, 2022, 10:01 PM

#

think of it as only being allowed to do things when you hold a specific object

#

you pick it up, do your work, and put it down

#

if the thing is already picked up, sucks to be you, you have to wait

fiery cosmos Nov 24, 2022, 10:02 PM

#

sometimes during parties people will have a speaking stick (some random object) and only the person holding it is allowed to speak

#

sounds like that

haughty mountain Nov 24, 2022, 10:03 PM

#

a bit

#

I think we avoided most use of mutexes in this code

#

it's basically only used when picking up new tasks

#

so multiple threads don't pick up the same task

haughty mountain Nov 24, 2022, 10:05 PM

#

vocal gorge hate ∆ as a variable name tbh, I keep reading "laplacian, wtf??"

we probably should have used δ instead, but annoying people with Δ is fun too

fiery cosmos Nov 24, 2022, 10:05 PM

#

TIL: unsolvability of the quintic

#

enter newton's method

haughty mountain Nov 24, 2022, 10:06 PM

#

this is good life advice for programmers

fiery cosmos Nov 24, 2022, 10:06 PM

#

when does a linear approximation to the function around a value x equal zero

#

yeah i suffer from trying to develop the plan within an IDE.. need to work out on paper first

#

although it somehow worked today to get some code running

#

sometimes i can see what i'm trying to do better when all the variables are at hand

vocal gorge Nov 24, 2022, 10:08 PM

#

haughty mountain Here is the short report we wrote, we used some cute math tricks http://algmyr.s...

great stuff, thanks

haughty mountain Nov 24, 2022, 10:08 PM

#

rare usage of the because symbol

vocal gorge Nov 24, 2022, 10:09 PM

#

I think i'd use impliedby for that 😛

vocal gorge Nov 24, 2022, 10:10 PM

#

haughty mountain ah, found the old thing http://www.math.chalmers.se/Math/Grundutb/CTH/tma881/161...

Write a C program sum that computes naively and outputs the sum of the first billion integers. The makefile should contain
already too busy screaming, brb

fiery cosmos Nov 24, 2022, 10:10 PM

#

nope.jpg

haughty mountain Nov 24, 2022, 10:11 PM

#

the last task was very dumb

#

solve a Dijkstra problem with distributed computing

vocal gorge Nov 24, 2022, 10:11 PM

#

interesting tasks though, I wonder which of them make sense in rust

haughty mountain Nov 24, 2022, 10:12 PM

#

haughty mountain solve a Dijkstra problem with distributed computing

which is very much BS since Djikstra really doesn't benefit from it

vocal gorge Nov 24, 2022, 10:12 PM

#

haughty mountain which is very much BS since Djikstra really doesn't benefit from it

something something scalability but at worth cost

haughty mountain Nov 24, 2022, 10:12 PM

#

I think the intended thing was to find the min in a distributed way

but...we could just use a priority queue...

fiery cosmos Nov 24, 2022, 10:12 PM

#

one of the best academia things i ever solved was determining structures of chemicals from proton NMR data. a classmate of mine and I were working on some homework and perfectly drew the structure which was a cyclic structure with a bridge on the top

#

wish i could find that problem

#

its buried in an organic chem book somewhere

haughty mountain Nov 24, 2022, 10:15 PM

#

we did actually make use of the distributed thing though, just because

You can solve the problem quicker if you assume that the longest path is below some limit, it means you can also throw away a lot of the edges. So start different computers with different assumptions about the shortest path

#

and whichever computer gives an answer first we take

#

and kill the other computers

#

for another one of the tasks we tried to implement fft on a GPU instead of using the GPU for computing convolutions

#

sadly we had precision issues that we couldn't resolve in time

#

we had a working impl of fft on a gpu though

fiery cosmos Nov 24, 2022, 10:18 PM

#

i'm watching this 3blue1brown vid and i have no idea what he's on about

#

fft?

#

oh something fourier transform?

haughty mountain Nov 24, 2022, 10:19 PM

#

fast fourier transform

#

one of the most important algorithms ever

fiery cosmos Nov 24, 2022, 10:19 PM

#

uh oh

#

sounds like i need to learn that

haughty mountain Nov 24, 2022, 10:19 PM

#

probably not

#

but it's super important for so much technology

agile sundial Nov 24, 2022, 10:20 PM

#

my cs prof always talks about how one of his former students scanned his wife's sinuses and he could see all the layers in real time with an MRI or something and that back in the day it would take a week to process

vocal gorge Nov 24, 2022, 10:22 PM

#

fun application of... that's not even a fourier transform, just a single-frequency component of it... is synchronous detection. If you want to detect a signal with known frequency, you can do that very well even when you have a lot (orders of magnitude more than the signal) of noise.

haughty mountain Nov 24, 2022, 10:22 PM

#

iirc you can extract a single frequency quicker than by doing a full fft as well

vocal gorge Nov 24, 2022, 10:23 PM

#

yeah, sure, fft is n log n, you can do one in n

fiery cosmos Nov 24, 2022, 10:23 PM

#

agile sundial my cs prof always talks about how one of his former students scanned his wife's ...

we're getting to the point where ultrasounds are cheap enough now to have one in the house and look at things when you're having pain, and yet, interpreting the data probably requires a medical degree

vocal gorge Nov 24, 2022, 10:24 PM

#

you basically just compute

average = signal.mean()
amplitude = np.hypot(np.mean(signal * np.sin(freq*time)), np.mean(signal * np.cos(freq*time)))

haughty mountain Nov 24, 2022, 10:24 PM

#

Gilbert Strang, author of the classic textbook Linear Algebra and Its Applications, once referred to the fast Fourier transform, or FFT, as “the most important numerical algorithm in our lifetime.”

fiery cosmos Nov 24, 2022, 10:24 PM

#

wth is a discrete fourier transform

vocal gorge Nov 24, 2022, 10:24 PM

#

vocal gorge you basically just compute ```py average = signal.mean() amplitude = np.hypot(np...

and this can get you insane sensitivity to differences in frequency with long enough "exposure" (number of points) - basically all other frequencies get filtered away.

fiery cosmos Nov 24, 2022, 10:25 PM

#

its pretty interesting how in most modern tech, multiple different fields are converging

haughty mountain Nov 24, 2022, 10:25 PM

#

a discrete version of the usual fourier transform :^)

vocal gorge Nov 24, 2022, 10:25 PM

#

fiery cosmos wth is a discrete fourier transform

well, in digital computing you don't get continious signals - you measure the signal once every 5 microseconds or whatever and get a stream of values like that.

agile sundial Nov 24, 2022, 10:25 PM

#

the fourier transform but discrete

vocal gorge Nov 24, 2022, 10:26 PM

#

vocal gorge well, in digital computing you don't get continious signals - you measure the si...

so you need to adapt the fourier transform math to work with sums rather than integrals. That's the DFT.

haughty mountain Nov 24, 2022, 10:26 PM

#

but as a more serious answer, one view of fourier transforms is that you can go from a time representation of a signal (e.g. a waveform of sound) to an equivalent frequency representation of a signal

#

an interesting view of DFT is in the context of polynomials, where the regular representation is a bunch of function values, and the transformed version are the coefficients of the polynomial

fiery cosmos Nov 24, 2022, 10:28 PM

#

i'm reading the discrete fourier transform wiki and my head is spinning

haughty mountain Nov 24, 2022, 10:28 PM

#

this also leads to fun consequences, multiplying polynomials by only knowing the coefficients is expensive

#

bit if I knew function values it would be trivial

#

so you can use fft to transform into function values, multiply, and then transform back

#

avoiding the usual expensive O(n^2) multiplication

#

in the context of the task we were supposed to do on the GPU, it was basically do a heat transfer simulation, which boiled down to doing an convolution against a specific kernel over and over

#

being more math savvy we immediately saw that we could do this much faster if we used fourier transform

fiery cosmos Nov 24, 2022, 10:32 PM

#

heat transfer -> boiled down

vocal gorge Nov 24, 2022, 10:33 PM

#

i wonder if you can literally just... ask some distributed computing library like opencl here to implement a convolution as a fourier transform

haughty mountain Nov 24, 2022, 10:33 PM

#

because convolution is just regular pointwise multiplication in the transformed world, which can be computed quickly even for a huge number of iterations

vocal gorge Nov 24, 2022, 10:33 PM

#

vocal gorge i wonder if you can literally just... ask some distributed computing library lik...

since it's a very normal thing to do

haughty mountain Nov 24, 2022, 10:33 PM

#

I really doubt it would be faster than doing it on a cpu

vocal gorge Nov 24, 2022, 10:33 PM

#

ah, that's fair

haughty mountain Nov 24, 2022, 10:33 PM

#

we wanted to do it on a gpu because we could

vocal gorge Nov 24, 2022, 10:34 PM

#

...and because the assignment asks for it 😛

haughty mountain Nov 24, 2022, 10:34 PM

#

for one interpretation of asks

#

the intended solution was for sure just to do the convolutions

#

vocal gorge Nov 24, 2022, 10:36 PM

#

I was going to say that scipy's convolve uses fft, but looking at the code... looks like only the signal one does

#

and the ndimage one doesn't

#

at least, can't easily see it

haughty mountain Nov 24, 2022, 10:38 PM

#

I can recommend looking at Kahan summation, it's a nice technique

vocal gorge Nov 24, 2022, 10:39 PM

#

ah, the thing math.fsum does presumably

#

and yeah, consumer(gaming) GPUs are usually so slow at doubles it's not worth it

#

like, it depends on how they are split between 32-bit and 64-bit processing units AFAIK, and GPUs for graphics generally lean hard 32-bit (ones made for compute generally have a more even split)

haughty mountain Nov 24, 2022, 10:40 PM

#

I think our main problems were that this was basically the first time we wrote an fft

#

so we made some dumb mistakes

vocal gorge Nov 24, 2022, 10:41 PM

#

ah, fair

haughty mountain Nov 24, 2022, 10:42 PM

#

we had so much fun doing stupid stuff in this course

vocal gorge Nov 24, 2022, 10:43 PM

#

a thousand-line FFT in C and OpenCL is an interesting definition of fun 😛

agile sundial Nov 24, 2022, 10:43 PM

#

it's the friends you made along the way 🥺

haughty mountain Nov 24, 2022, 10:49 PM

#

We were also super pedantic in one task about getting exact solutions

#

the cell distance one on the website

#

iirc the professor's solution wasn't even exact

#

Just a slight brag about beating the professor's reference solution while also guaranteeing perfect accuracy

#

Turns out using a float sqrt and correcting the small error afterwards is a viable thing to do for the input we had. So we could process twice the values at a time in our SIMD

#

I also suspect we have some UB in the code

#

needless to say this was probably one of my favorite courses since I ended up getting the opportunity of doing dumb algorithms, math and low level optimizations with a friend of mine 😄

amber hare Nov 24, 2022, 11:25 PM

#

Hello everyone!

#

I came across this server as I like python (not favorite lang) but have gotten exposure to logo_numpy and logo_pandas_white during data mining and also like writing python code for some alg problems!

opal oriole Nov 25, 2022, 1:34 AM

#

vocal gorge hate ∆ as a variable name tbh, I keep reading "laplacian, wtf??"

For me it's the other way around. I don't like the use of capital delta for the Laplacian.

fiery cosmos Nov 25, 2022, 2:05 PM

#

Anyone got good recommendations to learn data structures and algorithms of python??

wise flax Nov 25, 2022, 3:32 PM

#

fiery cosmos Anyone got good recommendations to learn data structures and algorithms of pytho...

Check the pins

#

What are the potential use cases of using a column as an index in a Pandas dataframe while also keeping the original column, e.g.

df = pd.DataFrame(data)
df = df.set_index("Name", drop = False)

Is this useful in some obscure edge cases or is my imagination too limited?

vocal gorge Nov 25, 2022, 4:37 PM

#

sometimes you have two unique columns, like maybe user ids and user names, and want to switch which one is the index without dropping the other, perhaps

fiery cosmos Nov 25, 2022, 6:08 PM

#

if __name__ == "__main__":
    main()

#

what's this whole mess about

fiery cosmos Nov 25, 2022, 6:08 PM

#

fiery cosmos Anyone got good recommendations to learn data structures and algorithms of pytho...

DS courses are typically language agnostic

#

can someone help me implement a flag to include an optional output

#

ah shit nvm. i need to be able to handle input that is both one and multiple lines

haughty mountain Nov 25, 2022, 6:29 PM

#

fiery cosmos ```py if __name__ == "__main__": main() ```

I showed you this some time ago I'm pretty sure

#

!main

halcyon plankBOT Nov 25, 2022, 6:29 PM

#

if __name__ == '__main__'

This is a statement that is only true if the module (your source code) it appears in is being run directly, as opposed to being imported into another module. When you run your module, the __name__ special variable is automatically set to the string '__main__'. Conversely, when you import that same module into a different one, and run that, __name__ is instead set to the filename of your module minus the .py extension.

Example

# foo.py

print('spam')

if __name__ == '__main__':
    print('eggs')

If you run the above module foo.py directly, both 'spam'and 'eggs' will be printed. Now consider this next example:

# bar.py

import foo

If you run this module named bar.py, it will execute the code in foo.py. First it will print 'spam', and then the if statement will fail, because __name__ will now be the string 'foo'.

Why would I do this?

• Your module is a library, but also has a special case where it can be run directly
• Your module is a library and you want to safeguard it against people running it directly (like what pip does)
• Your module is the main program, but has unit tests and the testing framework works by importing your module, and you want to avoid having your main code run during the test

fiery cosmos Nov 25, 2022, 6:30 PM

#

"This is a statement that is only true if the module (your source code) it appears in is being run directly"

#

ok that seems to always be the case for my progs

#

sooo my program was working beautifully but i think it falls apart when one of my input sequences spans multiple lines

#

bc of the way i am reading the input

#

here is how i am reading input:

#

which works great for my example input, where each string is on its own line. however, i wanted to test my program with much longer sequences (actual gene sequences from different species) and it breaks

#

so im wondering how i can write it to handle both single-lined strings and strings which span many lines

#

🤔

#

maybe like py if char == '\n': pass
or some logic like that

haughty mountain Nov 25, 2022, 6:38 PM

#

what's the input format?

fiery cosmos Nov 25, 2022, 6:39 PM

#

text file that looks like this

#

or in my test case, each string declared spanned many lines

haughty mountain Nov 25, 2022, 6:39 PM

#

that's what I was asking, an example of that?

fiery cosmos Nov 25, 2022, 9:32 PM

#

yo guys

brittle moat Nov 26, 2022, 12:09 AM

#

does anyone know how to do this?

storm bridge Nov 26, 2022, 12:14 AM

#

agile sundial Nov 26, 2022, 12:20 AM

#

brittle moat does anyone know how to do this?

it seems pretty straightforward. just sum up the transfers in and out and the fee

brittle moat Nov 26, 2022, 12:20 AM

#

can you give me code for checking the condition on calculating the fee?

agile sundial Nov 26, 2022, 12:21 AM

#

what have you tried so far?

brittle moat Nov 26, 2022, 12:22 AM

#

comparing datetime objs and calculating that way but it's not dynamic enough

#

and I don't think that's how you're supposed to do it in actual interviews

agile sundial Nov 26, 2022, 12:24 AM

#

wdym "it's not dynamic enough"

brittle moat Nov 26, 2022, 12:26 AM

#

shouldn't the fee here only be 10/mo? why is it 60?

agile sundial Nov 26, 2022, 12:28 AM

#

yeah, that looks wrong

#

it's not 10/mo, but 10 total since there's 2 months

#

ah

#

it's asking "at the end of the year 2020", not at the end of the data

#

so that's 12 * 5, which accounts for the 60

brittle moat Nov 26, 2022, 12:29 AM

#

so it auto applies a fee of 60$/year unless transactions are made 3x within that month

agile sundial Nov 26, 2022, 12:30 AM

#

at least 3x, and they must sum to greater than 100

#

but yeah

brittle moat Nov 26, 2022, 12:58 AM

#

agile sundial at least 3x, and they must sum to greater than 100

do you recommend placing things in a dict by dict, if not, how should I compare dt objects

agile sundial Nov 26, 2022, 12:58 AM

#

i don't see how you'd use a dict for this, unless you mean using the months as keys?

brittle moat Nov 26, 2022, 12:58 AM

#

yeah

agile sundial Nov 26, 2022, 12:58 AM

#

why not a list

brittle moat Nov 26, 2022, 12:58 AM

#

how would I compare dt objects

agile sundial Nov 26, 2022, 12:59 AM

#

i just wouldn't use dt objects. i would just get the month out of the string directly

#

and you'd only have dates, not datetimes

wide gale Nov 26, 2022, 3:43 AM

#

haughty mountain it's a bad format, but that's what they are using

thanks, I get it since you explained it quite well. my question is out of curiosity what would be a good way to serialize it/compress it?

gloomy herald Nov 26, 2022, 7:44 AM

#

Hello

#

Has anyone heard of sloot digital coding system? So my friend randomly saw this video on YouTube youtu.be/KOvoD1upTxM and he sought after making the algorithm himself. From what I read jan sloot's algorithm was mathematically not possible but somehow my friend has come up with a very basic code that as per him applies the same theory. Now we want to convert it for large scale picture and documents for compression. But again as I mentioned before lot of ppl say that it's mathematically impossible. Here is his code:

#

#

We wanted to work on it so that we can offer a service that provides compression by a factor of 10

#

Now do note that the video he looked up was a Google developers video that was posted on the 1st of April so we are in the blue whether the thing is possible or not but since this code has been working he has been very adamant that the theory works

haughty mountain Nov 26, 2022, 7:49 AM

#

gloomy herald Has anyone heard of sloot digital coding system? So my friend randomly saw this ...

seems like a joke to me, though it's technically possible

#

it's amazingly hard to try to compress anything of non-trivial length

#

it's kind of the best kind of programming joke, it's technically correct but wildly wrong in all other ways

gloomy herald Nov 26, 2022, 7:54 AM

#

haughty mountain it's kind of the best kind of programming joke, it's technically correct but wil...

So it works with small amount of data but say with a 4k image it goes brr

#

Can you explain in a bit more detail please

haughty mountain Nov 26, 2022, 7:55 AM

#

so let's do a binary stream for simplicity, you want to have a seed that ends up generating the N bits you want

#

you would expect to have to try something like 2^N seeds to find one that works

#

and it's very clear in the video that the larger examples are generated by inverting the process

#

as in, just generate a bunch of random values

cinder cedar Nov 26, 2022, 8:46 AM

#

#

Is that’s true what I did?

#

Need to proof or disproof

haughty mountain Nov 26, 2022, 8:48 AM

#

log 2 <= 1/2 log n isn't generally true

#

you'll need some lower bound on n

cinder cedar Nov 26, 2022, 8:48 AM

#

It’s not

haughty mountain Nov 26, 2022, 8:49 AM

#

wrong inequality, let me fix

cinder cedar Nov 26, 2022, 8:49 AM

#

Let’s say we have 10 - 1 >= 10 - 5 and 1 is log2 and 5 is 1/2logn that’s true

haughty mountain Nov 26, 2022, 8:50 AM

#

try n=1

cinder cedar Nov 26, 2022, 8:51 AM

#

lemon_sentimental

haughty mountain Nov 26, 2022, 8:51 AM

#

you just need to say that it's valid for n greater than some value

cinder cedar Nov 26, 2022, 8:52 AM

#

Yeah let me find it

haughty mountain Nov 26, 2022, 8:54 AM

#

~~you could also just do

n/2 log n/2 <= n log n
```and be done (you still need a lower bound, but whatever)~~

cinder cedar Nov 26, 2022, 8:55 AM

#

Yo wait

#

Why n=1 is not good

haughty mountain Nov 26, 2022, 8:55 AM

#

errr

cinder cedar Nov 26, 2022, 8:56 AM

#

We get log2=1 >= 0.5*0

#

log2 >= 1/2*logn if n=1 -> log2>1/2x0=0

haughty mountain Nov 26, 2022, 9:01 AM

#

cinder cedar Nov 26, 2022, 9:01 AM

#

n can’t be 0

haughty mountain Nov 26, 2022, 9:01 AM

#

the top inequality is what your step assumes

#

I meant 1

#

my bad

#

n=1 gives that inequality

#

log 1 = 0

cinder cedar Nov 26, 2022, 9:02 AM

#

Hmm yeah right

#

So n>1

haughty mountain Nov 26, 2022, 9:03 AM

#

n=2 is also bad

#

log 2 <= 1/2 log 2

cinder cedar Nov 26, 2022, 9:03 AM

#

n>=4

#

pithink

haughty mountain Nov 26, 2022, 9:05 AM

#

n=4 is the cutoff yes

#

#

not that it matters much what the exact cutoff is

#

it just needs to be some finite number

cinder cedar Nov 26, 2022, 9:06 AM

#

Like a single number

haughty mountain Nov 26, 2022, 9:06 AM

#

finite number as in some constant < infinity

cinder cedar Nov 26, 2022, 9:07 AM

#

🤝

#

Thx

spare spindle Nov 26, 2022, 10:07 AM

#

anyone knows how to write this in a pythonic way? xd

floors = {
            "Zone 1": [1,2,3,4,5,6,7,8,9,10],
            "Zone 2": [11,12,13,14,15,16,17,18,19,20], 
            "Zone 3": [21,22,23,24,25,26,27,28,29,30],
            "Zone 4": [31,32,33,34,35,36,37,38,39,40],
            "Zone 5": [41,42,43,44,45,46,47,48,49,50],
            "Zone 6": [51,52,53,54,55,56,57,58,59,60],
            "Zone 7": [61,62,63,64,65,66,67,68,69,70],
            "Zone 8": [71,72,73,74,75,76,77,78,79,80],
            "Zone 9": [81,82,83,84,85,86,87,88,89,90],
            "Zone 10": [91,92,93,94,95,96,97,98,99,100]
        }

covert thorn Nov 26, 2022, 10:15 AM

#

spare spindle anyone knows how to write this in a pythonic way? xd ```py floors = { ...

dunno if i'd call it pythonic, but here

#

!e

floors = {f'Zone {z}': [*range(10*(z-1) + 1, 10*z + 1)] for z in range(1, 11)}
print(floors)

halcyon plankBOT Nov 26, 2022, 10:16 AM

#

@covert thorn :white_check_mark: Your 3.11 eval job has completed with return code 0.

{'Zone 1': [1, 2, 3, 4, 5, 6, 7, 8, 9, 10], 'Zone 2': [11, 12, 13, 14, 15, 16, 17, 18, 19, 20], 'Zone 3': [21, 22, 23, 24, 25, 26, 27, 28, 29, 30], 'Zone 4': [31, 32, 33, 34, 35, 36, 37, 38, 39, 40], 'Zone 5': [41, 42, 43, 44, 45, 46, 47, 48, 49, 50], 'Zone 6': [51, 52, 53, 54, 55, 56, 57, 58, 59, 60], 'Zone 7': [61, 62, 63, 64, 65, 66, 67, 68, 69, 70], 'Zone 8': [71, 72, 73, 74, 75, 76, 77, 78, 79, 80], 'Zone 9': [81, 82, 83, 84, 85, 86, 87, 88, 89, 90], 'Zone 10': [91, 92, 93, 94, 95, 96, 97, 98, 99, 100]}

spare spindle Nov 26, 2022, 10:18 AM

#

covert thorn dunno if i'd call it *pythonic*, but here

its ok, its perfect Latom thats a smart way of going around it

#

thank you lol

visual zephyr Nov 26, 2022, 10:45 AM

#

any channel open?

#

sorry by mistake

mystic dust Nov 26, 2022, 4:24 PM

#

I'm curious, is there a simpler (or more pythonic) way to do this?

created = []
for attrs in data:         # data type: list[list[str]]
  if len(attrs) >= 3:
    work = Work(*attrs)    # initialize object
    if work.save():        # returns bool
      created.append(work)

mystic dust Nov 26, 2022, 4:48 PM

#

Will this do the same trick?

created = [work for attrs in data if len(attrs) >= 3 and (work := Work(*attrs)).save()]

vocal gorge Nov 26, 2022, 4:50 PM

#

yes, but I like the first more tbh

mystic dust Nov 26, 2022, 4:52 PM

#

Because of the controversy with the walrus operator or because it only works on 3.8+?

vocal gorge Nov 26, 2022, 4:52 PM

#

Just because it's harder to read, actually.

vocal gorge Nov 26, 2022, 4:53 PM

#

vocal gorge yes, but I like the first more tbh

although I'm personally tempted to write it like something like

creates = SI(data).filter(len(_)>=3).map(Work(*_)).filter(_.save()).tolist()

which uses https://github.com/kachayev/fn.py, so your mileage may wary

mystic dust Nov 26, 2022, 4:53 PM

#

vocal gorge although I'm personally tempted to write it like something like ```py creates = ...

Ooooo thanks!

worthy escarp Nov 26, 2022, 9:30 PM

#

mystic dust I'm curious, is there a simpler (or more pythonic) way to do this? ```py created...

If you want other people to understand your code, I advise you to choose this version. (Often, the other person is a future version of yourself)

merry rivet Nov 26, 2022, 11:03 PM

#

can anyone help me w algo coursework??

waxen cloak Nov 26, 2022, 11:59 PM

#

Hello

#

Is there a way to such thing in python , call function by its name stored in a variable

x = "bool"
l = x(0)
print(l) # Should print False

vocal gorge Nov 27, 2022, 12:04 AM

#

yes, but why

stray fractal Nov 27, 2022, 12:04 AM

#

waxen cloak Is there a way to such thing in python , call function by its name stored in a v...

!e this, also off-topic ```py
x = "bool"
l = locals()x
print(l)

halcyon plankBOT Nov 27, 2022, 12:04 AM

#

@stray fractal :x: Your 3.11 eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 2, in <module>
003 | KeyError: 'bool'

stray fractal Nov 27, 2022, 12:04 AM

#

nvm

waxen cloak Nov 27, 2022, 12:07 AM

#

vocal gorge yes, but why

I'm testing a function when I want to edit some app parameteres , and I want to pass the new value and a type to help me convert (cast) in case I need to

slender sandal Nov 27, 2022, 9:29 AM

#

stray fractal !e this, also off-topic ```py x = "bool" l = locals()[x](0) print(l) ```

You'd use vars, not locals

#

Nevermind

waxen cloak Nov 27, 2022, 11:18 AM

#

thank u

reef spear Nov 27, 2022, 12:29 PM

#

how to cut equal pieces in cake using python

fiery cosmos Nov 27, 2022, 12:56 PM

#

can someone look at this: https://discord.com/channels/267624335836053506/1035199133436354600

#

check if tuple with range is inside list

fiery cosmos Nov 27, 2022, 2:20 PM

#

how can i just convert this timestamp to say yy:mm:day

#

it is too specific

#

i get a bad line plot

fiery cosmos Nov 27, 2022, 5:44 PM

#

what is the most concise way to write if character is not equivalent to one of the following several characters

fiery cosmos Nov 27, 2022, 6:03 PM

#

if 'c' not in 'abc':

#

ty

mild hound Nov 27, 2022, 6:25 PM

#

fiery cosmos i get a bad line plot

are you sure the timestamps are sorted correctly ?

fiery cosmos Nov 27, 2022, 6:27 PM

#

im struggling with my __repr__ method for a custom error class

#

idk why it's returning as a tuple

#

i would like a string

spring iris Nov 27, 2022, 6:35 PM

#

hello!

fiery cosmos Nov 27, 2022, 6:35 PM

#

👋

spring iris Nov 27, 2022, 6:35 PM

#

i want the compiler keep asking use to input until the user is done

#

user*

#

when the user press enter whith no input

#

thats when he is done

fiery cosmos Nov 27, 2022, 6:36 PM

#

while loop?

spring iris Nov 27, 2022, 6:36 PM

#

don't how it's going to help

#

i know how it works but couldn't phrase it with my condition

fiery cosmos Nov 27, 2022, 6:37 PM

#

while input != ''

#

? guessing

spring iris Nov 27, 2022, 6:37 PM

#

hmmm

#

lol

#

thanks

fiery cosmos Nov 27, 2022, 6:37 PM

#

stick around someone will be able to help you better

spring iris Nov 27, 2022, 6:37 PM

#

no that's the answer

#

dont know how i missed it xD

fiery cosmos Nov 27, 2022, 6:38 PM

#

welp. ok

#

i think input is missing some braces above

#

anyone can help with my custom error __repr__? idk why its printing as a tuple like this:
('my error message', 'error_character')

#

sry its a custom exception

hasty glen Nov 27, 2022, 7:12 PM

#

Hi. Is there a person who might know how to access a 2D list using a tuple?
like using (0, 0) to access 5 in [[5, 3], [6, 7]]

fiery cosmos Nov 27, 2022, 7:15 PM

#

you don't use a tuple, you use both indices:
y = [[5,3], [6,7]]
if you want 5:
print(y[0][0])

rich rapids Nov 27, 2022, 7:15 PM

#

fiery cosmos anyone can help with my custom error `__repr__`? idk why its printing as a tuple...

can you show the class definition?

fiery cosmos Nov 27, 2022, 7:15 PM

#

rich rapids can you show the class definition?

nevermind, i figured it out

rich rapids Nov 27, 2022, 7:15 PM

#

ahh nice

hasty glen Nov 27, 2022, 7:16 PM

#

fiery cosmos you don't use a tuple, you use both indices: y = [[5,3], [6,7]] if you want 5: p...

I know the normal mode but i need to access it using a tuple

fiery cosmos Nov 27, 2022, 7:17 PM

#

why must you use a tuple

#

if you absolutely have to do that, you probably need to convert it to the proper access syntax somehow

rich rapids Nov 27, 2022, 7:17 PM

#

You can do something weird with a reduce function and calling it on the tuple

#

Other than that, I don't think python supports tuples for array indexing

fiery cosmos Nov 27, 2022, 7:18 PM

#

for (a,b) in input_list_of_tuples: print(2darray([a][b]))?

#

idk if this'll work

rich rapids Nov 27, 2022, 7:18 PM

#

I don't think it will, the unpacking won't work as intended

hasty glen Nov 27, 2022, 7:19 PM

#

my input to program is like this
[(0, 0), (3, 3)]
and these are the items in my matrices so i should access to y[0][0] or y[3][3]
something like that

rich rapids Nov 27, 2022, 7:19 PM

#

what universe posted will work then

hasty glen Nov 27, 2022, 7:21 PM

#

fiery cosmos `for (a,b) in input_list_of_tuples: print(2darray([a][b]))`?

now this will work but i want to modify them too

fiery cosmos Nov 27, 2022, 7:21 PM

#

modify what

hasty glen Nov 27, 2022, 7:22 PM

#

y[0][0] for example if it is 0 i want to change it to 19

fiery cosmos Nov 27, 2022, 7:22 PM

#

just add some if statements to the code above

#

if a == 0: a = 19

#

oh you mean the value at y[0][0]

hasty glen Nov 27, 2022, 7:23 PM

#

yup

fiery cosmos Nov 27, 2022, 7:23 PM

#

    y[a][b] = 19```

hasty glen Nov 27, 2022, 7:23 PM

#

not like that

hasty glen Nov 27, 2022, 7:24 PM

#

fiery cosmos `for (a,b) in input_list_of_tuples: print(2darray([a][b]))`?

okay i think i figured out a way thanks for your help

fiery cosmos Nov 27, 2022, 7:31 PM

#

hmm right now i am catching an error and aborting the program, it'd be nice to instead just remove the sequence with the error and run the others as usual

bronze sail Nov 27, 2022, 8:04 PM

#

Does python set's hashing function can have hash collision same way like dictionary ?

agile sundial Nov 27, 2022, 8:05 PM

#

yes

fiery cosmos Nov 27, 2022, 8:32 PM

#

is it possible to remove list elements while iterating over a list and would this not ruin the notion of a list index

#

e.g.,
if i am running a loop from i to len(list), but then remove an element from the list during the loop

jolly mortar Nov 27, 2022, 8:33 PM

#

iterate in reverse order

fiery cosmos Nov 27, 2022, 8:33 PM

#

🤔

#

like

for i in range(len(list),0,-1)

jolly mortar Nov 27, 2022, 8:36 PM

#

!e

xs = ["a", "b", "c", "d", "e"]
for i in range(5):
    print(i, xs[i])
    xs.pop(i)

halcyon plankBOT Nov 27, 2022, 8:36 PM

#

@jolly mortar :x: Your 3.11 eval job has completed with return code 1.

001 | 0 a
002 | 1 c
003 | 2 e
004 | Traceback (most recent call last):
005 |   File "<string>", line 3, in <module>
006 | IndexError: list index out of range

jolly mortar Nov 27, 2022, 8:37 PM

#

!e

xs = ["a", "b", "c", "d", "e"]
for i in range(4, -1, -1):
    print(i, xs[i])
    xs.pop(i)

halcyon plankBOT Nov 27, 2022, 8:37 PM

#

@jolly mortar :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 4 e
002 | 3 d
003 | 2 c
004 | 1 b
005 | 0 a

fiery cosmos Nov 27, 2022, 8:37 PM

#

interesting i will try this thank you

#

why -1 for end index

jolly mortar Nov 27, 2022, 8:38 PM

#

arguably simpler way is to just make a new list for the elements you want to keep insteas of mutating the current one

jolly mortar Nov 27, 2022, 8:39 PM

#

fiery cosmos why -1 for end index

range is stop-exclusive, so the last thing it produces is 0

fiery cosmos Nov 27, 2022, 8:39 PM

#

oo

#

right

fiery cosmos Nov 27, 2022, 8:57 PM

#

yeah its kind of tough bc there is error handling and i already have a list from the text file input that i'd like to just remove stuff from when I catch an error

#

an error being that one of the chars in the string is not in the language under consideration

#

can you not continue or pass after an exception is raised?

#

in a loop

opal oriole Nov 27, 2022, 9:13 PM

#

jolly mortar !e ```py xs = ["a", "b", "c", "d", "e"] for i in range(4, -1, -1): print(i, ...

!e ```py
xs = ["a", "b", "c", "d", "e"]
for i in reversed(range(5)):
print(i, xs[i])
xs.pop(i)

halcyon plankBOT Nov 27, 2022, 9:13 PM

#

@opal oriole :white_check_mark: Your 3.11 eval job has completed with return code 0.

001 | 4 e
002 | 3 d
003 | 2 c
004 | 1 b
005 | 0 a

opal oriole Nov 27, 2022, 9:13 PM

#

I recommend reversed to not mess up the range values.

#

@fiery cosmos

jolly mortar Nov 27, 2022, 9:18 PM

#

agree

opal oriole Nov 27, 2022, 9:19 PM

#

fiery cosmos can you not continue or pass after an exception is raised?

If you catch it.

fiery cosmos Nov 27, 2022, 9:20 PM

#

Oof

opal oriole Nov 27, 2022, 9:25 PM

#

fiery cosmos hmm right now i am catching an error and aborting the program, it'd be nice to i...

Do you need to remove it or just skip that iteration?

#

If just skip then a continue in the except.

#

If remove, it's easier to do what hsop said and generate a new list.

#

It's effectively the same as the exception skip version, but if not skipped, it adds to the new list.

#

So it's a copy that sometimes skips on exception.

opal oriole Nov 27, 2022, 9:30 PM

#

fiery cosmos an error being that one of the chars in the string is not in the language under ...

This specific task is actually not so simple (depending on language and more).

#

If by language you mean spoken language.

fiery cosmos Nov 27, 2022, 9:33 PM

#

No the language is just a series of characters

opal oriole Nov 27, 2022, 9:33 PM

#

fiery cosmos No the language is just a series of characters

So you have a set of values and are checking if it's in there?

fiery cosmos Nov 27, 2022, 9:35 PM

#

I do the other way around. ‘if char not in ‘stringwitheverychar’’

#

raise exception. I also want to remove that string (which is the second item in a list) from consideration from the master 2D list

opal oriole Nov 27, 2022, 9:36 PM

#

So you have a list of strings and if the string contains an invalid character remove it from the list?

fiery cosmos Nov 27, 2022, 9:36 PM

#

Not exactly. Let me get on my PC 1s

rigid gyro Nov 27, 2022, 9:41 PM

#

hey guys, sorry if this is the wrong section to post in. but i’m wondering the most efficient way to search and extract from 46 files in a folder / directory. basically i need to extract the files only with specific peak absorbances

fiery cosmos Nov 27, 2022, 9:42 PM

#

rigid gyro hey guys, sorry if this is the wrong section to post in. but i’m wondering the m...

do the ones with peak absorbances have a commonality to their file name?

rigid gyro Nov 27, 2022, 9:43 PM

#

fiery cosmos do the ones with peak absorbances have a commonality to their file name?

nope, the files are just named based on their dates and sample specification

fiery cosmos Nov 27, 2022, 9:43 PM

#

what is the file type

rigid gyro Nov 27, 2022, 9:43 PM

#

the data is only available inside the file

#

csv

fiery cosmos Nov 27, 2022, 9:43 PM

#

and the specific peak absorbance is, a range?

opal oriole Nov 27, 2022, 9:44 PM

#

While you could raise an exception for this, it may be easier to just return a boolean (is valid) and then either process it or skip it.

fiery cosmos Nov 27, 2022, 9:45 PM

#

🤔

opal oriole Nov 27, 2022, 9:46 PM

#

Whether to use an exception or not depends on the code. One of the main things about exceptions is that they can be passed up (propagated).

rigid gyro Nov 27, 2022, 9:46 PM

#

fiery cosmos and the specific peak absorbance is, a range?

so, i need to extract files with peak absorbances at 451, 434, 320, 271. this is to identify the files which contain peaks for a specific molecule so i can plot the spectra

opal oriole Nov 27, 2022, 9:49 PM

#

"raise an exception so i can print to file which invalid character was detected" - You could do all the normal processing in the try and in the except do that printing.

#

Or if you want to do the printing later, add those exceptions to a list.

fiery cosmos Nov 27, 2022, 9:50 PM

#

@rigid gyro this sounds easily emenable to automation. you'll just write a python program to iterate through each line in each file in your directory looking for the peak_abs or whatever string in a cell, and if the next cell is one of those values, store the file in a list of files to return

#

it'll look roughly like this:

return = []
for file in filedirectory:
    with open(file) as f:
        if f.readline == 'peak_abs':
          if peak_abs == range(220-694):
            return.append(file)

#

you'll need to find the specifics of opening a .cvs file however

rigid gyro Nov 27, 2022, 9:52 PM

#

okay! though the peak absorbance isn’t stated in the files, basically i have only wavelength, absorbance in visible region and concentration. so, i guess i will have to use scipy_findpeaks somewhere too

#

absorbance between 220-694 actually

fiery cosmos Nov 27, 2022, 9:53 PM

#

if it's a csv, you'll have to decide the row you want to parse on

#

row or rows

#

you need to be able to read a specific row and decide to return that file or not based on the integer there

#

that's not quite right above but its the general principle

rigid gyro Nov 27, 2022, 9:57 PM

#

okay thank you @fiery cosmos ! that’s helpful too know

fiery cosmos Nov 27, 2022, 9:57 PM

#

@rigid gyro https://www.geeksforgeeks.org/reading-rows-from-a-csv-file-in-python/

GeeksforGeeks

Reading Rows from a CSV File in Python - GeeksforGeeks

A Computer Science portal for geeks. It contains well written, well thought and well explained computer science and programming articles, quizzes and practice/competitive programming/company interview Questions.

opal oriole Nov 27, 2022, 9:58 PM

#

So there are different kinds of "errors". There is the error where if it happens you have the cancel the whole operation and/or rewind. And there there is the "error" where you process as much as you can and report the failed ones.

#

And so I would have the data_read give back two results, the list of things correctly read, and a list of failures.

fiery cosmos Nov 27, 2022, 9:58 PM

#

yeah i'd like to do the second one, this is a self-imposed error, a custom exception

#algos-and-data-structs

Initialize top row, except dp[0][0],