#algos-and-data-structs | Python | Page 120

odd steppe Jun 11, 2021, 5:19 AM

#

Should I be learning C soon

last fulcrum Jun 11, 2021, 5:20 AM

#

If you want. I'll say C++ not C.

#

It's far easier than C

half lotus Jun 11, 2021, 5:22 AM

#

I must resist the urge to start arguing on why C++ would be a terrible first language.

last fulcrum Jun 11, 2021, 5:25 AM

#

half lotus I must resist the urge to start arguing on why C++ would be a terrible first lan...

Great first language. Or just go with Java.

signal knot Jun 11, 2021, 7:05 AM

#

How do you learn C once you've been using Python for so long?

sage coral Jun 11, 2021, 7:37 AM

#

is anyone free to help me out with a mcts algorithm im working on?

last fulcrum Jun 11, 2021, 9:38 AM

#

signal knot How do you learn C once you've been using Python for so long?

Should be easy to pick up the basics if you've been using python for long. Learn how data types are used, how loops are used, etc. Or just create a project and use that to learn.

naive cloak Jun 11, 2021, 10:17 AM

#

in queries, what does <> mean?

loud trail Jun 11, 2021, 1:33 PM

#

naive cloak in queries, what does <> mean?

i assume this should have been in #databases, and i assume you are asking about sql. <> in sql means "not equal", like != in python.

spice owl Jun 11, 2021, 1:55 PM

#

Hello,

How can you convert JSON B to JSON A in encoding in Python?

JSON A

{\"version\":10000,\"archive3dm\":60,\"opennurbs\":-1937213195,\"data\":\"+n8CAMA4AAAAAAAA+/8CABQAAAAAAAAAQ8Jv8CqjCEad2KfSxM4qNij+rDr8/wIAiDgAAAAAAAAzAIAAQEETAAAAAAAAEBgAAAABAAAA+n8CAL0AAAAAAAAA+/8CABQAAAAAAAAAGRGvXlEL1BG//gAQgwEi8EoaeRf8/...

JSON B

{"version": 10000, "archive3dm": 60, "opennurbs": -1910998424, "data": "+n8CAE8qHwAAAAAA+/8CABQAAAAAAAAAxdu1YGDm0xG/5AAQgwEi8JhRycz8/wIAFyofAAAAAAAzAIAAQLFHBwAAAAAAEAwJAAABAAAA+n8CAL4AAAAAAAAA+/8CABQAAAAAAAAA3dTXTkfp0xG/5QAQgwEi8BDyeHD8/wIAhgAAAAAAAAARAgAAAAAAAAACAAAAAgAAAAAAAAAAAAAAAAAAAAAA8D8AAAAAAAAAAAAAAAAAAAAAAAAAAAAA8L8AAAAAAAAAAAAAAAAAAAAAAgAAAAAAAAAAAAAAAxcfMiErVEACAAAAAAAAAAAAAAAAAAAAAAAAAAAAHzIhK1RAAAAAAAAA4D0ACWqW+f9/AoAAAAAAAAAAAAEAAAD6fwIAvgAAAAAAAAD7/wIAFAAAAAAAAADd1NdOR+nTEb/lABCDASLwEPJ4cPz/...

vocal gorge Jun 11, 2021, 1:57 PM

#

the first one looks like a JSON string, while the latter is contents of this string, basically.

#

anyway, just json.loads it

loud trail Jun 11, 2021, 2:01 PM

#

@spice owl what is the context for this question? i agree that these look like 2 different representations of the same thing, but i've seen enough cursed mishandling of json data in the past to be cautious of making a recommendation without more information.

spice owl Jun 11, 2021, 2:02 PM

#

@loud trail @vocal gorge I am getting Newsoft JSON PARSE on my back-end server when trying to send JSON B

#

Then found the JSON A example, and figured I needed to encode it.

#

Error on back-end

vocal gorge Jun 11, 2021, 2:04 PM

#

How are you sending it?

last fulcrum Jun 11, 2021, 2:04 PM

#

Why is the len of a python list or tuple in constant time

spice owl Jun 11, 2021, 2:04 PM

#

res = requests.post(compute_url, json=payload)

last fulcrum Jun 11, 2021, 2:06 PM

#

last fulcrum Why is the len of a python list or tuple in constant time

Got it

loud trail Jun 11, 2021, 2:06 PM

#

last fulcrum Why is the len of a python list or tuple in constant time

i assume it's because the PyListObject itself tracks its own length

last fulcrum Jun 11, 2021, 2:06 PM

#

loud trail i assume it's because the `PyListObject` itself tracks its own length

Yes. That's why

vocal gorge Jun 11, 2021, 2:08 PM

#

essentially, because tuples and lists are arrays (a pointer to a place in memory and a length), not something like linked lists

#

(though I don't believe it's, like, in the language specification?.. so a sufficiently crazy Python implementation could use linked lists for lists)

loud trail Jun 11, 2021, 2:09 PM

#

reimplement python in common lisp

#

the more i think about it, the less bad of an idea it seems

#

back python lists and dicts directly with common lisp dicts and hash tables

last fulcrum Jun 11, 2021, 2:11 PM

#

loud trail back python lists and dicts directly with common lisp dicts and hash tables

I think python is sufficiently optimised enough.

loud trail Jun 11, 2021, 2:11 PM

#

if anything it's de-optimizing the python implementation and letting the lisp compiler take care of it

#

idris 2 i believe is the same way, the chez backend is so damn good that they don't need to do a lot of optimizing in the idris compiler itself

#

anyway, here:
https://github.com/python/cpython/blob/bb3e0c240bc60fe08d332ff5955d54197f79751c/Objects/listobject.c#L197-L206
https://github.com/python/cpython/blob/304dfec8d3c0763734ea8b5fa2af1d9e1ce69ffa/Include/object.h#L140
ultimately PyList_Size uses the Py_SIZE macro which just gets the ob_size attribute

halcyon plankBOT Jun 11, 2021, 2:13 PM

#

Objects/listobject.c lines 197 to 206

Py_ssize_t
PyList_Size(PyObject *op)
{
    if (!PyList_Check(op)) {
        PyErr_BadInternalCall();
        return -1;
    }
    else
        return Py_SIZE(op);
}```
`Include/object.h` line 140
```h
#define Py_SIZE(ob)             (_PyVarObject_CAST(ob)->ob_size)```

last fulcrum Jun 11, 2021, 2:13 PM

#

loud trail idris 2 i believe is the same way, the chez backend is so damn good that they do...

Never even heard of idris 2 before

loud trail Jun 11, 2021, 2:13 PM

#

https://stackoverflow.com/a/54323374/2954547 someone on stackoverflow already figured it out, but i had fun finding it

Stack Overflow

Reason Behind O(1) in finding the length of List in python

I was going through https://wiki.python.org/moin/TimeComplexity of Lists in python. I know python list internally works as an Array. But how internally length of list is O(1). As it needs to traverse

loud trail Jun 11, 2021, 2:14 PM

#

last fulcrum Never even heard of idris 2 before

like haskell, but nicer (imo) and simpler, and with dependent types built into the language

last fulcrum Jun 11, 2021, 2:14 PM

#

loud trail anyway, here: https://github.com/python/cpython/blob/bb3e0c240bc60fe08d332ff5955...

I dream of the day when I can write my own language with a compiler etc.

#

I really love looking at internal things

loud trail Jun 11, 2021, 2:15 PM

#

maybe SICP is a good book for you

last fulcrum Jun 11, 2021, 2:16 PM

#

loud trail maybe SICP is a good book for you

I'll look at it. Thanks for the recommendation.

vocal gorge Jun 11, 2021, 2:16 PM

#

loud trail https://stackoverflow.com/a/54323374/2954547 someone on stackoverflow already fi...

huh. What language could they have come from where arrays without a size are common?..

loud trail Jun 11, 2021, 2:18 PM

#

C?

last fulcrum Jun 11, 2021, 2:18 PM

#

vocal gorge huh. What language could they have come from where arrays without a size are com...

Maybe he only knew of dynamic arrays

last fulcrum Jun 11, 2021, 2:19 PM

#

loud trail C?

C has array size limits

loud trail Jun 11, 2021, 2:19 PM

#

doesn't C strlen() just walk the array until it hits a nul byte?

vocal gorge Jun 11, 2021, 2:19 PM

#

loud trail C?

yeah, C strings are null-terminated, but isn't that only a string thing?

last fulcrum Jun 11, 2021, 2:19 PM

#

loud trail doesn't C `strlen()` just walk the array until it hits a nul byte?

I think so. I know a string in python also does the same

vocal gorge Jun 11, 2021, 2:20 PM

#

do... they use null-terminated arrays too? that sounds basically impossible, since in non-strings you can have zeros

last fulcrum Jun 11, 2021, 2:20 PM

#

vocal gorge do... they use null-terminated arrays too? that sounds basically impossible, sin...

They use null terminated for the str class

loud trail Jun 11, 2021, 2:21 PM

#

that i'm not sure about

austere sparrow Jun 11, 2021, 5:33 PM

#

last fulcrum They use null terminated for the str class

I don't think so, len is constant-time for strings.

#

https://github.com/python/cpython/blob/main/Objects/unicodeobject.c#L132-L139

halcyon plankBOT Jun 11, 2021, 5:33 PM

#

Objects/unicodeobject.c lines 132 to 139

#define PyUnicode_WSTR_LENGTH(op) \
    (PyUnicode_IS_COMPACT_ASCII(op) ?                  \
     ((PyASCIIObject*)op)->length :                    \
     ((PyCompactUnicodeObject*)op)->wstr_length)
#define _PyUnicode_WSTR_LENGTH(op)                      \
    (((PyCompactUnicodeObject*)(op))->wstr_length)
#define _PyUnicode_LENGTH(op)                           \
    (((PyASCIIObject *)(op))->length)```

austere sparrow Jun 11, 2021, 5:33 PM

#

https://github.com/python/cpython/blob/main/Objects/unicodeobject.c#L147-L149

halcyon plankBOT Jun 11, 2021, 5:33 PM

#

Objects/unicodeobject.c lines 147 to 149

#define _PyUnicode_GET_LENGTH(op)                       \
    (assert(_PyUnicode_CHECK(op)),                      \
     ((PyASCIIObject *)(op))->length)```

austere sparrow Jun 11, 2021, 5:35 PM

#

!e
Also yes, you can have \0 bytes

print(repr("Here's a NUL: \0"))

halcyon plankBOT Jun 11, 2021, 5:35 PM

#

@austere sparrow :white_check_mark: Your eval job has completed with return code 0.

"Here's a NUL: \x00"

austere sparrow Jun 11, 2021, 5:42 PM

#

although probably more on topic in #internals-and-peps

last fulcrum Jun 11, 2021, 5:54 PM

#

As expected, we see that inserting at the beginning of a list is most expensive, requiring linear time per operation. Inserting at the middle requires about half the time as inserting at the beginning, yet is still Ω(n) time. Inserting at the end displays O(1) behaviour, akin to append.

Python knows where the array starts from, so I'm confused as to why it would be linear time to insert at the beginning.

fervent saddle Jun 11, 2021, 5:55 PM

#

It needs to move all the existing values forward by 1

#

All the values that come after the inserted value

last fulcrum Jun 11, 2021, 5:57 PM

#

fervent saddle It needs to move all the existing values forward by 1

Yes that makes sense.

#

So the less it has to move the less time

fervent saddle Jun 11, 2021, 6:02 PM

#

Yeah

#

But it’s always O(n) except if you’re inserting at the end, which is amortized O(1)

#

I mean I guess always inserting within a specific range from the end would technically be O(1)

#

But yeah you get the idea

proud ridge Jun 11, 2021, 6:59 PM

#

hello

#

i need help with coding a A* search algo

#

#algos-and-data-structs hey

fiery cosmos Jun 11, 2021, 7:48 PM

#

fervent saddle But it’s always O(n) except if you’re inserting at the end, which is amortized O...

Kinda makes me wonder why Python doesn't implement lists like a circular queue array https://www.geeksforgeeks.org/circular-queue-set-1-introduction-array-implementation/ for O(1) insert at start and half the shifts to do in general rooThink1

#

But there are probably drawbacks I don't realize pithink

loud trail Jun 11, 2021, 8:23 PM

#

do those have constant-time lookups too?

gentle badger Jun 11, 2021, 9:08 PM

#

sup

scenic cosmos Jun 11, 2021, 9:12 PM

#

Does anyone here have opinions on what metaheuristics package to use in python?

fervent saddle Jun 11, 2021, 10:01 PM

#

If it’s done using an array then it should

keen hearth Jun 12, 2021, 4:46 AM

#

fiery cosmos But there are probably drawbacks I don't realize <:pithink:652247559909277706>

It’s an interesting question! This is the best answer I could find https://stackoverflow.com/a/56409211

Stack Overflow

Why are Python lists implemented as dynamic arrays instead of ring ...

A list in Python is now implemented as dynamic array of pointers, so it's not suitable for insertion and deletion at the front end. However, ring buffer also supports O(1) indexing. It can also exp...

fiery cosmos Jun 12, 2021, 5:47 AM

#

Yeah, I guess you have to add an offset to every index call, but that doesn't seem like that expensive of an operation. spinning_think

#

The trade-off makes sense I suppose since indexing is way way more common than inserting AYAYAHappy

stable pecan Jun 12, 2021, 6:28 AM

#

If you implement a ring buffer for fast insertion, is that meaning you need pointers for each block? that's extra memory overhead too

#

i've only implemented constant size ring buffer, so that it was just using a list internally, you could "insert" at either end, but not arbitrarily -- though i guess you could just occasionally shrink or grow it, so that you don't need to implement as a linked list

fiery cosmos Jun 12, 2021, 6:39 AM

#

yeah, you can double (or whatever factor the dynamic array uses) the size when the ring buffer fills up and still have O(1) amortized appends, but from either side

raw juniper Jun 12, 2021, 7:49 AM

#

Anyone know any good books on the practical side (CP, practice questions, etc.) of data structures (and maybe algorithms)?

#

(pls ping me when replying lol)

wanton basin Jun 12, 2021, 8:21 AM

#

Hi, I am trying to create a custom Dictionary for blob and in memory object storage

#

How do I overload .keys() dictionary method?

raw juniper Jun 12, 2021, 8:22 AM

#

I don't think you can "overload" funcs in python

#

You can override them

#

You just have to define it again, it'll override it by default

wanton basin Jun 12, 2021, 8:23 AM

#

ok thanks

raw juniper Jun 12, 2021, 8:23 AM

#

class BaseClass:
    def some_func(self):
        return 0

class Class(BaseClass):
    def some_func(self, var):
        return var

#

Yo reptile

vocal gorge Jun 12, 2021, 8:24 AM

#

yeah, you don't need any special keyword (like Java's @Override) to override a parent function in Python

raw juniper Jun 12, 2021, 8:24 AM

#

raw juniper Anyone know any good books on the practical side (CP, practice questions, etc.) ...

Do you know any good books?

vocal gorge Jun 12, 2021, 8:25 AM

#

well, CLRS isn't on the practical side, so not really

raw juniper Jun 12, 2021, 8:25 AM

#

Hmmm

wanton basin Jun 12, 2021, 8:25 AM

#

Ideally take up a project for practical, and just incorporate everything

raw juniper Jun 12, 2021, 8:26 AM

#

Ah, but that's not what they want lol. The person who asked me wanted to practice dsa questions or smth

wanton basin Jun 12, 2021, 8:30 AM

#

Hackerrank has topic specific sections

#

I am overriding keys(), but that return dict_keys object type, how can I do the same?

#

`
def init(self, tablename="data"):
super().init()
databaseProvider = SqliteDict(self.dbpath, tablename=tablename,
encode=self.my_encode, decode=self.my_decode)

def __getitem__(self, key):
    if key in self.keys():
        return Dictionary.__getitem__(self, key)
    elif key in self.databaseProvider.keys():
        return self.databaseProvider[key]
    else:
        raise KeyError

def __setitem__(self, key, value):
    Dictionary.__setitem__(self, key, value)
    self.databaseProvider[key] = value

def keys(self):
    return Dictionary.keys().extend()`

vocal gorge Jun 12, 2021, 8:34 AM

#

hmm, I'm not sure there's a builtin way

#

you can return an object that behaves the same way, though

#

according to https://docs.python.org/3/library/collections.abc.html, a KeysView is a Set, basically.

#

so, just a set would be fine to return.

wanton basin Jun 12, 2021, 8:45 AM

#

def keys(self):
        keys = set(Dictionary.keys())
        keys.update(self.databaseProvider.keys())
        return keys

#

Is there a more elegant way to do this?

vocal gorge Jun 12, 2021, 8:51 AM

#

That seems like a nice way.

wanton basin Jun 12, 2021, 9:54 AM

#

Hi, I am using joblib for a parallel loop, but antimalware executable service is not allowing it to run properly

brave oak Jun 12, 2021, 11:30 AM

#

wanton basin ``` def keys(self): keys = set(Dictionary.keys()) keys.update(se...

!e

left = {'a': 1, 'b': 2}
right = {'c': 3, 'd': 4}

result = {*left, *right}

print(result)

halcyon plankBOT Jun 12, 2021, 11:30 AM

#

@brave oak :white_check_mark: Your eval job has completed with return code 0.

{'d', 'c', 'b', 'a'}

brave oak Jun 12, 2021, 11:31 AM

#

...assuming those are dicts, of course

stable pecan Jun 12, 2021, 12:03 PM

#

wanton basin ``` def keys(self): keys = set(Dictionary.keys()) keys.update(se...

key views have an __or__ method:

In [1]: a = {'a': 1, 'b': 2}; b = {'c': 3, 'd': 4}

In [2]: a.keys() | b.keys()
Out[2]: {'a', 'b', 'c', 'd'}

wanton basin Jun 12, 2021, 12:07 PM

#

Super

stable pecan Jun 12, 2021, 12:19 PM

#

wanton basin Super

i just looked at your earlier post and it looks like you're just implementing a ChainMap?

In [2]: from collections import ChainMap

In [3]: a = {'a': 1, 'b': 2}; b = {'c': 3, 'd': 4}

In [4]: c = ChainMap(a, b)

In [5]: c['a']
Out[5]: 1

In [6]: c['d']
Out[6]: 4

In [7]: c['e']
---------------------------------------------------------------------------
KeyError                                  Traceback (most recent call last)

wanton basin Jun 12, 2021, 12:20 PM

#

What is the difference between chainmap(a,b) and a.update(b)?

stable pecan Jun 12, 2021, 12:20 PM

#

chainmaps keep the underlying dictionarys separate

#

any update to a chainmap only affects the first dictionary (a in this case)

#

it will look for keys in the first dict and then continue to the rest

wanton basin Jun 12, 2021, 12:21 PM

#

ohh but my requirement is totally different, a is in memory lru cache and b is sqldict

stable pecan Jun 12, 2021, 12:21 PM

#

how is it different?

#

just ChainMap(cache, databaseProvider)

wanton basin Jun 12, 2021, 12:23 PM

#

I will load key from db to lru cache, do concurrent operation and then update in db

#

'''

#

'''
'''

#

class RepositoryDictionary(Dictionary):

dbpath = os.path.join(os.path.abspath(
    os.path.dirname(__file__)), 'data', "app.sqlite")

def my_encode(self, obj):
    return sqlite3.Binary(zlib.compress(pickle.dumps(obj, pickle.HIGHEST_PROTOCOL)))

def my_decode(self, obj):
    return pickle.loads(zlib.decompress(bytes(obj)))

def __init__(self, tablename="data"):
    super().__init__()
    self.databaseProvider = SqliteDict(self.dbpath, tablename=tablename,
                                       encode=self.my_encode, decode=self.my_decode)

```

def getitem(self, key):
if key in self.memkeys():
return Dictionary.getitem(self, key)
elif key in self.dbkeys():
Dictionary.setitem(self, key, self.databaseProvider[key])
return Dictionary.getitem(self, key)
else:
raise KeyError

def __setitem__(self, key, value):
    Dictionary.__setitem__(self, key, value)
    self.databaseProvider[key] = value
    self.databaseProvider.commit()

def memkeys(self):
    return self.keys()

def dbkeys(self):
    return self.databaseProvider.keys()

def keystore(self):
    return self.memkeys() | self.dbkeys()

stable pecan Jun 12, 2021, 12:24 PM

#

i see, there's a very nice method you can overload __missing__ that will probably simplify your implementation

wanton basin Jun 12, 2021, 12:26 PM

#

I think instead of getitem I can implement missing

#

but when it is missing I want to load in lru and then return from it, instead of direct return from db

#

@stable pecan Will missing load it into the lru cache automatically?

stable pecan Jun 12, 2021, 12:30 PM

#

if you implement it to

wanton basin Jun 12, 2021, 12:31 PM

#

okay, thanks

stable pecan Jun 12, 2021, 12:32 PM

#

something like this:

In [17]: class MyDict(ChainMap):
    ...:     def __init__(self, database):
    ...:         super().__init__({}, database)
    ...: 
    ...:     def __missing__(self, key):
    ...:         try:
    ...:             self[key] = self.maps[1][key]
    ...:             return self[key]
    ...:         except: # Whatever error database would raise
    ...:             raise KeyError from None
    ...: 
    ...:     def __setitem__(self, key, value):
    ...:         self.maps[1][key] = value
    ...:         self.maps[1].commit()
    ...:         self.maps[0][key] = value

#

might work

wanton basin Jun 12, 2021, 12:53 PM

#

@stable pecan Hahaha

def __setitem__(self, key, value):
    ...:        self.maps[1][key] = value
    ...:        self.maps[1].commit()
    ...:        self[key] = value

This is a infinite loop

stable pecan Jun 12, 2021, 12:56 PM

#

oops. use super

#

or maps

#

there, fixed

#

@wanton basin

wanton basin Jun 12, 2021, 12:58 PM

#

Already fixed, Thanks

stable pecan Jun 12, 2021, 12:58 PM

#

i forgot to return the key from __missing__

#

might need to fix that too

uncut crypt Jun 12, 2021, 1:39 PM

#

Can some one help me on how to tackle problems involving substrings?
I always ends up using loops which results in TLE

fathom prairie Jun 12, 2021, 2:02 PM

#

what's the most memory efficient data structure for caching in python

vocal gorge Jun 12, 2021, 2:02 PM

#

...a dict? I mean, you need quick lookup too.

#

there are some stuff like BTreeMaps which are more memory-efficient, but Python doesn't have a builtin for that, though there's probably a library

fathom prairie Jun 12, 2021, 2:03 PM

#

vocal gorge there are some stuff like `BTreeMap`s which are more memory-efficient, but Pytho...

in my case yes i need quick lookups but memory is my first priority

vocal gorge Jun 12, 2021, 2:06 PM

#

looks like there's https://btrees.readthedocs.io/en/latest/overview.html that implements a BTree

#

it's not a very popular library, though

stable pecan Jun 12, 2021, 2:06 PM

#

i have some trees implemented

fathom prairie Jun 12, 2021, 2:06 PM

#

thanks! lemme look into it @vocal gorge

vocal gorge Jun 12, 2021, 2:06 PM

#

Make sure you actually measure whether you get a memory decrease compared to dicts.

fathom prairie Jun 12, 2021, 2:06 PM

#

sure

stable pecan Jun 12, 2021, 2:06 PM

#

but they weren't implemented with saving as much memory as possible

#

trees use a lot of pointers

#

https://github.com/salt-die/sacks have an AVLTree that can be remade into a mapping pretty easily (just by adding values to the nodes)

wanton basin Jun 12, 2021, 2:13 PM

#

@fathom prairie I believe it mostly depends on what type of data you want to store

fathom prairie Jun 12, 2021, 3:16 PM

#

wanton basin <@!438672336921690112> I believe it mostly depends on what type of data you wan...

{string: {int: python object}}

#

there can be multiple keys not just one

#

some can be up to 10^5

wanton basin Jun 12, 2021, 3:31 PM

#

@fathom prairie Is it a dictionary of dictionary?

fathom prairie Jun 12, 2021, 3:48 PM

#

wanton basin <@!438672336921690112> Is it a dictionary of dictionary?

yes

#

nested dict

wintry merlin Jun 12, 2021, 5:29 PM

#

hey any binary heap (min/max heap ) folks here

#

I feel like i understand the time complexity for everything here except the "average insert" wouldnt on average the value would be somewhere in the middle of the tree so it requires like log N / 2 time? so wouldnt that round to Log N time?

#

i understand that worst case is log n, n being the height of the tree, but on average an insert would be less then half the values so would need to go up some part of the tree

#

maybe even log N /4 or some other factor

raven dirge Jun 12, 2021, 5:43 PM

#

which algos should I learn first excep binary search al.

half lotus Jun 12, 2021, 5:49 PM

#

wintry merlin i understand that worst case is log n, n being the height of the tree, but on av...

Right but keep applying this logic. At each level, the bottom is half of the total nodes. So you could say each level you go up, the probability of the value being above is halved.

wintry merlin Jun 12, 2021, 5:49 PM

#

read a data structures book @raven dirge it will explore time/space complexity (big O notation) and give you a preview with examples of many of the commmon ones

#

@half lotus so its like log(n) over log (n) because each step you take is half of the chance to find a medium value?

#

which equates to ~ O(1)

raven dirge Jun 12, 2021, 5:50 PM

#

wintry merlin read a data structures book <@!799171080878030849> it will explore time/space co...

ok thanks

raven dirge Jun 12, 2021, 5:50 PM

#

wintry merlin read a data structures book <@!799171080878030849> it will explore time/space co...

any books you recommend

half lotus Jun 12, 2021, 5:50 PM

#

As the level increases you need more operations, but the chance of it being at a higher level decreases

#

So it converges to a constant value

wintry merlin Jun 12, 2021, 5:51 PM

#

raven dirge any books you recommend

im not saying you can find it free, but a google search wont hurt. https://www.amazon.com/Structures-Algorithms-Python-Michael-Goodrich/dp/1118290275

Data Structures and Algorithms in Python: Goodrich, Michael T., Tam...

Data Structures and Algorithms in Python [Goodrich, Michael T., Tamassia, Roberto, Goldwasser, Michael H.] on Amazon.com. FREE shipping on qualifying offers. Data Structures and Algorithms in Python

raven dirge Jun 12, 2021, 5:51 PM

#

wintry merlin im not saying you can find it free, but a google search wont hurt. https://www.a...

thanksss

vocal gorge Jun 12, 2021, 6:27 PM

#

im not saying you can find it free, but a google search wont hurt.
certainly not saying that it's true for every book, either

bright cedar Jun 12, 2021, 7:26 PM

#

can someone help me with this code

halcyon plankBOT Jun 12, 2021, 7:27 PM

#

Hey @bright cedar!

Uh-oh! It looks like your message got zapped by our spam filter. We currently don't allow .txt attachments, so here are some tips to help you travel safely:

• If you attempted to send a message longer than 2000 characters, try shortening your message to fit within the character limit or use a pasting service (see below)

• If you tried to show someone your code, you can use codeblocks
(run !code-blocks in #bot-commands for more information) or use a pasting service like:

https://paste.pythondiscord.com

bright cedar Jun 12, 2021, 7:28 PM

#

unborn sundial Jun 13, 2021, 7:01 AM

#

!e

print("yay") if True else print("Not yay")
print("yay") if False else print("Not yay")

halcyon plankBOT Jun 13, 2021, 7:01 AM

#

@unborn sundial :white_check_mark: Your eval job has completed with return code 0.

001 | yay
002 | Not yay

lament oriole Jun 13, 2021, 10:28 AM

#

i think i may have just made the worst selection sort ever

def minimaxi(n1,n2):
  if n1 > n2:
    return n2, n1
  return n1, n2

def selectionSort(arr):
  temp = arr
  sortedArr = [0]*len(arr)
  for j in range(len(arr)):
    for i in range(len(temp)):
      temp[i-1], temp[i] = minimaxi(temp[i-1], temp[i])
    sortedArr[j] = temp.pop(0)
  return sortedArr

two for loops feels inefficient as hell

austere sparrow Jun 13, 2021, 10:29 AM

#

lament oriole i think i may have just made the worst selection sort ever ```py def minimaxi(n1...

What's the point of temp = arr?

lament oriole Jun 13, 2021, 10:30 AM

#

oh my god lmao

#

i think i was messing around with ideas before, and that was left over. 10/10 best practice programming

#

thats better, still not happy about using 2 loops

  sortedArr = [0]*len(arr)
  for j in range(len(arr)):
    for i in range(len(arr)):
      arr[i-1], arr[i] = minimaxi(arr[i-1], arr[i])
    sortedArr[j] = arr.pop(0)
  return sortedArr

#

wait isnt selection sort O(n^2) anyway? or is that insertion sort, i get the two confused

burnt vigil Jun 13, 2021, 10:38 AM

#

lament oriole thats better, still not happy about using 2 loops ```py sortedArr = [0]*len(ar...

why did you make a local variable with a list of 0 in it? like you are affecting the space in the memory with that

burnt vigil Jun 13, 2021, 10:38 AM

#

lament oriole wait isnt selection sort `O(n^2)` anyway? or is that insertion sort, i get the t...

it is

#

both and bubble also

lament oriole Jun 13, 2021, 10:39 AM

#

burnt vigil why did you make a local variable with a list of 0 in it? like you are affecting...

oh i can use append cant i...

i really need to brush up on my python, been spending too much time with ts

burnt vigil Jun 13, 2021, 10:39 AM

#

lament oriole oh i can use append cant i... i really need to brush up on my python, been spen...

you can sort it in place without making a list

odd steppe Jun 13, 2021, 10:40 AM

#

I have a question asking for a solution to a cube that rotates and the blocks within fall to the base.
It's essentially a matrix, with values inside that are blocks, and barriers that are fixed. The block are supposed to fall to the lowest position when rotated.

how would I approach this problem, I'm very lost

burnt vigil Jun 13, 2021, 10:40 AM

#

just use 2 loops and with the indexes you are replacing them at the end of the outer loop

lament oriole Jun 13, 2021, 10:41 AM

#

i did say it might be the worst selection sort ever

burnt vigil Jun 13, 2021, 10:42 AM

#

lament oriole i did say it might be the worst selection sort ever

https://www.google.com/amp/s/www.geeksforgeeks.org/python-program-for-selection-sort/amp/

GeeksforGeeks

Python Program for Selection Sort - GeeksforGeeks

A Computer Science portal for geeks. It contains well written, well thought and well explained computer science and programming articles, quizzes and practice/competitive programming/company interview Questions.

#

on this site you can find examples and much more

lament oriole Jun 13, 2021, 10:43 AM

#

thanks

weary badge Jun 13, 2021, 1:01 PM

#

Hi All, I am looking for a solution for my question: How to create a group by considering the 'identical consecutive groupings in a column (in Pandas DataFrame) : (dont know how to insert here my code properly, if someone will tell me, I'll edit it):
from itertools import groupby

test_list = ['AA', 'AA', 'BB', 'CC', 'DD', 'DD', 'DD', 'AA', 'BB', 'EE', 'CC']
data = pd.DataFrame(test_list)
data['batches'] = ['1','1','2','3','4','4','4','5','6','7','8'] # this is the goal to reach
print(data)

result = [list(y) for x, y in groupby(test_list)]
print(result)

[['AA', 'AA'], ['BB'], ['CC'], ['DD', 'DD', 'DD'], ['AA'], ['BB'], ['EE'], ['CC']] So, I have a DataFrame with two columns: the first is a list of elements that must be kept in order + grouped into batches: identical consecutive grouping. The batch column where the result should be assigned.

I couldn't find a solution or a workaround. As you can see, I've created a list using the itertools groupby function by grouping the same cons. items, but this isn't the final result I'd like to see. (result I'd like to see):

list	batches
AA	1
AA	1
BB	2
CC	3
DD	4
DD	4
DD	4
AA	5
BB	6
EE	7
CC	8

fathom finch Jun 13, 2021, 2:11 PM

#

Which statement is false?

Adjacency list representation is better than adjacency matrix representation for directed graphs

Adjacency list representation is better than adjacency matrix representation for undirected graphs

Adjacency list representation is better than adjacency matrix representation for weighted graphs

Adjacency matrix representation is better than adjacency list representation for graphs

#

is this B?

austere beacon Jun 13, 2021, 2:20 PM

#

@scenic flint

fathom finch Jun 13, 2021, 2:21 PM

#

nope B is true

half lotus Jun 13, 2021, 3:38 PM

#

Maybe c cause a list would have to store a tuple of the vertex and weight. A matrix could just use the weight instead of putting a 1.

#

That being said this question is vague. What does "better" mean? Better on what basis?

vocal gorge Jun 13, 2021, 3:44 PM

#

D is false, probably

#

I feel like the matrix is never better, in time or in space

half lotus Jun 13, 2021, 3:45 PM

#

It has some advantages. There's a good so post that goes over or, hold on

#

https://stackoverflow.com/a/2218331

Stack Overflow

What is better, adjacency lists or adjacency matrices for graph pro...

What is better, adjacency lists or adjacency matrix, for graph problems in C++?
What are the advantages and disadvantages of each?

vocal gorge Jun 13, 2021, 3:46 PM

#

It is fast to lookup and check for presence or absence of a specific edge
between any two nodes O(1)
ah, that's true

#

wait, actually no

#

one could use an adjacency dict and then that's also O(1)

half lotus Jun 13, 2021, 3:47 PM

#

That's basically the only advantage

#

Do you mean an adjacency set?

vocal gorge Jun 13, 2021, 3:49 PM

#

yeah

half lotus Jun 13, 2021, 3:49 PM

#

Oh like a dict that stores a set

vocal gorge Jun 13, 2021, 3:49 PM

#

though one could also just use a dict of from,to tuples for the entire thing

#

or a dict mapping from to a set of to

#

either way, O(1) lookup and addition

#

only problem is memory, which is asymptotically good (just O(E)) but not very good in absolute terms (dicts aren't the most efficient, they waste what, two thirds their memory or so?)

south umbra Jun 13, 2021, 5:50 PM

#

so im coding in python from last 8 months and ik flask and fastapi and now i wanted to learn dsa (daily 2 hr side by side) with learn django in which lang should i learn dsa , my friends said cpp but i hate it and it's syntax is kinda frustating and hard what should i do

fiery cosmos Jun 13, 2021, 6:42 PM

#

data structures and algorithms apply to all languages 02shrug learning them in Python is fine and can be pretty elegant

#

Java might also be a decent choice if you want to learn a new language too rooHmm java

brave oak Jun 13, 2021, 11:34 PM

#

south umbra so im coding in python from last 8 months and ik flask and fastapi and now i wa...

it depends, really

#

C could be a good language

#

if you wanted to get into the low-level details

#

otherwise, Python is fine

next pelican Jun 14, 2021, 1:09 AM

#

.

dapper sapphire Jun 14, 2021, 2:47 AM

#

So if we have the following:
nums = [1, 2, 3, 4, 5]
And we have two for loops. Is that O(n^2) time complexity?
.
.
.

#

And if we have list_n and list_m and we traverse through them both 1 by 1 so:

for num in list_n:
  ...

for num in list_m:
  ...

then to time complexity for this would be O(n + m)?
.
.
.

#

And if we have a list and a dictionary so something like:

dictionary = {}
for index, num in enumerate(list_n):
  dictionary[num] = index

for num in dictionary:
  ...

Is the time complexity for this also O(n + m)?

austere sparrow Jun 14, 2021, 2:54 AM

#

dapper sapphire So if we have the following: nums = [1, 2, 3, 4, 5] And we have two for loops. I...

two for loops
You mean, nested loops?

dapper sapphire Jun 14, 2021, 2:55 AM

#

Yes, nested for loop that has to compare 1 element with every element in the list and we do that comparison for each element in the list/array.

austere sparrow Jun 14, 2021, 2:56 AM

#

Yes, that's quadratic complexity.

austere sparrow Jun 14, 2021, 2:56 AM

#

dapper sapphire And if we have a list and a dictionary so something like: ```python dictionary =...

What would m be here?

dapper sapphire Jun 14, 2021, 2:57 AM

#

austere sparrow What would `m` be here?

m would be dictionary which I supposed could also be n because we traversed over that list to store elements in the dictionary.

austere sparrow Jun 14, 2021, 2:58 AM

#

When would the size of the dictionary be different from the size of the list?

dapper sapphire Jun 14, 2021, 2:58 AM

#

because if we traverse over the same list twice, that's still considered O(n).

dapper sapphire Jun 14, 2021, 2:58 AM

#

austere sparrow When would the size of the dictionary be different from the size of the list?

If/when we have duplicate elements

austere sparrow Jun 14, 2021, 2:59 AM

#

Well, yes, if the list contains 1000 copies of the same element, they will be different.

#

But since 0 <= m <= n, m is always on the same order as n, so that would be just O(n)

dapper sapphire Jun 14, 2021, 3:03 AM

#

Oh so this could be O(n + m) if there are 1000 copies of the same element? or this could be O(n) if elements are different?:

dictionary = {}
for index, num in enumerate(list_n):
  dictionary[num] = index

for num in dictionary:
  ...

#

Should this be m >= 1 instead of m >= 0? Because there will at least be 1 element that will get stored even if it has copies of the same element?

shut breach Jun 14, 2021, 3:06 AM

#

that'd be O(n+m) where n is the length of the list and m is the number of unique elements in the list
since 0 < m <= n: O(n+m) == O(2n) == O(n)

dapper sapphire Jun 14, 2021, 3:08 AM

#

Since m > 0 and n >= m:
so if m is 100, n is 100 supposing each element is unique:
O(100 + 100) => O(2(100)) => O(100).

#

lol we are probably not supposed to do what I did by using 100, but just trying to get a better picture.

shut breach Jun 14, 2021, 3:09 AM

#

it doesn't make any sense to talk about O(100) like that
O(n) is a class of functions

#

based on how fast they grow with respect to their input

#

the behavior of a particular function at a particular point is irrelevant

dapper sapphire Jun 14, 2021, 3:11 AM

#

Yeah I agree it doesnt make sense to use 100. But I actually see how O(n + m) transforms into 0(2n)

shut breach Jun 14, 2021, 3:12 AM

#

the way to think about it is n >= m -> n+n >= n+m

#

so 2n is an upper bound for n+m

dapper sapphire Jun 14, 2021, 3:12 AM

#

So I understand O(n+m) becomes O(2n)
But I am trying to understand how does O(2n) becomes O(n) ?

shut breach Jun 14, 2021, 3:13 AM

#

because O(n) is notation for the class of functions which grow linearly w.r.t. their input

dapper sapphire Jun 14, 2021, 3:14 AM

#

So if we traversed through a list 3 or 4 times, it's still considered O(n). And not O(3n) or O(4n).

shut breach Jun 14, 2021, 3:15 AM

#

yes, or to put it another way, O(n) = O(2n) = O(3n) because they are all the same class of functions

#

Say your time function is T(n) = 2n, then T(kn) = 2kn
you multiplied your input by k, and the output only scaled up by k
linear!

Now consider the time function T(n) = n^2, so T(kn) = (kn)^2 = (k^2)(n^2)
you multiplied your function by k and the output was scaled up by k^2
not linear!

#

That's why we care so much about this in CS, if you increase the size of your input a little bit and it leads to a large increase in the time it takes your function to run, that's a problem

dapper sapphire Jun 14, 2021, 3:23 AM

#

I see what you mean if we have 100 or 500 elements in a list and we traverse through it 2 or 3 times, we still call it O(n) because it's linear. It would take 5x time at 500 elements compared 100 elements

But when we have K^2. Time can increase exponentially by each additional element

#

So O(n), O(2n), O(3n) end up being O(n) because time increase is linear.

#

That was helpful thanks!!

shut breach Jun 14, 2021, 3:39 AM

#

dapper sapphire I see what you mean if we have 100 or 500 elements in a list and we traverse thr...

one nitpick: 2^n is exponential, n^2 is quadratic

#

exponential is way way worse

dapper sapphire Jun 14, 2021, 3:45 AM

#

oh thanks for highlighting that! Quadratic and exponential are two different time complexities, with O(2^n) being worst.

last fulcrum Jun 14, 2021, 6:22 AM

#

Is my implementation linear?

def linear_search(list, target_value_to_find):
    for index, value in enumerate(list):
        if value == target_value_to_find:
            return f'Found your target value at index {index}'
    return -1


print(linear_search([1, 2, 4, 67, 92, 56], 92))

jolly mortar Jun 14, 2021, 6:22 AM

#

yes

last fulcrum Jun 14, 2021, 7:10 AM

#

Thanks

fathom skiff Jun 14, 2021, 7:17 AM

#

@brave oak

brave oak Jun 14, 2021, 7:28 AM

#

fathom skiff <@!171929073063297024>

what

austere sparrow Jun 14, 2021, 11:14 AM

#

last fulcrum Is my implementation linear? ```py def linear_search(list, target_value_to_fin...

Trick question: what is the space completixy of the function?

oak spindle Jun 14, 2021, 11:20 AM

#

austere sparrow Trick question: what is the space completixy of the function?

o(n)?

austere sparrow Jun 14, 2021, 11:22 AM

#

oak spindle o(n)?

why?

oak spindle Jun 14, 2021, 11:23 AM

#

austere sparrow why?

for loop+ traversing through the list?

austere sparrow Jun 14, 2021, 11:24 AM

#

oak spindle for loop+ traversing through the list?

Do you understand the difference between time complexity and space complexity?

oak spindle Jun 14, 2021, 11:24 AM

#

austere sparrow Do you understand the difference between time complexity and space complexity?

oh not yet, still learning

#

nested loops = n*n, loop -> n

#

two different loop n+m

oak spindle Jun 14, 2021, 11:26 AM

#

austere sparrow why?

wrong answer?

#

oh just now noticed you asked space complexity

#

my bad

near jetty Jun 14, 2021, 11:31 AM

#

austere sparrow Trick question: what is the space completixy of the function?

Is it ||O(n) since we only store the list once|| ?

last fulcrum Jun 14, 2021, 11:32 AM

#

austere sparrow Trick question: what is the space completixy of the function?

It's O(n)

#

Every variable uses the same size, so the same space. The loop would make it linear

austere sparrow Jun 14, 2021, 11:32 AM

#

near jetty Is it ||O(n) since we only store the list once|| ?

When computing space complexity, we usually talk about auxillary space occupied.

austere sparrow Jun 14, 2021, 11:33 AM

#

oak spindle oh not yet, still learning

The time complexity describes for how much longer the program will run when you increase the input.
The space complexity describes how much more memory the program will occupy when you increase the input, or rather, how much additional (auxiliary) memory it will need. For example, here: py def add_squares(numbers): total = 0 for number in numbers: total += number**2 return number if the size of the list is N, the time complexity is O(N), because you're traversing the list once. However, the space complexity here is O(1) -- you're only storing a constant amount of data. However, here: ```py
def add_squares(numbers):
squares = [x**2 for x in numbers]
return sum(squares)

last fulcrum Jun 14, 2021, 11:33 AM

#

austere sparrow When computing space complexity, we usually talk about auxillary space occupied.

Actually it's Space Complexity = Auxiliary space + Space use by input values

oak spindle Jun 14, 2021, 11:34 AM

#

austere sparrow The time complexity describes for how much longer the program will run when you ...

Thank you!!

topaz pulsar Jun 14, 2021, 11:34 AM

#

the function doesn't recieve a full copy of the data though, only a reference so I don't think the size of the input is even O(n) space

austere sparrow Jun 14, 2021, 11:36 AM

#

I think different sources use different terminology, but I don't think including the input size makes a lot of sense

#

especially in Python, when passing an argument to a function never copies memory

last fulcrum Jun 14, 2021, 11:41 AM

#

Join takes an iterable and then joins it to a string. Would join be creating a new string for each element in the iterable? Does this increase the time complexity of join?

austere sparrow Jun 14, 2021, 11:44 AM

#

austere sparrow Trick question: what is the space completixy of the function?

The practical answer is that this py def linear_search(list, target_value_to_find): for index, value in enumerate(list): if value == target_value_to_find: return index return None uses O(1) of auxiliary memory, but the pedantic answer is that it uses O(log(n)) of auxiliary memory, because the larger the index gets, the more space it occupies. But this will only matter if you have an unreasonable (~2**64) number of elements

last fulcrum Jun 14, 2021, 11:45 AM

#

austere sparrow The practical answer is that this ```py def linear_search(list, target_value_to_...

O(1)? Is it because all elements of the list are ints?

austere sparrow Jun 14, 2021, 11:46 AM

#

last fulcrum O(1)? Is it because all elements of the list are ints?

It doesn't matter what elements are in the list, because you're not copying them (value will refer to the same object that's stored in the list)

last fulcrum Jun 14, 2021, 11:48 AM

#

austere sparrow It doesn't matter what elements are in the list, because you're not copying them...

I'll say the truth. I still don't really understand space complexity.

#

aniblobsweat

austere sparrow Jun 14, 2021, 11:49 AM

#

well, it's sometimes tricky 🙂

last fulcrum Jun 14, 2021, 11:49 AM

#

I mainly didn't really learn it because I heard they don't ask about it

austere sparrow Jun 14, 2021, 11:54 AM

#

last fulcrum Join takes an iterable and then joins it to a string. Would join be creating a n...

Join creates a giant string in-place, without creating intermediate once, something like this:

def NOT_join(iterable):
    result = ""
    for s in iterable:
        result = result + s
    return result

def join(iterable):
    result_chars = []
    for s in iterable:
        result_chars.extend(chars_of(s))
    return string_from_chars(result)

last fulcrum Jun 14, 2021, 11:56 AM

#

austere sparrow Join creates a giant string in-place, without creating intermediate once, someth...

That makes more sense. So basically join creates something like a dynamic array and then finally returns it when it finishes going through the iterable.

austere sparrow Jun 14, 2021, 11:57 AM

#

yeah

#

You can see the CPython implementation: https://github.com/python/cpython/blob/main/Objects/unicodeobject.c#L10014-L10208
it's, uh... complicated, as always

last fulcrum Jun 14, 2021, 11:58 AM

#

austere sparrow You can see the CPython implementation: <https://github.com/python/cpython/blob/...

😂 Thanks for your help as always

green plaza Jun 14, 2021, 12:31 PM

#

I'm trying to draw a triangle using numpy. I found this formula to draw an n-sided polygon, but I don't know how to apply the formula in the code, so how do I apply this formula in numpy to draw a triangle?
https://stackoverflow.com/a/7198179/14243840

oak spindle Jun 14, 2021, 1:44 PM

#

c+ python? = cpython?

forest wadi Jun 14, 2021, 1:54 PM

#

oak spindle c+ python? = cpython?

yes

#

cpython is the open source python written in c

oak spindle Jun 14, 2021, 1:56 PM

#

nice, hearing for first time

loud trail Jun 14, 2021, 1:58 PM

#

"standard" python from python.org is cpython

red tangle Jun 14, 2021, 2:11 PM

#

Hi everyone
Could any one explain me how to set the time limit for user input in using python3.9.

oak spindle Jun 14, 2021, 2:35 PM

#

Gennady.Korotkevich

#

real inspiring story

last fulcrum Jun 14, 2021, 2:47 PM

#

red tangle Hi everyone Could any one explain me how to set the time limit for user input in...

Open a help channel. This channel is for data structures and algorithms.

granite beacon Jun 14, 2021, 4:39 PM

#

.

last fulcrum Jun 14, 2021, 5:53 PM

#

Should I learn insertion sort or I should skip it?

sage lance Jun 14, 2021, 6:07 PM

#

Hello. Someone could help me with a question? Please. I have 3 lists, products category list:[food, beverages, appetizers], for each categories, a products list:[chicken, meats, rice], [wine, beers, juices], [skewers, rolls, tartlets] and products stock list for each one, [12,34,10] [12,66,15],[6,4,5] I need to associate them. I could use: for categoría in categoria_products:
products_granos=categoria[0]?
Or what do you suggest me? Please. I am new.
Excuse my English

last fulcrum Jun 14, 2021, 6:38 PM

#

Do people read the banner lemon_intensifies

devout matrix Jun 14, 2021, 7:03 PM

#

Hello. What's the best way of converting a list to a dict and vise versa (depending on the number of occurences)?
Example:

# This list
["apple", "apple", "banana", "mango", "mango", "mango", "banana", "peach"]

# Would be converted into this dict
{
  "apple": 2,
  "banana": 2,
  "mango": 3,
  "peach": 1
}

# And vise versa

I know how to do it (using a loop) but I feel like I can do it another way so it's more efficient or uses a more dedicated function

grizzled schooner Jun 14, 2021, 7:04 PM

#

devout matrix Hello. What's the best way of converting a list to a dict and vise versa (depend...

look at the Counter data structure in the collections module of the stdlib

devout matrix Jun 14, 2021, 7:06 PM

#

I will. Thank you!

#

Just what I was looking for

fervent saddle Jun 14, 2021, 7:14 PM

#

Is next(iter(some_dict)) and next(iter(some_set)) technically O(n), at least in the worst case, since there’s a possibility it will have to look over a bunch of empty space?

O(n) for the set because everything might have been placed in the further half of the array, and O(n) for the dict since maybe all the key-value pairs from the front were deleted without deleting enough to trigger a resize

vocal gorge Jun 14, 2021, 7:40 PM

#

you mean, like, when you have tons of collisions and you deleted a lot of keys recently?

#

oh, right, next, just when you deleted lots of keys recently. hmm

fervent saddle Jun 14, 2021, 7:43 PM

#

I was thinking just about where everything gets hashed to in the case of the set, I wasn’t thinking collisions would affect anything

#

Yeah

#

Where everything gets places if you’re using a set, whether you’ve been deleting from the beginning if you’re using a dict

vocal gorge Jun 14, 2021, 7:46 PM

#

oh, that was easy to check

vocal gorge Jun 14, 2021, 7:46 PM

#

fervent saddle Is `next(iter(some_dict))` and `next(iter(some_set))` technically O(n), at least...

N =10**5
st = set(range(N))
%timeit next(iter(st))
#183 ns ± 24 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)
for i in range(N//10):
    st.remove(i)
%timeit next(iter(st))
#12.1 µs ± 189 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

#

looks like a solid yes

#

this works very well here because hash of ints is themselves, so sets of ints are sorted

fervent saddle Jun 14, 2021, 7:50 PM

#

I wonder if that’s still O(n) in the average case too with sets

#

Or if it’s different in the average case

#

But anyways, thanks for testing that

#

But yeah, you have to overallocate more and more extra space as the set increases in size. So it seems like the average amount of empty space you have to iterate over would increase too
Same with dicts, assuming you’re randomly deleting from them a random number of times. It seems like that would be O(n) in relation to the size of the dict after deleting from it

last fulcrum Jun 14, 2021, 7:52 PM

#

what's %timeit ?

fervent saddle Jun 14, 2021, 7:58 PM

#

timeit is a python module which helps you time how long python code takes to run

last fulcrum Jun 14, 2021, 8:31 PM

#

fervent saddle timeit is a python module which helps you time how long python code takes to run

Thanks. Never seen it before. I usually do
start_time = time.time() time.time() - start_time

fervent saddle Jun 14, 2021, 8:44 PM

#

It basically does that but a lot of times, a million by default, to get an even better measurement

last fulcrum Jun 14, 2021, 9:28 PM

#

fervent saddle It basically does that but a lot of times, a million by default, to get an even ...

what's the module name?

fervent saddle Jun 14, 2021, 9:29 PM

#

timeit

#

It’s not built-in, you have to install it

last fulcrum Jun 14, 2021, 9:29 PM

#

Thanks

last fulcrum Jun 14, 2021, 9:50 PM

#

I saw something on an MIT video. Use binary search in the insertion sort algorithm to improve the time complexity. I wonder if that still makes it insertion or really improves the complexity.

fiery cosmos Jun 14, 2021, 10:11 PM

#

You can use a binary search on the sorted portion to find where to insert it faster, yeah.

#

But then you still need to swap a lot so it's O(n^2) still MHXThink or now I'm not sure but this talks about it https://stackoverflow.com/a/18022248/10241514

grizzled schooner Jun 14, 2021, 10:46 PM

#

it would probably be O(n log n) if you were able to insert in constant time

#

but alas

fervent saddle Jun 14, 2021, 11:03 PM

#

Ok, I’ve been thinking about this, and now I’m wondering if my ideas are right:

next(set_iterator) is always average O(n) because the bigger the set is, the more empty spaces it will need to look over to find the next non-empty index.

next(dict_iterator)is average O(1) if you aren’t deleting from the dict because it won’t have to look over any empty spaces, but if you are deleting from the dict then it’s O(n) in relation to the current size of the dict.

next(OrderedDict_iterator) is always O(1) because it uses nodes for iteration

#

Is any of that not true?

onyx root Jun 14, 2021, 11:33 PM

#

I don't think next(set_iterator) is O(n).

#

if it were, then iterating a whole set would be O(n**2), which it isn't.

onyx root Jun 15, 2021, 12:13 AM

#

@fervent saddle it seems that sets don't shrink, so I guess something like this could happen with a lot of deletes.

fervent saddle Jun 15, 2021, 12:32 AM

#

onyx root <@!345078407702511618> it seems that sets don't shrink, so I guess something lik...

Yeah it does happen like that, you’re right: #algos-and-data-structs message
If you specifically remove from the beginning then it makes it take longer. But if you remove randomly then it doesn’t seem to make it take longer. So I guess it makes it O(n) in the worst case and O(1) on average.

#

~~But I don’t get why it doesn’t still increase without deletions~~

#

~~It has it set up so there’s the same average amount of empty space between non-empty indexes regardless of the size of the set if you haven’t been deleting from it?~~

fervent saddle Jun 15, 2021, 1:23 AM

#

Nevermind, I took a few minutes to think about it, and it seems really obvious now

#

It tries to keep the set around 50% full or something like that, so that means no matter what size the set is, each individual index should always have the same chance of not being empty

#

So it will always have to look over the same number of indexes on average

fiery cosmos Jun 15, 2021, 1:37 AM

#

I've been told recently that json is a bad database and you should use something else. Is this true? and if it is, what would you say is the best database?

loud trail Jun 15, 2021, 2:28 AM

#

fiery cosmos I've been told recently that json is a bad database and you should use something...

json is not a database, it's a file format. there is no "best" database, but for a lot of hobby applications sqlite is a good choice.

#

so there's your answer: json is a bad database because it's not a database.

rare orbit Jun 15, 2021, 3:59 AM

#

Hello, I was trying to define circularLinked list data type and wrote insert, insertfromlast, delete, size, rotate functions successfully. I initiated the class as ```class circularNode:
def init(self, v = None):
self.value = v
self.Next = None

class circularLinkedList:
def init(self, l = []):
self.head = None
self.tail = None
for x in reversed(l):
self.insert(x) But there is some trouble defining the reverse and map functionsdef map(self,f):
if self.size() == 1:
return circularLinkedList([f(self.head)])
x = self.head
for i in range(self.size()):
x = circularNode(f(x))
x = x.Next

def reverse(self):
    if self.size() == 1:
        return self
    if self.size() == 2:
        return self.rotate()
    elif self.size() > 2:
        Z = self.head
        X = self.delete().reverse()
        X.insert_last(Z)
        return X``` This is how I defined the two, but am getting error in both cases. Any help is highly appreciated. Thanks!

fiery cosmos Jun 15, 2021, 8:14 AM

#

https://stackoverflow.com/questions/67982075/i-tried-making-an-ai-chat-bot-in-python-language-using-ide-pycharm-via-the-code

Stack Overflow

I tried making an ai chat bot in python language using ide pycharm ...

import random
import json
import pickle
import numpy as np

import nltk
from nltk.stem import WordNetLemmatizer

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import D...

thorn turret Jun 15, 2021, 12:01 PM

#

ghost ping?

scenic goblet Jun 15, 2021, 1:41 PM

#

https://www.codechef.com/ZCOPRAC/problems/ZCO12002

https://www.codechef.com/viewsolution/47855929 ---> only 2nd subtask 4th case is wrong i am not able to find why it is. Can anybody help?

crude berry Jun 15, 2021, 2:06 PM

#

Is there a way to implement a faster the way ? An alternatives of the loop ? I want to add element in the trackContainer without passing through loops. stream.Part() is an object in which I want to add notes... And if I let as it is adding 1000 notes will take approximately 1 minute

Capture_decran_2021-06-15_a_16.04.47.png

fiery cosmos Jun 15, 2021, 5:25 PM

#

loops are pretty basic to the hardware

#

the hardware performs loops, not sure how you can bypass this

fiery cosmos Jun 15, 2021, 5:42 PM

#

I mean a cpu doing a quadratic instead of a loop could be created

#

but it is for a narrow and specialised application, not general purpose

dapper sapphire Jun 15, 2021, 9:23 PM

#

alternative to for loop would be while loop and recursion, and speed for all three is roughly the same.

#

I dont know if you can use multithreading and break it into chunks of 2 or 4?

lean ice Jun 15, 2021, 10:39 PM

#

so in my research about chess I came across https://www.chessprogramming.org/Transposition_Table, from my understanding it's used for ai/engines to reduce the amount of time required to find the best move in a pre-analysed position but would it be useful to avoid calculating all possible moves (for app generation not engine) as well or would the time/complexity of the lookup not be worth it?

loud trail Jun 15, 2021, 10:40 PM

#

recursion will be significantly slower in python @dapper sapphire , function calls are expensive

#

also limited recursion stack size

#

but in this case it probably won't matter if the operation is some slow i/o

dapper sapphire Jun 16, 2021, 4:56 AM

#

I looked up to see in what other languages recursion is expensive, and wow recursion is also expensive in Java and C. Thanks for pointing that out! Especially limited recursion stack..

weary briar Jun 16, 2021, 5:10 AM

#

@minor night I need help in coding a good code.

fervent saddle Jun 16, 2021, 5:17 AM

#

It’s expensive in C unless it uses tail-end optimization

#

And then it just uses a jump back to the start of the function instead of calling it again.

woven heart Jun 16, 2021, 5:26 AM

#

guys please recommend me any nice course for dsa in python

minor night Jun 16, 2021, 6:30 AM

#

weary briar <@!839027889608720406> I need help in coding a good code.

Ok, but I am more experienced with #user-interfaces so I can't really help you with data structure but if you need help in anything else then tag me.

last fulcrum Jun 16, 2021, 6:47 AM

#

fervent saddle It’s expensive in C unless it uses tail-end optimization

So in python even tail recursion is expensive?

fervent saddle Jun 16, 2021, 6:48 AM

#

Python doesn’t do tail-end optimization

last fulcrum Jun 16, 2021, 6:48 AM

#

Also, can someone help me with this? I keep getting None.

def recursive_binary_search(array, item_to_find, start=0, end=None):
    if end is None:
        end = len(array) - 1

    if end < start:
        return -1

    midpoint = (start + end) // 2

    if array[midpoint] == item_to_find:
        return midpoint

    elif array[midpoint] > item_to_find:
        recursive_binary_search(array, item_to_find, start, midpoint - 1)
    else:
        recursive_binary_search(array, item_to_find, midpoint + 1, end)

last fulcrum Jun 16, 2021, 6:49 AM

#

fervent saddle Python doesn’t do tail-end optimization

tail recursion is returning immediately after the recursion call

#

How is that not optimised?

feral shale Jun 16, 2021, 6:49 AM

#

You are not returning in your recursive calls

fervent saddle Jun 16, 2021, 6:50 AM

#

It’s when instead of actually calling the function again, it just compiles it with a jump back to the start of the function

last fulcrum Jun 16, 2021, 6:50 AM

#

feral shale You are not returning in your recursive calls

I return at midpoint == item to find

#

Or?

fervent saddle Jun 16, 2021, 6:50 AM

#

Python doesn’t do anything like that

odd steppe Jun 16, 2021, 6:51 AM

#

fervent saddle Python doesn’t do anything like that

??

last fulcrum Jun 16, 2021, 6:51 AM

#

fervent saddle Python doesn’t do anything like that

Wait are we talking about the same thing?

fervent saddle Jun 16, 2021, 6:51 AM

#

last fulcrum tail recursion is returning immediately after the recursion call

In response to this

last fulcrum Jun 16, 2021, 6:52 AM

#

It doesn't return immediately after the return call?

if I do return recursive function
Immediately after the function gives it's answer it returns.

last fulcrum Jun 16, 2021, 6:53 AM

#

feral shale You are not returning in your recursive calls

I just found it. Thanks

fervent saddle Jun 16, 2021, 6:54 AM

#

No, python doesn’t do anything to optimize recursive functions. It won’t optimize away recursive function calls

granite copper Jun 16, 2021, 6:55 AM

#

ermm

#

guys

#

how to download python woth correct version?

#

can anyone teach me?

fervent saddle Jun 16, 2021, 6:56 AM

#

So yeah, if a part of your code calls a function recursively, it will always have to go back up the call stack

last fulcrum Jun 16, 2021, 6:57 AM

#

fervent saddle So yeah, if a part of your code calls a function recursively, it will always hav...

I totally forgot promise.

#

Cause the previous functions would need those answers

granite copper Jun 16, 2021, 6:57 AM

#

errmmmm

last fulcrum Jun 16, 2021, 6:58 AM

#

granite copper errmmmm

Open a help channel

#

This is for DS & A

granite copper Jun 16, 2021, 6:58 AM

#

ok

jolly mortar Jun 16, 2021, 6:58 AM

#

If I have something like

for i in range(n):
  # inner loop running in log(i)

What is the overall time complexity?
i'd say it's log(1) + log(2) + ... + log(n-1) = log((n-1)*(n-2)...2*1) = log(n!)
is O(log(n!)) a thing?

last fulcrum Jun 16, 2021, 6:59 AM

#

fervent saddle No, python doesn’t do anything to optimize recursive functions. It won’t optimiz...

Can I get facts for this? I know I might go into an interview and someone might say do a recursive function. I want to be able to say no 😂 pepe_fat

last fulcrum Jun 16, 2021, 7:00 AM

#

jolly mortar If I have something like ```py for i in range(n): # inner loop running in log(...

O(n2). Any inner loop is quadratic.

jolly mortar Jun 16, 2021, 7:01 AM

#

huh

#

that's not necessarily true, if you have a log n inner loop, it becomes n log n, not n^2

fervent saddle Jun 16, 2021, 7:01 AM

#

http://neopythonic.blogspot.com/2009/04/tail-recursion-elimination.html
http://neopythonic.blogspot.com/2009/04/final-words-on-tail-calls.html

Here’s some blog posts by Guido where he says why he’ll never add tail recursion optimization

Tail Recursion Elimination

I recently posted an entry in my Python History blog on the origins of Python's functional features . A side remark about not supporting ta...

Final Words on Tail Calls

A lot of people remarked that in my post on Tail Recursion Elimination I confused tail self-recursion with other tail calls, which proper T...

#

Isn’t it n*log(n) ?

jolly mortar Jun 16, 2021, 7:04 AM

#

a SO post does say that n log n provides an upper bound on log(n!)

fervent saddle Jun 16, 2021, 7:05 AM

#

Well maybe not because of how log(n) increases

#

nvm

jolly mortar Jun 16, 2021, 7:05 AM

#

but i'm not sure why i'd consider the upper bound of log(n!), not log(n!) itself

fiery cosmos Jun 16, 2021, 7:26 AM

#

I think Θ(nlogn) = Θ(log(n!)). Not sure I can be rigorous but think about the terms of log(1) + log(2) + ... + log(n-2) + log(n-1). The latter half of that list of terms will all be within 1 or each other and they are all at least log(n/2). So you have log(n!) is at least (n/2)*log(n/2) which simplifies to Θ(nlogn). rooHmm

fervent saddle Jun 16, 2021, 8:08 AM

#

I read this:
https://www.khanacademy.org/computing/computer-science/algorithms/asymptotic-notation/a/big-big-theta-notation

#

Is it saying that if you were graphing how long an algorithm takes if you’re very lucky and it takes the least possible time, and you were also graphing how long it takes if it you’re very unlucky and it takes the most possible time, big-omega is the line of the lucky least long time, and big-O is the line of the unlucky most long time, and big-theta is whatever kind of line stays between those two lines?

#

Or is that still not right?

fiery cosmos Jun 16, 2021, 8:19 AM

#

That's right mostly I guess pithink Big-Omega is the lower bound so if you get really lucky you still have to spend at least that much time. Big-O is the upper bound - you might have to spend that much time if you get unlucky.

#

And when Big-O and Big-Omega have the same function then their Big-Theta is that function - bounded on both sides.

fervent saddle Jun 16, 2021, 8:23 AM

#

I asked “are they just different ways of saying best case and worst case complexity” in general, and some mods said no
#python-discussion message

limber chasm Jun 16, 2021, 8:24 AM

#

Hi guys, I have a quick question, if you're doing an algorithm and structure excercise and they ask you to order an unordered list, would be considered "cheating" if I used method like range, min and max ?

#

Okay, brillian thanks.

fervent saddle Jun 16, 2021, 8:26 AM

#

No I misread that somehow

#

lol I thought you were asking if you could straight up call my_list.sort()

limber chasm Jun 16, 2021, 8:26 AM

#

No.

#

For example.

for i in range(len(a)):
b.append(min(a))
a.pop(a.index(min(a)))

fervent saddle Jun 16, 2021, 8:26 AM

#

yeah sorry nvm

limber chasm Jun 16, 2021, 8:27 AM

#

That would sort a list called a, would that be accepted ?

fervent saddle Jun 16, 2021, 8:28 AM

#

Is the question just “find any way to sort a list without using sorted or list.sort” ?

#

If it is, then that seems ok

limber chasm Jun 16, 2021, 8:29 AM

#

The question doesn't have any constraints of what not to use. But would that be considered an algorithm and would that be acceptable even if they asked me not to use sorted or list.sort

fervent saddle Jun 16, 2021, 8:29 AM

#

It must say some restrictions. If it just says “sort a list” then nothing is stopping you from using list.sort

fiery cosmos Jun 16, 2021, 8:30 AM

#

fervent saddle I asked “are they just different ways of saying best case and worst case complex...

I think "no" is the correct answer to that but I'm not entirely clear on it myself CoalThinks

fervent saddle Jun 16, 2021, 8:30 AM

#

If you’re following all the restrictions it says, it should be fine

limber chasm Jun 16, 2021, 8:31 AM

#

Here's the question: "write a Python program to insert items into a list in sorted order"

fervent saddle Jun 16, 2021, 8:32 AM

#

It’s probably fine, but if you’re worried about min and range then you could use your own min function and use a while loop instead

limber chasm Jun 16, 2021, 8:32 AM

#

But even if they were to say to not use sorted or list.sort(), would the method I use go against the spirit of the question by using other methods/functions

limber chasm Jun 16, 2021, 8:32 AM

#

fervent saddle It’s probably fine, but if you’re worried about min and range then you could use...

Yes very good point, I didn't think of that. thanks.

fervent saddle Jun 16, 2021, 8:32 AM

#

But I think it should be fine

austere sparrow Jun 16, 2021, 10:25 AM

#

last fulcrum Can I get facts for this? I know I might go into an interview and someone might ...

!e

def factorial(n, a = 1):
    if n <= 0:
        return a
    return factorial(n - 1, a * n)

print(factorial(2000))

halcyon plankBOT Jun 16, 2021, 10:25 AM

#

@austere sparrow :x: Your eval job has completed with return code 1.

001 | Traceback (most recent call last):
002 |   File "<string>", line 6, in <module>
003 |   File "<string>", line 4, in factorial
004 |   File "<string>", line 4, in factorial
005 |   File "<string>", line 4, in factorial
006 |   [Previous line repeated 995 more times]
007 |   File "<string>", line 2, in factorial
008 | RecursionError: maximum recursion depth exceeded in comparison

austere sparrow Jun 16, 2021, 10:26 AM

#

Well, "python doesn’t do anything to optimize recursive functions" is not quite the right wording.
CPython doesn’t do anything to optimize recursive functions.

#

Optimizations aren't really part of the language spec.

#

There's already a binary search in the stdlib https://github.com/python/cpython/blob/3.9/Lib/bisect.py, you can look at it.

ivory yacht Jun 16, 2021, 10:36 AM

#

Just because CPython doesn't do tail call recursion optimisation or anything, it doesn't mean that recursion isn't a valid strategy to use for tasks its suited to.

austere sparrow Jun 16, 2021, 10:36 AM

#

yeah, that as well

#

well, you can always transform recursion into a queue if you're hitting a limit, but I don't think it's going to be faster

fervent saddle Jun 16, 2021, 10:39 AM

#

austere sparrow Well, "python doesn’t do anything to optimize recursive functions" is not quite ...

Some day I’ll finally stop forgetting to say “CPython” instead of just “python”

austere sparrow Jun 16, 2021, 10:39 AM

#

well, PyPy doesn't optimize it either, it seems 🙂

last fulcrum Jun 16, 2021, 11:22 AM

#

ivory yacht Just because CPython doesn't do tail call recursion optimisation or anything, it...

If iteration is faster, why worry yourself over recursion?

#

I'm mainly saying this because I'm not a fan of recursion 😂

ivory yacht Jun 16, 2021, 11:24 AM

#

Both approaches are proven to have the same computational power, indeed. But solutions for specific problems in with one or the other technique may be more elegant, and easier to understand .

#

For example, if you wanted to write a program that goes through a big nested tree, and does something to all the elements - say a JSON or HTML file - doing that recursively would be pretty straightforward.

#

Doing it with iteration can be done, but you'd have to basically manually implement the call stack in a list, and keep track of appending/popping things.

#

That sort of busywork distracts from what you're actually trying to do, making it harder to understand the code.

#

It's the same reason Python's for loop being an iterator is nicer than being forced to manually increment a counter.

onyx root Jun 16, 2021, 11:29 AM

#

@last fulcrum recursion is great for recursive data structures

last fulcrum Jun 16, 2021, 11:30 AM

#

ivory yacht For example, if you wanted to write a program that goes through a big nested tre...

Yes yes. I just thought for example of pythons os module. Os.walk and os.list dir

last fulcrum Jun 16, 2021, 11:31 AM

#

onyx root <@!230942498086846464> recursion is great for recursive data structures

I've never seen any of those. Maybe because I'm still learning arrays

ivory yacht Jun 16, 2021, 11:32 AM

#

Well, one is what you mentioned, the file system.

onyx root Jun 16, 2021, 11:32 AM

#

os.walk is an interesting case: it deals with a recursive structure (nested directories), but presents them to you as a linear iteration.

austere sparrow Jun 16, 2021, 11:41 AM

#

last fulcrum I've never seen any of those. Maybe because I'm still learning arrays

!e
Another example is when you want polymorphism:

import math
Env = dict[str, int]

class Expression:
    def evaluate(self, env: Env) -> int:
        raise NotImplementedError

class Val(Expression):
    def __init__(self, value: int):
        self.value = value

    def evaluate(self, _: Env):
        return self.value

class Var(Expression):
    def __init__(self, name: str):
        self.name = name

    def evaluate(self, env: Env):
        return env[self.name]

class Add(Expression):
    def __init__(self, *exprs: Expression):
        self.exprs = exprs

    def evaluate(self, env: Env):
        return sum(expr.evaluate(env) for expr in self.exprs)


class Mul(Expression):
    def __init__(self, *exprs: Expression):
        self.exprs = exprs

    def evaluate(self, env: Env):
        return math.prod(expr.evaluate(env) for expr in self.exprs)

expr = Add(  #  4 + 2x^2 + (-5)(x + y)
    Val(4),
    Mul(
        Val(2),
        Var("x"),
        Var("x"),
    ),
    Mul(
        Val(-5),
        Add(Var("x"), Var("y")),
    )
)
print(expr.evaluate({"x": 3, "y": 1}))

This is an example of mutual recursion. If you want to add a new type of expression, you don't need to change any other code.

halcyon plankBOT Jun 16, 2021, 11:41 AM

#

@austere sparrow :white_check_mark: Your eval job has completed with return code 0.

onyx root Jun 16, 2021, 11:42 AM

#

another good example of a recursive structure.

austere sparrow Jun 16, 2021, 11:44 AM

#

Now, technically you can evaluate expressions without recursion recursion. You can make a clever setup with a queue and treat recursion in a stackless way. In other words, replace the hardware stack with your own stack. (Example: stackless Python (if I understand what it's doing correctly), and a toy language I participated in https://github.com/gurkult/py-gurklang)

#

but it gets pretty complicated, and it's probably slower

onyx root Jun 16, 2021, 11:52 AM

#

I think the polymorphism is orthogonal to the recursion. The recursion is because expressions are a recursive structure. or maybe I'm missing a subtlety

austere sparrow Jun 16, 2021, 11:54 AM

#

If this didn't use mutual recursion, you couldn't (easily) add a new type of expression

#

if you have a fixed list of expressions, you can probably hack together some queue mechanism, or just transform it into a list of stack operations

fathom prairie Jun 16, 2021, 1:38 PM

#

how can i speed up loops?

#

are there some rules i should keep in mind while performing iterations to make it faster?

odd light Jun 16, 2021, 1:58 PM

#

how much processing are you doing on each iteration? It might be worth importing async and converting the iterations to coroutines

fathom prairie Jun 16, 2021, 2:16 PM

#

odd light how much processing are you doing on each iteration? It might be worth importing...

10 to the power 6 iterations minimum

#

and not much processing just performing a lookup

meager slate Jun 16, 2021, 2:27 PM

#

changing things into local variables may speed up loops slightly

austere sparrow Jun 16, 2021, 2:30 PM

#

Can you show the code @fathom prairie ?

fathom prairie Jun 16, 2021, 2:31 PM

#

sure

#

one sec

#

@austere sparrow

data = [...]
idx = 0
for i in range(len(data)):
	if i.id == some_int:
		idx = i
		break

austere sparrow Jun 16, 2021, 2:33 PM

#

Well, there's nothing in particular you can do to speed it up. Have you tried running it with PyPy?

#

also, you mean ```py
data = [...]
idx = 0
for i in data:
if i.id == some_int:
idx = i
break

fathom prairie Jun 16, 2021, 2:33 PM

#

yes right

austere sparrow Jun 16, 2021, 2:34 PM

#

If you're doing that loop many times, maybe you can use a dict like {id: value} instead of a list?

fathom prairie Jun 16, 2021, 2:34 PM

#

memory usage

austere sparrow Jun 16, 2021, 2:34 PM

#

dicts have a constant time lookup

#

@fathom prairie How large is the list?

fathom prairie Jun 16, 2021, 2:35 PM

#

can be up to 10^6

#

more in some cases

austere sparrow Jun 16, 2021, 2:35 PM

#

A million-element dict is literally nothing

fathom prairie Jun 16, 2021, 2:35 PM

#

lol

#

alright, thanks for the help

austere sparrow Jun 16, 2021, 2:38 PM

#

@fathom prairie According to my quick benchmark, it's about 8 5 megabytes if the integers are not very big.

#

wait, no

#

It will be 5 megabytes (5242968 bytes in my case), because the keys will already be part of your objects.

fathom prairie Jun 16, 2021, 2:39 PM

#

it's about 40mb

austere sparrow Jun 16, 2021, 2:39 PM

#

?

fathom prairie Jun 16, 2021, 2:39 PM

#

ok wait i made a mistake

#

yeah it's 3mb according to my benchmark

austere sparrow Jun 16, 2021, 2:40 PM

#

If you're searching by the id multiple times, then yes, a dict is going to be way better

fathom prairie Jun 16, 2021, 2:40 PM

#

hm

#

thanks

main coral Jun 16, 2021, 2:53 PM

#

austere sparrow also, you mean ```py data = [...] idx = 0 for i in data: if i.id == some_int...

tiny queston how do you write the code part on discord?

#

the formatting i mean..

austere sparrow Jun 16, 2021, 3:09 PM

#

main coral tiny queston how do you write the code part on discord?

!code

halcyon plankBOT Jun 16, 2021, 3:09 PM

#

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

loud trail Jun 16, 2021, 3:15 PM

#

@fathom prairie let python do the looping:

result = next((val for val in data if val.id == some_int), None)

distant sable Jun 16, 2021, 3:16 PM

#

i need help with a if else code i made, is this the right channel?

loud trail Jun 16, 2021, 3:16 PM

#

@distant sable maybe a help channel. see #❓｜how-to-get-help

distant sable Jun 16, 2021, 3:17 PM

#

ah i see, thank you 🙂

loud trail Jun 16, 2021, 3:17 PM

#

huh, surprising!

In [3]: %paste
data = list(range(1_000_000))

def loop1(data):
    for val in data:
        if val == 545:
            return val

def loop2(data):
    result = next((val for val in data if val == 545), None)
    return result

## -- End pasted text --

In [4]: %timeit loop1(data)
15 µs ± 282 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

In [5]: %timeit loop2(data)
15.3 µs ± 538 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

#

i guess generators aren't any faster than for loops nowadays

#

iirc they used to be

austere sparrow Jun 16, 2021, 3:20 PM

#

@loud trail the generator already does the same for loop

loud trail Jun 16, 2021, 3:21 PM

#

right, i thought for had more overhead

#

but this is why we benchmark, after all

#

if anything the plain for loop is slightly faster, less overhead from next

#

In [2]: %%cython
   ...: def loop3(data: list):
   ...:     for val in data:
   ...:         if val == 545:
   ...:             return val

In [7]: %timeit loop3(data)
3.06 µs ± 237 ns per loop (mean ± std. dev. of 7 runs, 100000 loops each)

of course the cython version is 4x faster

#

In [4]: %timeit loop1(data)
894 ns ± 75.1 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

In [5]: %timeit loop2(data)
1.01 µs ± 24.4 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

and pypy is 3x faster than cython!

#

can't test easily on graalvm, no ipython support 😦

#

or maybe it does, let me see

#

and for completeness, if you do data_np = np.asarray(data) then numba (under cpython) can match pypy performance

In [32]: loop4 = njit(loop1)

In [33]: %timeit loop4(data_np)
893 ns ± 71.2 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

halcyon plankBOT Jun 16, 2021, 3:32 PM

#

Hey @fiery cosmos!

It looks like you tried to attach file type(s) that we do not allow (.pdf). We currently allow the following file types: .gif, .jpg, .jpeg, .mov, .mp4, .mpg, .png, .mp3, .wav, .ogg, .webm, .webp, .flac, .m4a.

Feel free to ask in #community-meta if you think this is a mistake.

lyric burrow Jun 16, 2021, 4:36 PM

#

is implementing linked-lists in python really better than using lists?
considering that list is a primitive data structure and implemented in C, comes built-in with python

austere sparrow Jun 16, 2021, 4:38 PM

#

lyric burrow is implementing linked-lists in python really better than using lists? consideri...

Linked lists are almost never better 🙂

lyric burrow Jun 16, 2021, 4:40 PM

#

then why would people learn them?

#

and what are the scenarios they are better?

vocal gorge Jun 16, 2021, 4:40 PM

#

lyric burrow and what are the scenarios they are better?

Pretty much only for teaching DSA.

#

there's a rant here about the limited usability of linked lists:
https://rust-unofficial.github.io/too-many-lists/index.html#an-obligatory-public-service-announcement

lyric burrow Jun 16, 2021, 4:41 PM

#

this is funny

fervent saddle Jun 16, 2021, 5:10 PM

#

fathom prairie <@461097636791844865> ```py data = [...] idx = 0 for i in range(len(data)): if ...

You could also sort the list by each object’s id and do binary search, unless you’re adding to the list after initially creating it because then you’ll have to insert them into the correct position which is O(n) anyways. I think inserting should still be faster than searching through the list since it’s a compiled function, but it still would probably not be very good.

The hash map idea might be better for this situation, but binary search might be a good thing to know about for the future

fervent saddle Jun 16, 2021, 5:18 PM

#

lyric burrow is implementing linked-lists in python really better than using lists? consideri...

Also python lists don’t use linked lists, they use arrays. So it wouldn’t be the same kind of data structure

austere sparrow Jun 16, 2021, 5:25 PM

#

I think collections.deque is a linked list.

#

!d collections.deque

halcyon plankBOT Jun 16, 2021, 5:25 PM

#

collections.deque


class collections.deque([iterable[, maxlen]])```
Returns a new deque object initialized left-to-right (using [`append()`](https://docs.python.org/3/library/collections.html#collections.deque.append "collections.deque.append")) with data from *iterable*. If *iterable* is not specified, the new deque is empty.

Deques are a generalization of stacks and queues (the name is pronounced “deck” and is short for “double-ended queue”). Deques support thread-safe, memory efficient appends and pops from either side of the deque with approximately the same O(1) performance in either direction.

Though [`list`](https://docs.python.org/3/library/stdtypes.html#list "list") objects support similar operations, they are optimized for fast fixed-length operations and incur O(n) memory movement costs for `pop(0)` and `insert(0, v)` operations which change both the size and position of the underlying data representation.

vocal gorge Jun 16, 2021, 5:31 PM

#

strangely, access in deques is way faster than I'd expect from linked lists, though linear:

import collections
d = collections.deque(range(100000))
%timeit d[0]
%timeit d[10]
%timeit d[100]
%timeit d[1000]
%timeit d[10000]

62 ns ± 5.44 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)
58.6 ns ± 3.5 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)
69 ns ± 10.1 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)
69.6 ns ± 4.97 ns per loop (mean ± std. dev. of 7 runs, 10000000 loops each)
317 ns ± 2.7 ns per loop (mean ± std. dev. of 7 runs, 1000000 loops each)

fervent saddle Jun 16, 2021, 5:31 PM

#

If you put all the nodes of a linked list in an array, is it still considered a real linked list?
I wonder if the deque does that

vocal gorge Jun 16, 2021, 5:31 PM

#

10000 steps over a linked list in ~250ns..

fervent saddle Jun 16, 2021, 5:34 PM

#

Also I wonder why that doesn’t start increasing until after 1000

vocal gorge Jun 16, 2021, 5:35 PM

#

before that, the function call itself is taking the 60ns, probaby.

#

that's about how long an empty function takes

austere sparrow Jun 16, 2021, 5:35 PM

#

I think it's not a naive linked list, instead it's a linked list of blocks

#

https://github.com/python/cpython/blob/main/Modules/_collectionsmodule.c#L82-L99

#

typedef struct BLOCK {
    struct BLOCK *leftlink;
    PyObject *data[BLOCKLEN];
    struct BLOCK *rightlink;
} block;

typedef struct {
    PyObject_VAR_HEAD
    block *leftblock;
    block *rightblock;
    Py_ssize_t leftindex;       /* 0 <= leftindex < BLOCKLEN */
    Py_ssize_t rightindex;      /* 0 <= rightindex < BLOCKLEN */
    size_t state;               /* incremented whenever the indices move */
    Py_ssize_t maxlen;          /* maxlen is -1 for unbounded deques */
    Py_ssize_t numfreeblocks;
    block *freeblocks[MAXFREEBLOCKS];
    PyObject *weakreflist;
} dequeobject;

fervent saddle Jun 16, 2021, 5:37 PM

#

Is that like a linked list where each node holds an array instead of one single value, so it doesn’t need to go through as many nodes to reach the nth index?

vocal gorge Jun 16, 2021, 5:37 PM

#

oh, cool.

#

I keep forgetting this fact and getting told that

#

I think I'm learning how deques are structured for the third time now.

fervent saddle Jun 16, 2021, 6:46 PM

#

austere sparrow ```c typedef struct BLOCK { struct BLOCK *leftlink; PyObject *data[BLOCK...

Also if the block size is 64 then why doesn’t the time it takes to index it increase between index 100 and index 1000?

austere sparrow Jun 16, 2021, 6:49 PM

#

fervent saddle Also if the block size is 64 then why doesn’t the time it takes to index it incr...

🤔

#

it does for me

#

idk why it doesn't change between 10 and 100 here, though

fervent saddle Jun 16, 2021, 6:49 PM

#

vocal gorge strangely, access in deques is way faster than I'd expect from linked lists, tho...

His test didn’t, that’s weird

#

I was going to guess that maybe there’s an array with references to some of the blocks at the start and at the end. But maybe it doesn’t do that after all

sinful tartan Jun 16, 2021, 6:57 PM

#

question i saw somethign and i was just wondering what is did
heres the code snippet

cards = ['1','2','3','4','5','6','7','8','9','10','11']*4```

#

what does the *4 at the end of the list do to it?

vocal gorge Jun 16, 2021, 6:58 PM

#

!e

print([1,2,3]*3)

halcyon plankBOT Jun 16, 2021, 6:58 PM

#

@vocal gorge :white_check_mark: Your eval job has completed with return code 0.

[1, 2, 3, 1, 2, 3, 1, 2, 3]

vocal gorge Jun 16, 2021, 6:58 PM

#

duplicates the list 4 times.

sinful tartan Jun 16, 2021, 7:04 PM

#

o

#

intresting

fervent saddle Jun 16, 2021, 11:24 PM

#

If there was a version of random.choice for dicts and sets which went to a random place in the array and then went forward or backward (maybe the direction could be chosen randomly) until it reached a non-empty / non-deleted space, would that have a problem of being less random?

#

Or is the main reason there’s nothing like that available just because they wanted to try to keep things in python and out of the underlying C code as much as possible? Or is there a different reason?

fathom prairie Jun 17, 2021, 4:30 AM

#

fervent saddle You could also sort the list by each object’s id and do binary search, unless yo...

thanks for the help, i used bisect.insort to always keep my list sorted and used binary search for lookups

fiery cosmos Jun 17, 2021, 7:11 AM

#

loops is difficoult

last fulcrum Jun 17, 2021, 8:15 AM

#

vocal gorge there's a rant here about the limited usability of linked lists: <https://rust-u...

Great article.

#

I just finished learning arrays. Anyone knows where I can get some problems to solve? Mainly easy and medium.

#

Also, can someone explain amortization in simple terms with an example for me? I learnt about it but it's not sticking. pithink

last fulcrum Jun 17, 2021, 8:23 AM

#

vocal gorge !e ```py print([1,2,3]*3) ```

I wonder why they wrote python like that? 1,2,3 all refer to the same elements here. Maybe to save space? Make it faster?

fiery cosmos Jun 17, 2021, 8:29 AM

#

last fulcrum I wonder why they wrote python like that? 1,2,3 all refer to the same elements h...

What do you mean pithink it's more of a syntax convenience so you don't need to write out print([1, 2, 3, 1, 2, 3, 1, 2, 3])

fiery cosmos Jun 17, 2021, 8:31 AM

#

last fulcrum I just finished learning arrays. Anyone knows where I can get some problems to s...

https://www.codewars.com/kata/search/python?q=&r[]=-8&r[]=-7&r[]=-6&r[]=-5&tags=Arrays&beta=false&order_by=popularity desc

Codewars

Codewars is where developers achieve code mastery through challenge. Train on kata in the dojo and reach your highest potential.

last fulcrum Jun 17, 2021, 8:32 AM

#

fiery cosmos What do you mean <:pithink:652247559909277706> it's more of a syntax convenience...

They all refer to the same elements in memory.

Here's the explanation.

To quickly initialize a one-dimensional list, we generally rely on a syntax such as data = [0] * n to create a list of n zeros. On page 189, we emphasized that from a technical perspective, this creates a list of length n with all entries referencing the same integer instance, but that there was no meaningful consequence of such aliasing because of the immutability of the int class in Python.

fiery cosmos Jun 17, 2021, 8:35 AM

#

Isn't the meaningful consequence that you now have a zeroed list of ints to do stuff with? beecloseloaf

last fulcrum Jun 17, 2021, 8:37 AM

#

I don't know. That was not my original question. I was just asking why python would make everything reference the same ints.

#

Thanks for the code wars suggestion.

flat sorrel Jun 17, 2021, 8:49 AM

#

last fulcrum I wonder why they wrote python like that? 1,2,3 all refer to the same elements h...

Any performance increase is likely negated by the fact that you're using Python to start with xD It's mainly for convenience.

#

Besides, Python automatically caches small integers so they would refer to the same object regardless

#

!e

a = 2
b = 2
print(a is b)

c = 3000
d = 3000
print(c is d)

halcyon plankBOT Jun 17, 2021, 8:50 AM

#

@flat sorrel :white_check_mark: Your eval job has completed with return code 0.

001 | True
002 | True

flat sorrel Jun 17, 2021, 8:51 AM

#

hmm

#

the second one prints False on my PC, the implementation used for the bot probably caches more values than usual

fervent saddle Jun 17, 2021, 9:06 AM

#

I thought CPython always interned constant ints

#

!e ```py
def f(n):
return n + 1

x = 3000
y = f(2999)
print(x is y)```

halcyon plankBOT Jun 17, 2021, 9:09 AM

#

@fervent saddle :white_check_mark: Your eval job has completed with return code 0.

False

fervent saddle Jun 17, 2021, 9:10 AM

#

!e ```py
def f(n):
return n

x = 3000
y = f(3000)
print(x is y)```

halcyon plankBOT Jun 17, 2021, 9:10 AM

#

@fervent saddle :white_check_mark: Your eval job has completed with return code 0.

True

fervent saddle Jun 17, 2021, 9:24 AM

#

last fulcrum Also, can someone explain amortization in simple terms with an example for me? I...

It means something looks like it has a certain complexity in the context of a loop or something like that, even though it might not actually have that complexity. I think the most common example, which you might have heard already, is appending to a list. It has linear time complexity because the array needs to resize and copy everything if it runs out of space, but that happens infrequently enough because of overallocation that it seems like it has constant complexity if you’re appending in a loop, so it has constant amortized time complexity.

onyx root Jun 17, 2021, 10:30 AM

#

fervent saddle I thought CPython always interned constant ints

only in a certain range

polar chasm Jun 17, 2021, 3:54 PM

#

What is the best place to learn DSA?

keen hearth Jun 17, 2021, 4:11 PM

#

polar chasm What is the best place to learn DSA?

Check out the pinned messages in this channel for some recommendations.

polar chasm Jun 17, 2021, 4:18 PM

#

keen hearth Check out the pinned messages in this channel for some recommendations.

Thanks for your reference.

last fulcrum Jun 17, 2021, 9:18 PM

#

fervent saddle It means something looks like it has a certain complexity in the context of a lo...

Thank you

ionic heart Jun 18, 2021, 12:13 AM

#

keen hearth Check out the pinned messages in this channel for some recommendations.

hey i referred to them but can you suggest some video courses with python implementation as i struggle with written content.

fiery cosmos Jun 18, 2021, 1:39 AM

#

MIT has some full (though bit old) courses in DSA all in Python 2 https://ocw.mit.edu/courses/electrical-engineering-and-computer-science/6-006-introduction-to-algorithms-fall-2011/

MIT OpenCourseWare

Introduction to Algorithms

This course provides an introduction to mathematical modeling of computational problems. It covers the common algorithms, algorithmic paradigms, and data structures used to solve these problems. The course emphasizes the relationship between algorithms and programming, and introduces basic performance measures and analysis techniques for these p...

rustic helm Jun 18, 2021, 4:02 AM

#

I am looping over files in a directory and would like to check if a file matches a glob specified in .gitignore file. One possible way to do this will be to simply loop over the files and then loop over each glob in .gitignore and check if the file matches one of the globs. But this will have a complexity of n^2. I am wondering if there would be a better way to do this?

ivory yacht Jun 18, 2021, 4:15 AM

#

It's not quite a n^2 complexity, it's more m*n since there's two different quantities - and you'd expect the number of globs to be far less than the number of total files. One thing you could do is some pre-processing, parse all the globs into some sort of object, and/or first for each folder you check do a filtering step to skip any globs that can't apply to stuff in that folder (because they reference another folder).

But first you want to ensure there's actually a performance issue here, before worrying about elaborate optimisations.

#

It is fairly likely the IO overhead from checking files might dominate the globbing.

#

Also a big speedup that the gitignore docs specifically mention - check each directory, if it's ignored, you can skip recursing its whole tree.

ivory yacht Jun 18, 2021, 4:33 AM

#

rustic helm I am looping over files in a directory and would like to check if a file matches...

Looks like the actual implementation of git's file searching logic is here, though in C so not the easiest to understand. Looks like it does do partial matching with the folder name, to find which globs can apply to any files/subfolders there. That way as you recurse down the directory tree, the globs can be pruned down to the relevant ones.
https://github.com/git/git/blob/master/dir.c

fervent saddle Jun 18, 2021, 5:16 AM

#

I was thinking about if you were trying to make battleships but with any grid size, ship sizes, and number of ships. Something I was thinking about was how the computer opponent might have a problem placing ships if there isn’t a lot of space.

For example, if you have a 20x20 grid, and you place forty 10-length ships so the entire board is occupied, how will the computer opponent efficiently figure out a way to place the ships? It seems like recursively checking different combinations might quickly start taking a long time with larger grid sizes.

Is there a name for this problem and a good way that people have figured out to solve it?

fiery cosmos Jun 18, 2021, 5:25 AM

#

idk but that's an interesting problem rooHmm

dapper sapphire Jun 18, 2021, 5:43 AM

#

Is f string also known as string interpolation?

foo = "Foo"
print(f"My name is {foo}")

rustic helm Jun 18, 2021, 5:46 AM

#

I don't think so. Python has it's own string interpolation.

#

!e
"hello %s" "world"

dapper sapphire Jun 18, 2021, 5:47 AM

#

So the below is called f-string?

foo = "Foo"
print(f"My name is {foo}")

rustic helm Jun 18, 2021, 5:48 AM

#

yes it is

#

!e
print("%s %s" %('Hello','World',))

halcyon plankBOT Jun 18, 2021, 5:49 AM

#

@rustic helm :white_check_mark: Your eval job has completed with return code 0.

Hello World

half lotus Jun 18, 2021, 5:49 AM

#

Both % and f-strings are considered string interpolation.

rustic helm Jun 18, 2021, 5:50 AM

#

Yes that's right. Sorry for the wrong info earlier. Here is an article https://www.programiz.com/python-programming/string-interpolation

Python String Interpolation

In this article we will learn about the python string interpolation. Python supports multiple ways to format text strings and these includes %-formatting, sys.format(), string.Template and f-strings.

dapper sapphire Jun 18, 2021, 5:50 AM

#

ok thanks!
Yeah I found the pep 498

rustic helm Jun 18, 2021, 5:51 AM

#

ivory yacht It is fairly likely the IO overhead from checking files might dominate the globb...

So you are saying that I should not worry about optimization first and come back to it later? Should I go with the m*n complexity algorithm then?

dapper sapphire Jun 18, 2021, 5:51 AM

#

Which goes over different types of string interpolation

ivory yacht Jun 18, 2021, 5:54 AM

#

rustic helm So you are saying that I should not worry about optimization first and come back...

You should run a profiler on your code, to time functions and see where any slowdowns occur - there's no point spending all of your time on a function that isn't slow to begin with. cProfile is available in the stdlib, but there's several around to pick from.

#

Just run python -m cProfile yourcode.py.

rustic helm Jun 18, 2021, 5:58 AM

#

ivory yacht You should run a profiler on your code, to time functions and see where any slow...

I am using rust for this particular project. Is there something similar?

ivory yacht Jun 18, 2021, 6:09 AM

#

Searching "rust profiler" gave me quite a few hits, though I'm not a Rust programmer so I can't exactly recommend one.

rustic helm Jun 18, 2021, 6:30 AM

#

yes but they all seem a bit difficult to use. Anyways thanks @ivory yacht

fiery cosmos Jun 18, 2021, 6:46 AM

#

thanks!

signal bridge Jun 18, 2021, 7:14 AM

#

Hey!
I have question about how dictionaries are implemented in py.
How am I able to iterate through the dictionary?
cause in essence, the thing that makes dicts faster is the fact that they keys are not stored right? Or am I getting something wrong here?

I am sorry if this is a nub question, any material that I can read up is also appreciated.

last fulcrum Jun 18, 2021, 7:24 AM

#

signal bridge Hey! I have question about how dictionaries are implemented in py. **How am I ab...

A dict is a hash table. That's what makes dicts faster. Same as a set. Here's the definition of a hash table.

Hash Table is a data structure which stores data in an associative manner. In a hash table, data is stored in an array format, where each data value has its own unique index value. Access of data becomes very fast if we know the index of the desired data.

Thus, it becomes a data structure in which insertion and search operations are very fast irrespective of the size of the data. Hash Table uses an array as a storage medium and uses hash technique to generate an index where an element is to be inserted or is to be located from.

fiery cosmos Jun 18, 2021, 7:34 AM

#

signal bridge Hey! I have question about how dictionaries are implemented in py. **How am I ab...

!e a normal for loop goes through the keys of a dictionary py dct = {1: "A", 2: "B", 3: "C"} for key in dct: val = dct[key] print(key, val)

halcyon plankBOT Jun 18, 2021, 7:34 AM

#

@fiery cosmos :white_check_mark: Your eval job has completed with return code 0.

001 | 1 A
002 | 2 B
003 | 3 C

fervent saddle Jun 18, 2021, 7:49 AM

#

signal bridge Hey! I have question about how dictionaries are implemented in py. **How am I ab...

You mean to ask how you’re able to iterate through it in the order you put thing in?

#

Or you’re just asking how iteration works at all for things that use hashing, like dicts and sets?

#

I’m pretty sure the way it deals with empty or deleted index positions is that it just ignores them and moves on to the next index

signal bridge Jun 18, 2021, 8:45 AM

#

fervent saddle Or you’re just asking how iteration works at all for things that use hashing, li...

Yea, iteration for hashed data structures, Like a key has to be given into the hash function in order to give out the address at which the value is stored right? So, how does the dict know that these are the keys that are there in the dictionary that I have initialized, since the hash function is not reversible right?

fervent saddle Jun 18, 2021, 8:50 AM

#

I think it just goes through the array that stores everything, index by index, and ignores all the unused spaces

#

It doesn’t have to use hashing for iteration, it just goes through the underlying array like it would with any other array

signal bridge Jun 18, 2021, 8:54 AM

#

fervent saddle It doesn’t have to use hashing for iteration, it just goes through the underlyin...

Ohh, so in memory, there is an array which has all the keys stored?

fervent saddle Jun 18, 2021, 8:55 AM

#

Yeah

signal bridge Jun 18, 2021, 8:55 AM

#

Oh sweet, that answers my doubt! Thanks @fervent saddle

keen hearth Jun 18, 2021, 9:04 AM

#

@signal bridge It keeps all the entries (key-value pairs) of the dictionary in an array. The actual hash table contains indices of entries in this array.

#

Here's the source: https://github.com/python/cpython/blob/main/Objects/dictobject.c

#

Some notes: https://github.com/python/cpython/blob/main/Objects/dictnotes.txt

#

And I think Raymond Hettinger must have done a talk on this at some point 🤔 You can probably find that on YouTube.

#

Dictionaries are an important data-structure in Python, so some thought has been put into their design.

#

Ah. Here's the talk. Well worth watching: https://www.youtube.com/watch?v=p33CVV29OG8

YouTube

SF Python

Modern Dictionaries by Raymond Hettinger

Abstract
Python's dictionaries are stunningly good. Over the years, many great ideas have combined together to produce the modern implementation in Python 3.6. This fun talk is given by Raymond Hettinger, the Python core developer responsible for the set implementation and who designed the compact-and-ordered dict implemented in CPython for Pyth...

▶ Play video

fiery cosmos Jun 18, 2021, 9:45 AM

#

Hello
i had this subject and i dont know how i can do
I don’t even know how i can start
Can somebody help me please ?

lyric burrow Jun 18, 2021, 9:59 AM

#

how do you go about learning the string data structure in python?
or is it even relevant?

vocal gorge Jun 18, 2021, 10:04 AM

#

Chapter 4 of Fluent Python has a detailed description, but basically:

From the Python side, a string acts like an immutable sequence of characters
On the CPython implementation side, a string is an array of UTF-8 encoded bytes (note that a single character is often more than one byte!)

#

(this implies some things such that:

s = ""
for i in range(n):
    s += "a"

is O(n^2), because each += has to copy the entire string to a new position in memory. This was indeed true until Python2.7 introduced an optimization that, when possible, resizes the array inplace, making the above snippet O(n).

Nevertheless, because of this, many people still prefer appending to a list and then "".joining it, which is always O(n).
)

ivory yacht Jun 18, 2021, 10:15 AM

#

CPython doesn't use UTF-8, actually (though PyPy does). Depending on the characters in the string, it uses a 1, 2 or 4-byte buffer of code points.

#

That way indexing is still O(1), but pure ASCII strings don't waste memory.

onyx root Jun 18, 2021, 10:53 AM

#

lyric burrow how do you go about learning the string data structure in python? or is it even ...

whether it's relevant depends on why you are asking about it. Almost no one needs to be concerned with that implementation detail.

last fulcrum Jun 18, 2021, 12:45 PM

#

Has anyone used cracking the coding interview.

signal bridge Jun 18, 2021, 12:46 PM

#

keen hearth Ah. Here's the talk. Well worth watching: https://www.youtube.com/watch?v=p33CVV...

Thanks mate!

knotty magnet Jun 18, 2021, 3:18 PM

#

vocal gorge (this implies some things such that: ```py s = "" for i in range(n): s += "a...

using a list then joining also has the side effect of looking much nicer

fervent saddle Jun 18, 2021, 4:36 PM

#

Is it possible to have an indexable linked list type of thing if you’re only ever moving nodes to the end, and never adding new nodes or inserting at intermediate positions?

#

Or is it still not possible to have indexing for something like that?

severe osprey Jun 18, 2021, 6:10 PM

#

fervent saddle Is it possible to have an indexable linked list type of thing if you’re only eve...

what do u mean by moving nodes to the end\

fervent saddle Jun 18, 2021, 6:50 PM

#

severe osprey what do u mean by moving nodes to the end\

0 -> 1 -> 2 -> 3 -> 4
to this:
0 -> 1 -> 3 -> 4 -> 2

austere sparrow Jun 18, 2021, 6:50 PM

#

@fervent saddle You could keep a dict from indices to nodes

#

and then change things in that dict when you modify the list

#

hm, no, it wouldn't really work

fervent saddle Jun 18, 2021, 7:05 PM

#

It feels like it’s one of those thing where if there’s a solution that involves a full blown hash map, there’s a better one that involves index mapping

#

But I can’t think of a solution with either

fiery cosmos Jun 18, 2021, 7:10 PM

#

fervent saddle Is it possible to have an indexable linked list type of thing if you’re only eve...

I think you could but it would emulate an array afaik pithink when moving a node to the end how would you maintain the indexes without shifting the nodes in O(n)

fervent saddle Jun 18, 2021, 7:15 PM

#

Actually, I’ve been thinking about it, and isn’t OrderedDict something that does something like this?

#

But fix error said a hash map wouldn’t work, so maybe this is different in some way I don’t notice

knotty magnet Jun 18, 2021, 7:24 PM

#

it's not indexable

ivory yacht Jun 18, 2021, 7:24 PM

#

A linked list allows you to shift nodes around arbitrarily and efficiently, but the problem is you can't constant-time index it - indexing requires going through the nodes one by one.

#

Something that might work reasonably well is to not actually remove the nodes - leave None there, then append the node. Have a secondary list, which records the start/end indexes for each continuous "run" of non-none nodes. Then when indexing, you go through the list of runs, and use that to skip over the None elements. You'd also want to watch how many holes are present, and once you have more than some threshold compact down the list by removing all the Nones.

fervent saddle Jun 18, 2021, 7:53 PM

#

Ok I think I see what the difference is

#

With OrderedDict, if you move a node to the end, you don’t have to update any of the other keys

#

But with this, like other people have said, you’d need to update all the further along indexes

#

So that’s why OrderedDict wouldn’t work

fiery cosmos Jun 18, 2021, 9:56 PM

#

I'm practicing leetcode in every day but I find it difficult to set up algorithms as the problems get harder. Can you give me an idea on how to improve my algorithm knowledge. What is your recommendation for book or video etc.

wicked slate Jun 19, 2021, 3:02 AM

#

fiery cosmos I'm practicing leetcode in every day but I find it difficult to set up algorithm...

I like The Algorithm Design Manual, I shared a link below. The code is in C but I find it to be pretty generalizable. It does cover a ton of algorithms, so I'd focus on the most important ones for interviews to not get overwhelmed (Graphs & DP are where I found this book most helpful). The second half of the book also just lists common algorithm problems, so you can go through and understand them.

When I'm looking for a video explanation, I like Abdul Bari and Gaurav Sen on YouTube. Those aren't the only good channels though. Best of luck!

https://github.com/aforarup/interview/blob/master/Data Structures and Algorithm/Algorithm Books/The Algorithm Design Manual by Steven S. Skiena.pdf

GitHub

aforarup/interview

Resources to crack the next coding interview. Contribute to aforarup/interview development by creating an account on GitHub.

vivid horizon Jun 19, 2021, 4:26 AM

#

fiery cosmos !e a normal for loop goes through the keys of a dictionary ```py dct = {1: "A", ...

Could also do:

for key, val in dct```

fiery cosmos Jun 19, 2021, 4:27 AM

#

that would be dct.items()

faint sundial Jun 19, 2021, 4:44 AM

#

What should be my first project about algos and data structs?

#

I currently know literally nothing about it

gentle spoke Jun 19, 2021, 6:21 AM

#

hello everyone

#

I'm having trouble implementing (and maybe even understanding) the interaction between an ordered dictionary and a binary search tree

#

i must implement a BST that relies on an ordered dictionary, but I'm a little at loss

fiery cosmos Jun 19, 2021, 6:36 AM

#

i need help

#

for a question

#

def multiplication_table(start, stop):
    for x in ___:
        for y in ___:
            print(str(x*y), end=" ")
        print()

multiplication_table(1, 3)
# Should print the multiplication table shown above

#

fill in the blanks

#

This function prints out a multiplication table (where each number is the result of multiplying the first number of its row by the number at the top of its column). Fill in the blanks so that calling multiplication_table(1, 3) will print out:

1 2 3

2 4 6

3 6 9

#

this is the question

versed sorrel Jun 19, 2021, 8:27 AM

#

I need Help for a questions If CCC + BBB + AAA = CAAB, what are A, B, C =? The digits are distinct and
whole numbers from 1 to 9. Solve the problem using a python script.

pine sinew Jun 19, 2021, 8:49 AM

#

fiery cosmos i need help

def multiplication_table(start, stop):
for x in range(start, stop+1):
for y in range(start, stop+1):
print(str(x*y), end=" ")
print()

multiplication_table(1, 3)

fiery cosmos Jun 19, 2021, 8:50 AM

#

now done lol

#

found out myself

#

but thanks for giving ur time

pine sinew Jun 19, 2021, 8:51 AM

#

fiery cosmos found out myself

i just saw wasnt that tuff what questions are these?

fiery cosmos Jun 19, 2021, 8:51 AM

#

no just forgot to type range there

#

everything was right

fiery cosmos Jun 19, 2021, 9:12 AM

#

pine sinew i just saw wasnt that tuff what questions are these?

i am having a doubt

#

can u tell me what's the difference between using(start,stop+1) and range(start,stop+1)

#

not in range ig it just works on those 2 no. right?

pine sinew Jun 19, 2021, 9:51 AM

#

fiery cosmos can u tell me what's the difference between using(start,stop+1) and range(start,...

The advantage of the range type over a regular list or tuple is that a range object will always take the same (small) amount of memory, no matter the size of the range it represents (as it only stores the start, stop and step values, calculating individual items and subranges as needed). https://docs.python.org/3/library/stdtypes.html#typesseq-range

meager sun Jun 19, 2021, 1:41 PM

#

def main(arr,n,x):
for i in range(0,n):
if(arr[i]==x):
return 1
return -1

n=int(input())
arr=[]
for i in range(n):
arr.append(int(input()))
x = int(input())

main(arr,n,x)

#

whats wrong in code please help me,below screenshot is giving error

#

knotty magnet Jun 19, 2021, 1:44 PM

#

i'm guessing you're supposed to print your answer?

meager sun Jun 19, 2021, 1:45 PM

#

not running other test cases

#

knotty magnet Jun 19, 2021, 1:48 PM

#

you can view the inputs right?

meager sun Jun 19, 2021, 2:06 PM

#

nope only sample testcase data other two are nothing showing

knotty magnet Jun 19, 2021, 2:08 PM

#

what is the quesiton prompt?

meager sun Jun 19, 2021, 2:17 PM

#

#

knotty magnet Jun 19, 2021, 2:18 PM

#

booleans are True and False, not -1 and 1

#

i'm guessing it passed the first case because it checked for equality

meager sun Jun 19, 2021, 2:20 PM

#

can you explain where to change the code

knotty magnet Jun 19, 2021, 2:20 PM

#

you're returning -1 and 1, when you should be returning False and True

meager sun Jun 19, 2021, 2:21 PM

#

already done but not returning

knotty magnet Jun 19, 2021, 2:22 PM

#

i don't understand

meager sun Jun 19, 2021, 2:22 PM

#

knotty magnet you're returning `-1` and `1`, when you should be returning `False` and `True`

def main(arr,n,x):
for i in range(0,n):
if(arr[i]==x):
return True
return False

n=int(input())
arr=[]
for i in range(n):
arr.append(int(input()))
x = int(input())

main(arr,n,x)

#

still not working

knotty magnet Jun 19, 2021, 2:23 PM

#

what doesn't work

meager sun Jun 19, 2021, 2:24 PM

#

output not generating

knotty magnet Jun 19, 2021, 2:27 PM

#

i'm not sure what's wrong

meager sun Jun 19, 2021, 2:48 PM

#

when i run on jupyter notebook it give answer

#

but when i upload on website it dont give output

glacial sail Jun 19, 2021, 3:03 PM

#

#help-lollipop plz help me guys

rotund wigeon Jun 19, 2021, 3:25 PM

#

Hey guys, someone know how to create and queue with search function that will search if their is a spacific value?

merry bolt Jun 19, 2021, 3:40 PM

#

you are given an array of size n count the number of non empty subsets s of a such that the product of numbers in that subset is of the form p1*p2....*pk where p1,p2... pk are prime numbers. since the answer can be huge print it modulo 10^9 +7

merry bolt Jun 19, 2021, 3:41 PM

#

merry bolt you are given an array of size n count the number of non empty subsets s of a su...

Hey guys i had trouble trying to solve this, pls help me out with this

runic tinsel Jun 19, 2021, 4:51 PM

#

merry bolt Hey guys i had trouble trying to solve this, pls help me out with this

uh, almost any positive number is of that form

#

does it say anything about what numbers are in the array?

#

integers?

#

a subset of all ones would arguably not count

#

so i would do 2**n - 2**array.count(1) for example

fiery cosmos Jun 19, 2021, 5:44 PM

#

wicked slate I like The Algorithm Design Manual, I shared a link below. The code is in C but ...

Thanks for your reply🙂

jolly mortar Jun 19, 2021, 6:31 PM

#

runic tinsel uh, almost any positive number is of that form

i think the main restriction is that the number's prime factors appear just once in the factorization

#

45 wouldn't count, because it's of the form 3*3*5, with 3 repeated

#

I think

vocal gorge Jun 19, 2021, 8:51 PM

#

so the question is basically... each element of the array can be considered a multiset of primes. The task is to find the number of ways to choose a subset of all these multisets such that the sum of the subset has no elements with count>1

#

clearly, numbers the prime factorization of which has repeats immediately need to be discarded (they can't appear in any chosen subsets)

#

as for the others, uhh, not sure how to count this...

runic tinsel Jun 19, 2021, 9:03 PM

#

if the contents are not given at all, it's just all non empty subsets

#

not much choice

vocal gorge Jun 19, 2021, 9:15 PM

#

runic tinsel if the contents are not given at all, it's just all non empty subsets

if you have 2 and 2, they together aren't a valid subset because their product is 2^2

#

(that's likely what the constraint is)

frail sentinel Jun 19, 2021, 9:30 PM

#

hello what the deferent between range() and xrange() ?? and thank you so match

austere sparrow Jun 19, 2021, 9:31 PM

#

frail sentinel hello what the deferent between range() and xrange() ?? and thank you so mat...

In Python 3, there's no xrange anymore.

#

You should be using Python 3, unless you have a specialized tool that requires you to use Python 2, which has been deprecated for many years.

#

In Python 2, range makes a list, and xrange makes a small object that produces the numbers when you iterate over it. Python 3's range is Python 2's xrange.

frail sentinel Jun 19, 2021, 9:36 PM

#

austere sparrow In Python 2, `range` makes a list, and `xrange` makes a small object that produc...

i appreciate your help for me

dapper sapphire Jun 19, 2021, 10:00 PM

#

why do some easy questions feel harder lol

noble quiver Jun 19, 2021, 11:28 PM

#

#python-discussion message Need some help with Linked Lists!

frosty coral Jun 20, 2021, 12:45 AM

#

Can I create a python script to know the next number in a sequence without having the formula?

#

What do I need to know first in order to create it? thinkpepe

fiery cosmos Jun 20, 2021, 12:47 AM

#

What kind of sequence pithink

frosty coral Jun 20, 2021, 12:49 AM

#

of numbers example1: 1, 1, 2, 2, -1, 1 -2, -1

#

or 10, 37, 82, 145, 226

#

and know which number is next

fiery cosmos Jun 20, 2021, 1:11 AM

#

Well that's impossible in general. The sequence could be totally random.

#

There might be some machine learning way to make an educated guess pithink

#

Or if you know it's a certain type of sequence e.g. geometric progression you could construct a formula for the next number

dapper sapphire Jun 20, 2021, 1:17 AM

#

If you dont mean [i + 1] that would look at the value at the next index, then you would have to look at the numbers to understand if there is a pattern in the sequence.

#

But it would only work for lists that follow that pattern.

frosty coral Jun 20, 2021, 1:41 AM

#

pepe_why

austere sparrow Jun 20, 2021, 1:41 AM

#

0, 1, 2, 3, 4, 5, 6 can be a start of these sequences:

non-negative integers, in order
palindromes in base 10 https://oeis.org/A002113
first decimal digit of the number: https://oeis.org/A000030
"Numbers that cannot be represented as x*y + x + y, where x >= y > 1. " https://oeis.org/A254636
and, well, infinitely many others 🙂

dapper sapphire Jun 20, 2021, 2:26 AM

#

What would be the time complexity of a matrix where we are accessing elements diagonally?
Would it still be O(n)?

#

We are not accessing all elements.

#

algorithm I wrote only accesses 1, 5, 9 and 3, 5, 7

austere sparrow Jun 20, 2021, 2:28 AM

#

dapper sapphire What would be the time complexity of a matrix where we are accessing elements di...

What do you think?

#

If you have an N * N matrix, how many diagonal elements will there be?

dapper sapphire Jun 20, 2021, 2:42 AM

#

at 1 element, we access 1
at 4 element, we access 4
at 9 element, we access 5
at 16 elements, we access 8
So as elements increase we are accessing less elements

dapper sapphire Jun 20, 2021, 2:43 AM

#

austere sparrow What do you think?

.

#

I wanna say O(log n)

austere sparrow Jun 20, 2021, 2:44 AM

#

dapper sapphire at 1 element, we access 1 at 4 element, we access 4 at 9 element, we access 5 at...

You have to consider what N is in your question. Is it the side of the matrix? Or the total number of elements?

dapper sapphire Jun 20, 2021, 2:44 AM

#

by side of matrix do you mean 2x2, or 3x3 and so on?

austere sparrow Jun 20, 2021, 2:45 AM

#

Yes, the number of rows (or columns).

dapper sapphire Jun 20, 2021, 2:45 AM

#

Yeah you made a good point. I was going by total number of elements in the matrix.

austere sparrow Jun 20, 2021, 2:45 AM

#

austere sparrow You have to consider what `N` is in your question. Is it the side of the matrix?...

(There's no right or wrong answer, you'll just have different complexity functions depending on what you choose as the input size)

#

so if N is the side of the matrix, how many diagonal elements are there?

dapper sapphire Jun 20, 2021, 2:51 AM

#

1x1, 1 element
2x2, 4 elements
3x3, 5 elements
4x4, 8 elements
5x5, 9 elements

So this also looks like we are accessing less elements as N side of the matrix increases.

austere sparrow Jun 20, 2021, 2:52 AM

#

dapper sapphire 1x1, 1 element 2x2, 4 elements 3x3, 5 elements 4x4, 8 elements 5x5, 9 elements ...

Can you derive a formula for the amount of diagonal elements, given the side of the matrix?

dapper sapphire Jun 20, 2021, 2:53 AM

#

I sort of wanted to disagree with using side of the matrix as N, but I can also see why we could use side of matrix as N

#

I'm thinking if we can derive a formula

#

How do you derive a formula for something like this, plot it on a graph?

austere sparrow Jun 20, 2021, 2:56 AM

#

Let's take a look at just one diagonal.

#

If the matrix size is N x N, how many elements are there on a single diagonal?

dapper sapphire Jun 20, 2021, 2:59 AM

#

Oh I see!! O(n)

#

# [1, 2, 3, 4, 5, 6]
# [1, 2, 3, 4, 5, 6]
# [1, 2, 3, 4, 5, 6]
# [1, 2, 3, 4, 5, 6]
# [1, 2, 3, 4, 5, 6]
# [1, 2, 3, 4, 5, 6]
# 6x6 => 12(after we traverse through diagonal elements twice - top-left to bottom-right and bottom-left to top-right)
# 6x6 => 6(When we traverse through top-left to bottom-right) - We access 6 elements, so O(n)

#

Yeah, O(n)

austere sparrow Jun 20, 2021, 3:02 AM

#

So if the matrix size is even, there will be 2N elements, and if it's odd, then 2N - 1. So yes, it's O(N).

dapper sapphire Jun 20, 2021, 3:02 AM

#

if we use N as side of a matrix.

austere sparrow Jun 20, 2021, 3:02 AM

#

And if we use, say, M as the number of elements?

dapper sapphire Jun 20, 2021, 3:03 AM

#

austere sparrow So if the matrix size is even, there will be `2N` elements, and if it's odd, the...

wow you are a lot more precise. I missed to identify that pattern. Good catch on odd and even and 2N - 1!!

austere sparrow Jun 20, 2021, 3:03 AM

#

you could golf it and say 2N - N%2 🙂

dapper sapphire Jun 20, 2021, 3:04 AM

#

austere sparrow And if we use, say, `M` as the number of elements?

This would be O(log m)

austere sparrow Jun 20, 2021, 3:04 AM

#

Why?

dapper sapphire Jun 20, 2021, 3:07 AM

#

at 1 element, we access 1 elements
at 4 element, we access 4 elements
at 9 element, we access 5 elements
at 16 elements, we access 8 elements
at 25 elements, we access 9 elements

#

That is we traverse the matrix twice.

austere sparrow Jun 20, 2021, 3:08 AM

#

If N is the side of the matrix, how are M and N related?

#

(this is not a trick question)

dapper sapphire Jun 20, 2021, 3:10 AM

#

If N is the side of the matrix, then as N increases M also increases. Now the question would be at what rate does M increase in relation to N?

austere sparrow Jun 20, 2021, 3:10 AM

#

if a square matrix has side N, how many elements does it have?

dapper sapphire Jun 20, 2021, 3:11 AM

#

N * N

austere sparrow Jun 20, 2021, 3:11 AM

#

right, so M = N * N

dapper sapphire Jun 20, 2021, 3:11 AM

#

N^2

austere sparrow Jun 20, 2021, 3:12 AM

#

That's how you can find M, given N. How can you find N, given M?

#

i.e. if you know the number of elements in a square matrix, how do you find its side?

dapper sapphire Jun 20, 2021, 3:13 AM

#

If we know total number of elements, then we can do the square root to find number of elements

austere sparrow Jun 20, 2021, 3:13 AM

#

right, so N = sqrt(M)

dapper sapphire Jun 20, 2021, 3:13 AM

#

square root of total number of elements

austere sparrow Jun 20, 2021, 3:14 AM

#

So the number of diagonal elements is on the order of sqrt(M), right?

dapper sapphire Jun 20, 2021, 3:14 AM

#

Oh so it's O(m^1/2) where m is the total number of elements

austere sparrow Jun 20, 2021, 3:14 AM

#

yeah

dapper sapphire Jun 20, 2021, 3:16 AM

#

So when I did

at 1 element, we access 1 elements
at 4 element, we access 4 elements
at 9 element, we access 5 elements
at 16 elements, we access 8 elements
at 25 elements, we access 9 elements

This would instead actually be

at 1 element, we access 1 elements
at 4 element, we access 2 elements
at 9 element, we access 3 elements
at 16 elements, we access 4 elements
at 25 elements, we access 5 elements

We just take into account the first traversal and compute the time complexity from there?

#

because the second traversal is similar to first except with odd where we do - 1.

#

Sort of like if we traverse through a list twice and access each element, we still say O(n).

austere sparrow Jun 20, 2021, 3:21 AM

#

O-notation doesn't describe how many operations exactly it takes, but rather the 'order' on which a function grows.

#

if f(n) is on the order of O(g(n)) (in the colloquial CS meaning. I think this is technically 'big theta' instead of 'big O', I don't remember), then if we let K = f(n)/g(n), as n increases, K will tend to some positive constant. For example, 2n - 1 is on the order of O(n), because (2n - 1) / n will be a better and better approximation of 2 as you increase n.

eager hamlet Jun 20, 2021, 3:23 AM

#

wait sorry

#

wrong channel

#

wrong channel

dapper sapphire Jun 20, 2021, 3:24 AM

#

eager hamlet wrong channel

I think #software-architecture might be a good channel for something like this.
But if you dont have a lot of data(couple of lines), you could put it in the same file. Otherwise, yeah create a separate file something like a json and then access the data.

eager hamlet Jun 20, 2021, 3:25 AM

#

i mean it could go up to about 50 lines

#

this is a game jam so

dapper sapphire Jun 20, 2021, 3:25 AM

#

eager hamlet i mean it could go up to about 50 lines

Yup json.

#

That way your python file stays clean and you can load the json file.

dapper sapphire Jun 20, 2021, 3:27 AM

#

austere sparrow O-notation doesn't describe how many operations exactly it takes, but rather the...

I'm thinking about the portion where you said, rather the 'order' on which a function grows.

#

This part makes sense O-notation doesn't describe how many operations exactly it takes So O(n) doesnt exactly matter if we traversed through all elements once, twice, or three times.

austere sparrow Jun 20, 2021, 3:29 AM

#

Yes. Even if you traverse the diagonal elements 100 times (independent of the size), you still get O(n).

#

Asymptotic complexity doesn't really determine the 'speed' of an algorithm, because it ultimately depends on the implementation. Rather, it describes how some metric (time/space/bandwidth etc.) will change as the input gets larger.

dapper sapphire Jun 20, 2021, 3:43 AM

#

austere sparrow Yes. Even if you traverse the diagonal elements 100 times (independent of the si...

Did you mean, O(n ^ 1/2)? If you were still referring to a matrix to traverse elements diagonally 100 times.

#

Yeah I see exactly what you mean!

austere sparrow Jun 20, 2021, 3:47 AM

#

dapper sapphire Did you mean, `O(n ^ 1/2)`? If you were still referring to a matrix to traverse ...

well, I picked the side of the matrix for n

dapper sapphire Jun 20, 2021, 3:48 AM

#

Oh lol right. I should have pick up on it. Because you referred to number of elements with m. Thanks for all the help and asking questions to help me think through it!

austere sparrow Jun 20, 2021, 3:49 AM

#

👍

dapper sapphire Jun 20, 2021, 4:35 AM

#

I think they should have very easy difficulty on leetcode because some easy are really easy like you are done in couple of minutes, but some easy require a lot more thinking.

#

Wondering if people here felt that some Easy questions were as hard as Medium

dreamy aurora Jun 20, 2021, 10:12 AM

#

dapper sapphire Wondering if people here felt that some `Easy` questions were as hard as `Medium...

agreed. Sometimes I opt for other sites, such as codewars, generally speaking their challenges are ranked quite accurately

sonic field Jun 20, 2021, 3:46 PM

#

is this math

#

bye

grizzled schooner Jun 20, 2021, 4:22 PM

#

sonic field is this math

not pure math, although algorithms often involve some math (for example, when describing big O notation)

fervent saddle Jun 20, 2021, 7:21 PM

#

If a BST has strings as keys, and you’re looking up a string in it, doesn’t it only need to look at each character of the string once?

#

Is that an advantage over a hash table? The hash table first needs to look over each character for the hash value, then it needs to compare the string with at least one other string. So it needs to look through all the characters at least twice

#

So then it depends on how many times you need to compare one character to another in the BST, right?

last fulcrum Jun 20, 2021, 8:22 PM

#

When I look at solutions for arrays, most of the time, when they start iterating from 1 , then stop at the len(array).

I'm wondering why, because obviously, it's wrong. They should rather start from 1 to len(array) + 1. So I'm here to ask, why do they make the end len(array)? Do they want to leave the last element out?

knotty magnet Jun 20, 2021, 8:22 PM

#

to loop over an array by the indices, you start from 0, since the first element is arr[0]

last fulcrum Jun 20, 2021, 8:23 PM

#

pithink I just said they start the range from 1?

knotty magnet Jun 20, 2021, 8:24 PM

#

without anymore context, it's impossible to know why they did what they did ¯_(ツ)_/¯

#

looping to len(arr)+1 will give you index out of bounds though, if you're accessing with arr[i]

last fulcrum Jun 20, 2021, 8:26 PM

#

knotty magnet looping to `len(arr)+1` will give you index out of bounds though, if you're acce...

I think this might be the answer.

knotty magnet Jun 20, 2021, 8:26 PM

#

if you showed the code and problem maybe i could answer more in depth

#

but ranges tend to be open ended, because it makes everything easier

last fulcrum Jun 20, 2021, 8:27 PM

#

I'll tag you when i remember were I saw it.

#

@knotty magnet I saw it on a Google array question. I forgot where the question was but I recreated something.

#

!e

s = 'abacabad'
for i in range(1, len(s)):
    print(f" {s[i]}, this is {i}")

halcyon plankBOT Jun 20, 2021, 8:33 PM

#

@last fulcrum :white_check_mark: Your eval job has completed with return code 0.

001 |  b, this is 1
002 |  a, this is 2
003 |  c, this is 3
004 |  a, this is 4
005 |  b, this is 5
006 |  a, this is 6
007 |  d, this is 7

last fulcrum Jun 20, 2021, 8:34 PM

#

Yes len + 1 gives me an index error. But then I wonder, how would they have gotten the first element?

#

The constraints were that the array starts from 1 to length of the array.

knotty magnet Jun 20, 2021, 8:34 PM

#

no idea without seeing the code ¯_(ツ)_/¯

last fulcrum Jun 20, 2021, 8:36 PM

#

knotty magnet no idea without seeing the code ¯\_(ツ)_/¯

Well look at it like this. You have array. The array has to start from one to the length of the array. How would you get the first element?

knotty magnet Jun 20, 2021, 8:36 PM

#

arr[1]

last fulcrum Jun 20, 2021, 8:36 PM

#

arr[1] would not give you the first element.

#

Look at my code above

knotty magnet Jun 20, 2021, 8:37 PM

#

yes, because python has 0-indexed sequences

#

if you define the array to be 1-indexed, then arr[1] would be the first element

last fulcrum Jun 20, 2021, 8:38 PM

#

knotty magnet if you define the array to be 1-indexed, then arr[1] would be the first element

How are you going to define it. Look at this question from amazon.

Given a string s consisting of small English letters, find and return the first instance of a non-repeating character
in it.
If there is no such character, return '_'

Guaranteed constraints:
1 ≤ s.length ≤ 105.

[output] char

Here, the array must start from one

#

to the end of the list

knotty magnet Jun 20, 2021, 8:39 PM

#

where does it say that

last fulcrum Jun 20, 2021, 8:40 PM

#

Constraints

austere sparrow Jun 20, 2021, 8:40 PM

#

it just says that the length is from 1 to 105

last fulcrum Jun 20, 2021, 8:40 PM

#

Yes that's what I'm saying

#

You have to start from 1

knotty magnet Jun 20, 2021, 8:40 PM

#

huh?

#

the length is just how many elements are in it

#

in this case a string, so how many characters

#

what index the first element of a sequence is depends on the language

austere sparrow Jun 20, 2021, 8:42 PM

#

If the length is 1, the last element is s[0]. If the length is 2, the last element is s[1], and so on.

#

(in Python)

#

A length of 0 means the sequence is empty.

last fulcrum Jun 20, 2021, 8:42 PM

#

Seems my question is being confused.

#

Let me come again

#

I know how to get the length of an array. I know they are zero indexed. I'm asking. If your array has to start from the number one to the length of array, how would you get the last or first element. Because if I do range( 1, len(array)), it would stop before the length of the array.

Here is a question.
Given an array a that contains only numbers in the range from 1 to a.length, find the first duplicate number for which the second occurrence has the minimal index

austere sparrow Jun 20, 2021, 8:46 PM

#

I don't think I understand your question. Are you asking how to iterate over an entire list?

knotty magnet Jun 20, 2021, 8:46 PM

#

if it's zero indexed, range(1, len(array)) will touch the last index

austere sparrow Jun 20, 2021, 8:47 PM

#

Can you perhaps give a link to the problem statement?

last fulcrum Jun 20, 2021, 8:47 PM

#

austere sparrow Can you perhaps give a link to the problem statement?

You have to log in. or create an account

knotty magnet Jun 20, 2021, 8:47 PM

#

then just copy it

austere sparrow Jun 20, 2021, 8:48 PM

#

What the problem is saying is that if e.g. the list has length 4, it can contain numbers 1, 2, 3, and 4.

#

So [1, 2, 2, 3], [2, 4, 1, 3] and [4, 4, 4, 4] are valid inputs.

knotty magnet Jun 20, 2021, 8:48 PM

#

i thought it'd be 1, 2, and 3. 🤔

austere sparrow Jun 20, 2021, 8:48 PM

#

Wait, no, it's a bit confusing.

#

it's not clear whether the range is inclusive in the problem.

last fulcrum Jun 20, 2021, 8:49 PM

#

I've solved the problem though. Here's the full question

#

Given an array a that contains only numbers in the range from 1 to a.length, find the first duplicate number for which the second occurrence has the minimal index. In other words, if there are more than 1 duplicated numbers, return the number for which the second occurrence has a smaller index than the second occurrence of the other number does. If there are no such elements, return -1.

Example

For a = [2, 1, 3, 5, 3, 2], the output should be firstDuplicate(a) = 3.

There are 2 duplicates: numbers 2 and 3. The second occurrence of 3 has a smaller index than the second occurrence of 2 does, so the answer is 3.

For a = [2, 2], the output should be firstDuplicate(a) = 2;

For a = [2, 4, 3, 5, 1], the output should be firstDuplicate(a) = -1.

Input/Output

[execution time limit] 4 seconds (py3)

[input] array.integer a

Guaranteed constraints:
1 ≤ a.length ≤ 105,
1 ≤ a[i] ≤ a.length.

[output] integer

The element in a that occurs in the array more than once and has the minimal index for its second occurrence. If there are no such elements, return -1.

#

I used a set.

#

But that's not my question

#

Voice chat?

#

Maybe I can explain better

#

There

knotty magnet Jun 20, 2021, 8:50 PM

#

sorry, i can't

austere sparrow Jun 20, 2021, 8:50 PM

#

same

last fulcrum Jun 20, 2021, 8:50 PM

#

Ok. I'll write it out

#

You have an array that should start from the range 1

#

So you can't zero index

knotty magnet Jun 20, 2021, 8:50 PM

#

do you mean it's 1-indexed?

last fulcrum Jun 20, 2021, 8:51 PM

#

Yes.

#

How would you get the last element?

knotty magnet Jun 20, 2021, 8:51 PM

#

to use python syntax, arr[len(arr)]

last fulcrum Jun 20, 2021, 8:51 PM

#

and that's my question

#

range would stop before the last element

knotty magnet Jun 20, 2021, 8:52 PM

#

sure

last fulcrum Jun 20, 2021, 8:52 PM

#

and if you do len + 1

#

You get an index error

#

So now, what to do?

knotty magnet Jun 20, 2021, 8:52 PM

#

not if it's 1-indexed though?

last fulcrum Jun 20, 2021, 8:53 PM

#

@austere sparrow did you get it

last fulcrum Jun 20, 2021, 8:53 PM

#

knotty magnet not if it's 1-indexed though?

Mind writing code>

#

?

knotty magnet Jun 20, 2021, 8:53 PM

#

if the array is one indexed, then doing

for i in range(len(arr) + 1):
  arr[i]
``` won't give you an error (from indexing at least)

last fulcrum Jun 20, 2021, 8:54 PM

#

knotty magnet if the array is one indexed, then doing ```py for i in range(len(arr) + 1): ar...

s = 'abacabad'
for i in range(1, len(s)):
    print(f" {s[i]}, this is {i}")

#

What about this?

knotty magnet Jun 20, 2021, 8:55 PM

#

what about it

austere sparrow Jun 20, 2021, 8:55 PM

#

Well, s is 0-indexed

last fulcrum Jun 20, 2021, 8:55 PM

#

For example I want to start my counter from 1. In the real world things start from 1

#

How would I get the first element?

knotty magnet Jun 20, 2021, 8:56 PM

#

how does that relate to that code example

austere sparrow Jun 20, 2021, 8:56 PM

#

!e

s = 'abcde'
for i in range(len(s)):
    print(f" {s[i]}, this is {i+1}")

halcyon plankBOT Jun 20, 2021, 8:56 PM

#

@austere sparrow :white_check_mark: Your eval job has completed with return code 0.

001 |  a, this is 1
002 |  b, this is 2
003 |  c, this is 3
004 |  d, this is 4
005 |  e, this is 5

knotty magnet Jun 20, 2021, 8:56 PM

#

oh, you meant that counter

austere sparrow Jun 20, 2021, 8:56 PM

#

!e
or:

s = 'abcde'
for i, char in enumerate(s, start=1):
    print(f"{char}, this is {i}")

halcyon plankBOT Jun 20, 2021, 8:56 PM

#

@austere sparrow :white_check_mark: Your eval job has completed with return code 0.

001 | a, this is 1
002 | b, this is 2
003 | c, this is 3
004 | d, this is 4
005 | e, this is 5

knotty magnet Jun 20, 2021, 8:57 PM

#

much better

#

but we weren't talking about some real world application right?

last fulcrum Jun 20, 2021, 8:57 PM

#

So I asked this question because of a double for loop

#

In one code example you had to start from 1

#

And end at the length of that array

#

And it was not zero indexed

knotty magnet Jun 20, 2021, 8:58 PM

#

impossible to know why without the code ¯_(ツ)_/¯

last fulcrum Jun 20, 2021, 8:58 PM

#

Yes. That's why I tried explaining

#

And that went badly

#

Anyway @austere sparrow in C I can do a for loop like this.

   for(int i=1;i<=n;i++)  // outer loop  
   {  
       for(int j=i + 1;j<=10;j++)  // inner loop  
       {

How would I write the same in python?

austere sparrow Jun 20, 2021, 9:01 PM

#

you can use range, just add 1 to the right bound.

knotty magnet Jun 20, 2021, 9:01 PM

#

for i in range(1, n+1):
  for j in range(i+1, 11):
    ...

last fulcrum Jun 20, 2021, 9:02 PM

#

Ok thank you.

visual wyvern Jun 20, 2021, 9:46 PM

#

yo how would I print consecutive numbers in a list like
[0, 1, 3, 4, 5, 6] and print 01 and 3456

knotty magnet Jun 20, 2021, 9:48 PM

#

have a variable to store the last number you looked at

visual wyvern Jun 20, 2021, 9:50 PM

#

so would my for loop look like this

#

for index, item in enumerate(list, start=0):

knotty magnet Jun 20, 2021, 9:50 PM

#

are there more constraints on the input?

visual wyvern Jun 20, 2021, 9:50 PM

#

nah

dapper sapphire Jun 21, 2021, 4:13 AM

#

I went to the route of learning Scheme, so I could learn recursion and then do Trees lmao. So I'm thinking of skipping Trees for now and do Easy, Medium, and Hard in all other categories and then tackle Trees afterwards?

Wondering what strategies people have used?

fiery cosmos Jun 21, 2021, 4:49 AM

#

Well eventually you'd hit graphs and trees are just a simple type of graph Daijoubu

dapper sapphire Jun 21, 2021, 6:00 AM

#

Yeah that was my understanding that Trees are simple version of Trees. That one needs to know how Trees work to do well at Graphs.
So I'll save mostly Trees and Graphs for last.

fiery cosmos Jun 21, 2021, 6:33 AM

#

Make me moderator

#

I want to be moderatot

green tendon Jun 21, 2021, 9:02 AM

#

hey im trying to run my py file in vscode but it instantly stops when i try to run it

#

this appears and instantly goes away after a sec

glacial sail Jun 21, 2021, 9:41 AM

#

https://github.com/ninjapolski2/sudokuSolver/blob/master/sudoku.py why Python avoid if statements in last function and also... effect isn't good... if statements don't perform their function... help

GitHub

ninjapolski2/sudokuSolver

Contribute to ninjapolski2/sudokuSolver development by creating an account on GitHub.

last fulcrum Jun 21, 2021, 9:49 AM

#

I'm doing an array question, and would like to use a default dict. I know dicts are not ordered because they are hash tables, but is there anyway to get an ordered dict with defualt dict?

ivory yacht Jun 21, 2021, 9:50 AM

#

As of Python 3.6 dicts are actually semi-ordered - they remember the order of insertion and provide that when you iterate, but there's no way to re-order elements specifically.

#

If you do want a OrderedDict subclass, you can still do that - dicts have a __missing__(self, key) method you can override to implement the behaviour.

#

Defaultdict is basically just this:

class defaultdict(dict):
    def __init__(self, factory, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self.factory = factory

    def __missing__(self, key):
        self[key] = val = self.factory()
        return val

ivory yacht Jun 21, 2021, 9:54 AM

#

last fulcrum I'm doing an array question, and would like to use a default dict. I know dicts ...

So you could also just inherit from OrderedDict, implement __missing__, and whatever else you need.

last fulcrum Jun 21, 2021, 9:55 AM

#

ivory yacht As of Python 3.6 dicts are actually semi-ordered - they remember the order of in...

Ok great. Would this fly for interviews though?

last fulcrum Jun 21, 2021, 9:55 AM

#

ivory yacht So you could also just inherit from OrderedDict, implement `__missing__`, and wh...

I don't understand how to do that, but great ideas

ivory yacht Jun 21, 2021, 9:55 AM

#

It's mentioned here in the lang spec:
https://docs.python.org/3/library/stdtypes.html#mapping-types-dict

#

As of 3.7 it's official, but it was actually implemented in 3.6.

last fulcrum Jun 21, 2021, 9:56 AM

#

In your code example, there was self.factory()
Is that also mentioned in the language spec?

ivory yacht Jun 21, 2021, 9:56 AM

#

I skipped adding an __init__ to setup that attribute, sorry.

#

If you subclassed from OrderedDict instead, you'd have a OrderedDefaultDict.

last fulcrum Jun 21, 2021, 10:01 AM

#

Great. So based upon your information I implemented something. It's very horrible optimised, so can you help me improve it?
Also, I forgot how to subclass etc. Long time since I used classes.

#

Here's the question.

#

Given a string s consisting of small English letters, find and return the first instance of a non-repeating character
in it.
If there is no such character, return '_'.

Example

For s = "abacabad", the output should be
firstNotRepeatingCharacter(s) = 'c'.

There are 2 non-repeating characters in the string: 'c' and 'd'. Return c since it appears in the string first.

For s = "abacabaabacaba", the output should be
firstNotRepeatingCharacter(s) = '_'.

There are no characters in this string that do not repeat.

Input/Output

[execution time limit] 4 seconds (py3)

[input] string s

A string that contains only lowercase English letters.

Guaranteed constraints:
1 ≤ s.length ≤ 105.

[output] char

The first non-repeating character in s, or '_' if there are no characters that do not repeat.

#

Here's my bad code.

from collections import defaultdict

def firstNotRepeatingCharacter(string: str) -> str:
    count_dict = defaultdict(int)
    for char in string:
        count_dict[char] += 1
    for char in string:
        if count_dict[char] == 1:
            return char
    return '_'

ivory yacht Jun 21, 2021, 10:03 AM

#

That would work fine, and you don't even actually need the ordered behaviour there.

#

If you did want to use it, your second loop could just loop over .items() and check if the value is 1. Since it orders by insertion order, it'll produce them in the order they were first added which is what you want.

last fulcrum Jun 21, 2021, 10:05 AM

#

ivory yacht If you did want to use it, your second loop could just loop over `.items()` and ...

This is because the recent improvements right?

#

Because let's say I go to a place that has python 3.5

#

I can't use this

ivory yacht Jun 21, 2021, 10:06 AM

#

Python 3.5 no, but the subclass I showed would work there. And 3.5 is officially end of life, so it's no longer supported anymore by the Python team.

#

Or you could just use setdefault and a plain OrderedDict, you don't need defaultdict's functionality really.

last fulcrum Jun 21, 2021, 10:08 AM

#

ivory yacht Or you could just use `setdefault` and a plain `OrderedDict`, you don't need `de...

Can you explain more?

ivory yacht Jun 21, 2021, 10:08 AM

#

Are you familiar with the dict.setdefault() method?

last fulcrum Jun 21, 2021, 10:08 AM

#

No

#

Never heard of it

ivory yacht Jun 21, 2021, 10:09 AM

#

Or actually you could even just use get().

#

get() returns the key for a specific value, or the default value you specified if it's not present.

last fulcrum Jun 21, 2021, 10:10 AM

#

I remember get(). Forgot about it totally.

ivory yacht Jun 21, 2021, 10:10 AM

#

What setdefault does is it takes the key and the default also. If the key is present it returns the value. If not, it sets it to the default you provide, then returns the default.

#

That way you can do mydict[key] = mydict.get(key, 0) + 1.

last fulcrum Jun 21, 2021, 10:11 AM

#

Thanks. That's really helpful. Also, first time I've seen that syntax with get. I usually see it used to prevent an exception.

fiery cosmos Jun 21, 2021, 10:13 AM

#

ivory yacht That way you can do `mydict[key] = mydict.get(key, 0) + 1`.

Isn't that exactly what orderedict does? In a way this is just "reinventing the wheel" (even though it's not a very complex problem)

ivory yacht Jun 21, 2021, 10:13 AM

#

Yep, but Carrickk needed it to be an OrderedDict too.

fiery cosmos Jun 21, 2021, 10:14 AM

#

I meant defaultdict, but I assume you understood that

#

Wait, doesn't defaultdict preserve order like a normal dict?

ivory yacht Jun 21, 2021, 10:14 AM

#

Only since 3.6, but yes.