boreal crescent May 14, 2024, 5:47 AM

#

#

Thank you 🙏🏾 I activated it

#

And thank you too because I see my problem with the input data

deep veldt May 14, 2024, 6:11 AM

#

can someone give me an example of epoch? im new

boreal crescent May 14, 2024, 6:40 AM

#

deep veldt can someone give me an example of epoch? im new

you mean the graphics ?

deep veldt May 14, 2024, 6:57 AM

#

boreal crescent you mean the graphics ?

what graphics?

boreal crescent May 14, 2024, 6:58 AM

#

the example of epoch ..!! you need the script?

deep veldt May 14, 2024, 7:03 AM

#

boreal crescent the example of epoch ..!! you need the script?

yes

boreal crescent May 14, 2024, 7:05 AM

#

def plot_training_history(self, model):
fig, axs = plt.subplots(2, 1, figsize=(8, 6))
axs[0].plot(model.history.history['loss'], label = 'Training loss')
axs[0].plot(model.history.history['val_loss'], label = 'Validation loss')
axs[0].set_title('Model Training History')
axs[0].set_ylabel('Loss')
axs[0].set_xlabel('Epoch')
axs[0].legend()

i'm work a trading bot

deep veldt May 14, 2024, 7:06 AM

#

thanks

boreal crescent May 14, 2024, 7:07 AM

#

welcome

deep veldt May 14, 2024, 7:09 AM

#

boreal crescent

hey can i know what does loss in that pic mean?

quaint rivet May 14, 2024, 7:09 AM

#

I'm trying to convert my total precipitation data into float. But it's not converting ```py
data['total_precipitation'] = data.total_precipitation.astype(float)
data['date'] = pd.to_datetime(data['id'])

print(type(data['total_precipitation'])) # <class 'pandas.core.series.Series'>

boreal crescent May 14, 2024, 7:11 AM

#

deep veldt hey can i know what does loss in that pic mean?

Epoch X/Y: Indica la current epoch y el total number de epochs (Y) que se están running.

171/171: Represents el total number de training steps executed in the current epoch. In this case, 171 training steps were completed in each epoch.

━━━━━━━━━━━━━━━━━━━━: Provides a graphical visualization of the epoch's progress.

7s 41ms/step: Indicates the average time taken for each training step to execute during the current epoch. In this case, each step took approximately 7 seconds and 41 milliseconds on average.

loss: 0.8608 - val_loss: 2.5735: Shows the loss in the training set and in the validation set at the end of the current epoch. In this case, the loss in the training set was approximately 0.8608, while the loss in the validation set was approximately 2.5735.

boreal crescent May 14, 2024, 7:12 AM

#

quaint rivet I'm trying to convert my total precipitation data into float. But it's not conve...

import pandas as pd

Assuming 'data' is your DataFrame

Check the data types of 'total_precipitation' column

print(data['total_precipitation'].dtype)

Convert 'total_precipitation' to float

data['total_precipitation'] = pd.to_numeric(data['total_precipitation'], errors='coerce')

Check if conversion was successful

print(data['total_precipitation'].dtype) # Should be float64

quaint rivet May 14, 2024, 7:14 AM

#

boreal crescent import pandas as pd # Assuming 'data' is your DataFrame # Check the data types...

When i tried dtype. It's giving me float64. But the problem is when i try to plot it. It's giving me this error Series is not callable

boreal crescent May 14, 2024, 7:18 AM

#

import matplotlib.pyplot as plt

Assuming 'data' is your DataFrame

plt.plot(data['date'], data['total_precipitation'])
plt.xlabel('Date')
plt.ylabel('Total Precipitation')
plt.title('Total Precipitation Over Time')
plt.show()

quaint rivet May 14, 2024, 7:19 AM

#

Yeah it's working

#

Thanks

deep veldt May 14, 2024, 7:20 AM

#

boreal crescent Epoch X/Y: Indica la current epoch y el total number de epochs (Y) que se están ...

i know it's training/validation loss but i dont get what it means

boreal crescent May 14, 2024, 7:25 AM

#

The "val_loss" (loss on the validation set) is a crucial metric for evaluating the model's performance and its ability to generalize to unseen data. If the loss on the validation set is significantly higher than the loss on the training set, it could indicate that the model is overfitting to the training data.

boreal crescent May 14, 2024, 7:26 AM

#

quaint rivet Thanks

you welcome bro

cedar tusk May 14, 2024, 9:18 AM

#

matplotlib is cringe after experiencing ggplot

boreal crescent May 14, 2024, 9:44 AM

#

i use matplotlib a lot, but ggplot its awesome...!!

cedar tusk May 14, 2024, 9:54 AM

#

bro this looks awesome

#

but its overkill

#

xD

marsh jewel May 14, 2024, 10:19 AM

#

guys how to make this work
import pandas as pd
from pandasai import PandasAI
from pandasai.llm.openai import OpenAI
i get this error:

ImportError Traceback (most recent call last)
Cell In[26], line 2
1 import pandas as pd
----> 2 from pandasai import PandasAI
3 from pandasai.llm.openai import OpenAI

ImportError: cannot import name 'PandasAI' from 'pandasai' (C:\Users\****\anaconda3\Lib\site-packages\pandasai_init_.py)

cedar tusk May 14, 2024, 10:35 AM

#

"you need to import a SmartDataframe or a SmartDatalake, check out the docs:
https://docs.pandas-ai.com/en/latest/getting-started/" -gventuri

#

https://github.com/Sinaptik-AI/pandas-ai/discussions/874

GitHub

ImportError: cannot import name 'PandasAI' from 'pandasai' · Sinapt...

Hi all, I was trying to import PandasAI but getting following error. ImportError: cannot import name 'PandasAI' from 'pandasai' (/home/emr-notebook/.local/lib/python3.9/site-package...

cedar tusk May 14, 2024, 10:35 AM

#

marsh jewel guys how to make this work import pandas as pd from pandasai import PandasAI ...

.

marsh jewel May 14, 2024, 10:37 AM

#

cedar tusk .

thanks

spring field May 14, 2024, 1:31 PM

#

I'm happy most of these (so, a tiny subset of NNs) don't look so alien to me anymore 🤗
a considerable improvement since I first saw it

coral flax May 14, 2024, 2:51 PM

#

@serene scaffold I'm using the parser in the 2nd dataframe pic to analyze then first one.

serene scaffold May 14, 2024, 2:51 PM

#

coral flax <@253696366952316929> I'm using the parser in the 2nd dataframe pic to analyze t...

I will not look at pictures of text.

#

to give a dataframe as text, do print(df.head().to_dict('list'))

#

though for full disclosure, I have a meeting that's about to start, so I might not be able to help. Posting the dataframe as text and the code increases your chances of getting help from someone else.

coral flax May 14, 2024, 2:52 PM

#

'dt1': [Timestamp('2001-04-15 23:30:00'), Timestamp('2001-04-17 00:28:00'), Timestamp('2001-04-17 23:44:00'), Timestamp('2001-04-18 23:48:00'), Timestamp('2001-04-19 23:12:00')],
'dt2': [Timestamp('2001-04-16 08:06:00'), Timestamp('2001-04-17 07:28:00'), Timestamp('2001-04-18 07:02:00'), Timestamp('2001-04-19 07:06:00'), Timestamp('2001-04-20 06:56:00')]}

#

the first one is rlly long since it's one data every minute

serene scaffold May 14, 2024, 2:53 PM

#

!paste

arctic wedgeBOT May 14, 2024, 2:53 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

coral flax May 14, 2024, 2:57 PM

#

actually if it's just head:

#

'Activity Counts': [0, 0, 0, 0, 0]
'Date time': [Timestamp('2001-04-15 00:00:00'), Timestamp('2001-04-15 00:01:00'), Timestamp('2001-04-15 00:02:00'), Timestamp('2001-04-15 00:03:00'), Timestamp('2001-04-15 00:04:00')]

spring field May 14, 2024, 3:32 PM

#

how does the notation here

#

expand to this

#

what even is this A[B, C] notation?

wooden sail May 14, 2024, 3:44 PM

#

spring field how does the notation here

it's not standard notation, they probably define it to mean exactly this

#

where did you find this?

spring field May 14, 2024, 3:45 PM

#

well, initially I found it here
https://stanford.edu/~shervine/teaching/cs-230/cheatsheet-recurrent-neural-networks
there seems to be no definition of what it does there

#

it's also used here
https://towardsdatascience.com/grus-and-lstm-s-741709a9b9b1
and here
https://colah.github.io/posts/2015-08-Understanding-LSTMs/
and probably other places

#

those are separate images

#

my only question is about the notation used in the first one and how it corresponds to the much more readable notation in the second

#

dot is probably dot product and * is Hadamard product

wooden sail May 14, 2024, 3:55 PM

#

no way to know if it doesn't say it explicitly 😛 it's probably taken from the original paper

spring field May 14, 2024, 3:55 PM

#

alright, looking at the diagrams, my best guess is that it's supposed to be concatenation

#

but if it's concat, why does the other notation have two different weight matrices sobbing

wooden sail May 14, 2024, 3:57 PM

#

it does look like it means concatenation, which would make sense if the vectors were row vectors, but then the whole multiplication doesn't make sense

#

better look at the original paper

#

all non standard (which is fine, if you explicitly define the symbols. not otherwise though)

past meteor May 14, 2024, 4:03 PM

#

The diagrams for LSTMs and GRU do a lot more damage than help

wooden sail May 14, 2024, 4:03 PM

#

we've had this discussion with zestar before

spring field May 14, 2024, 4:03 PM

#

alright, concat makes sense, but then why do the "expanded" versions have different weight matrices

past meteor May 14, 2024, 4:03 PM

#

Have we?

wooden sail May 14, 2024, 4:03 PM

#

why look at a diagram with like 10 nodes and arrows everywhere when you can represent it all with like 4 equations

past meteor May 14, 2024, 4:03 PM

#

yes hahaha

wooden sail May 14, 2024, 4:03 PM

#

past meteor Have we?

when you asked about a weird network that still worked even when you took the "wrong output", remember?

past meteor May 14, 2024, 4:04 PM

#

wooden sail when you asked about a weird network that still worked even when you took the "w...

ah yes

wooden sail May 14, 2024, 4:04 PM

#

and looking at the equations immediately elucidated that was just like removing half a layer

spring field May 14, 2024, 4:04 PM

#

oh wait, does it mean the weights are also concatenated? is it a vstack?

wooden sail May 14, 2024, 4:04 PM

#

i would honestly recommend to look for the original paper and check the equations there

past meteor May 14, 2024, 4:04 PM

#

The diagrams kind of try and convey the intuition behind why the original author thinks they work

wooden sail May 14, 2024, 4:04 PM

#

or just a better source that is more explicit

spring field May 14, 2024, 4:05 PM

#

I found a paper, all it says is that it's a generalized form...

past meteor May 14, 2024, 4:05 PM

#

As per usual I'll recommend ISLR https://www.statlearning.com/ they have a good chapter on LSTMs and GRU

spring field May 14, 2024, 4:05 PM

#

clearly not the original paper, but cmon

past meteor May 14, 2024, 4:06 PM

#

I think the whole cell state thing is just:

A way to have more parameters and non-linearities
A way to prevent vanishing gradient by having something that is essentially a skip connection

#

Forget gate, output gate, ... they all obfuscate what it's doing

#

welcome to diagrams + time series models

wooden sail May 14, 2024, 4:09 PM

#

einsum is the #1 reason i prefer numpy to matlab

past meteor May 14, 2024, 4:09 PM

#

I'm pretty sure he intended us to use do-calculus

spring field May 14, 2024, 4:11 PM

#

so someone, at some point, just mashed it together

#

this may or may not be the original paper, it's not even called GRU there https://arxiv.org/pdf/1406.1078v3

crisp dragon May 14, 2024, 4:13 PM

#

any one knows pyinstaller?

spring field May 14, 2024, 4:27 PM

#

yeah, ok, so ig those are almost implementation details, you can either concat and apply a larger weight matrix or you can apply separate weight matrices and sum the products

odd meteor May 14, 2024, 5:25 PM

#

I just saw Dune 2 last night. I opened this server now and I'm seeing Lisan Al Gaib 😄 (Muad'Dib)
I hope they make a part 3 'cos they didn't wrap up things properly in part 2.

river cape May 14, 2024, 5:33 PM

#

Hi guys , I have just finished with regression , classification and clustering , what should I do next?

whole pendant May 14, 2024, 7:00 PM

#

computer vision

opaque mantle May 14, 2024, 7:04 PM

#

can anyone suggest a good crash course to learn tensorflow and pytorch, and viewing current data science trends which one is a better alternative to use because I have seeen lots of data scientists using pytorch but in most courses through which i have learn machine learning utilises tensorflow and keras library

#

but tbh i havent much explored the tensorflow too tho

#

just wanted to ask which is best to use these days and easier and faster to learn?

odd meteor May 14, 2024, 7:06 PM

#

river cape Hi guys , I have just finished with regression , classification and clustering ,...

Build personal projects on those topics or test out what you've learned by participating in a ML Hackathon.

spring field May 14, 2024, 7:13 PM

#

opaque mantle can anyone suggest a good crash course to learn tensorflow and pytorch, and view...

pytorch is the one you should use, yes
and yeah, lots of old tutorials use tf, but it's how it is

odd meteor May 14, 2024, 7:14 PM

#

opaque mantle just wanted to ask which is best to use these days and easier and faster to lear...

You'll find your path once you've explored long enough. I started with Keras, then moved to PyTorch. I had to learn PyTorch when I got into ML Research. I tried PyTorch and I haven't looked back since. You might prefer TensorFlow to other frameworks, you just have to start from one, I guess.

In summary: I'd say, "what's easier and faster to learn" is subjective. These frameworks are just tools, so just pick up one already and start 'cooking'. It doesn't matter if you started with a cutlass, hoe, tractor, or shovel, just pick a tool already and get started.

opaque mantle May 14, 2024, 7:15 PM

#

spring field pytorch is the one you should use, yes and yeah, lots of old tutorials use tf, b...

thanks! by chance do you know any crash course on pytorch which is really easy to understand and has almost all concepts covered

spring field May 14, 2024, 7:15 PM

#

unfortunately no, but you can look at the pinned messages here

iron basalt May 14, 2024, 7:18 PM

#

Deep Learning paper of the year material.

#

That everyone cites, but nobody understands or implements themselves.

odd meteor May 14, 2024, 7:19 PM

#

opaque mantle thanks! by chance do you know any crash course on pytorch which is really easy t...

YouTube

Daniel Bourke

Learn PyTorch for deep learning in a day. Literally.

Welcome to the most beginner-friendly place on the internet to learn PyTorch for deep learning.

All code on GitHub - https://dbourke.link/pt-github
Ask a question - https://dbourke.link/pt-github-discussions
Read the course materials online - https://learnpytorch.io
Sign up for the full course on Zero to Mastery (20+ hours more video) - https:/...

▶ Play video

YouTube

Patrick Loeber

PyTorch Tutorial 01 - Installation

New Tutorial series about Deep Learning with PyTorch!
⭐ Check out Tabnine, the FREE AI-powered code completion tool I use to help me code faster: https://www.tabnine.com/?utm_source=youtube.com&utm_campaign=PythonEngineer *

Part 01: Installation

I show you how I install PyTorch on my Mac using Anaconda. Installation on Linux or Windows can be ...

▶ Play video

spring field May 14, 2024, 7:19 PM

#

btw, I find the diagrams in this paper pretty good https://proceedings.mlr.press/v63/gao30.pdf

iron basalt May 14, 2024, 7:19 PM

#

"All you need is all you need"

#

Did I win ML?

#

Good diagrams make things way easier, but they are hard to come up with and often not the best idea. Symbolic / algebraic is pretty good.

odd meteor May 14, 2024, 7:20 PM

#

It's now boring tbh. I thought I'm the only one who's tired of that

iron basalt May 14, 2024, 7:25 PM

#

We peeked with the MOG

#

Feynman diagrams are probably a top ten.

opaque mantle May 14, 2024, 7:28 PM

#

odd meteor You'll find your path once you've explored long enough. I started with Keras, th...

woah thats a good journey!

opaque mantle May 14, 2024, 7:28 PM

#

odd meteor 1. https://www.youtube.com/watch?v=Z_ikDlimN6A&t=7110s&pp=ygUVcHl0b3JjaCBkYW5pZW...

tysm!

spring field May 14, 2024, 7:34 PM

#

all things considered this GRU model is avoiding learning anything...

#

well, I was just gonna ask about that as well

#

ohhh, yk what

#

the first graph is pretty misleading

agile cobalt May 14, 2024, 7:41 PM

#

I'd really check for bugs in your implementation, the train accuracy going down is quite worrying?

opaque mantle May 14, 2024, 8:04 PM

#

any idea why its not working

#

i downloaded it and even installed everything

#

how can i check it

spring field May 14, 2024, 8:09 PM

#

hit the windows key and start typing: "environment variables", a window will pop up, click on the button that say "environment variables", a window will pop up, find a variable named Path, double click it and create a new entry that is a path to your conda installation

opaque mantle May 14, 2024, 8:09 PM

#

i can see this path

spring field May 14, 2024, 8:09 PM

#

it's echo on win

agile cobalt May 14, 2024, 8:10 PM

#

$PATH does not works on windows though
either echo %PATH% or just use set with no arguments
(both assuming cmd.exe, idk about powershell)

opaque mantle May 14, 2024, 8:10 PM

#

where do i do this

spring field May 14, 2024, 8:11 PM

#

spring field hit the windows key and start typing: "environment variables", a window will pop...

^

opaque mantle May 14, 2024, 8:11 PM

#

oki ill try this

agile cobalt May 14, 2024, 8:11 PM

#

opaque mantle where do i do this

Win+R -> cmd -> echo %PATH%

spring field May 14, 2024, 8:12 PM

#

spring field all things considered this GRU model is avoiding learning anything...

yeah, idk

opaque mantle May 14, 2024, 8:12 PM

#

spring field hit the windows key and start typing: "environment variables", a window will pop...

here

#

click new first?

spring field May 14, 2024, 8:12 PM

#

yep

opaque mantle May 14, 2024, 8:13 PM

#

like this?

spring field May 14, 2024, 8:13 PM

#

spring field yeah, idk

        # (B, Seq, F) => (Seq, B, F)
        x_seq = x_unpacked.permute(1, 0, 2)
        out = []
        for x_t in x_seq:
            z = torch.sigmoid(
                (self.W_z @ x_t[..., None])[..., 0]
                + (self.U_z @ hidden[..., None])[..., 0]
                + self.b_z
            )
            r = torch.sigmoid(
                (self.W_r @ x_t[..., None])[..., 0]
                + (self.U_r @ hidden[..., None])[..., 0]
                + self.b_r
            )
            new_h = torch.tanh(
                (self.W_h @ x_t[..., None])[..., 0]
                + (self.U_h @ (r * hidden)[..., None])[..., 0]
                + self.b_z
            )
            hidden = z * hidden + (1 - z) * new_h
            out.append(hidden)
        out_seq = torch.stack(out)
        out_seq = out_seq.permute(1, 0, 2)

it should be a textbook implementation of GRU

spring field May 14, 2024, 8:13 PM

#

opaque mantle like this?

perhaps, if that's the right path, then yes

opaque mantle May 14, 2024, 8:14 PM

#

yes

spring field May 14, 2024, 8:14 PM

#

you'll need to restart whatever shell you were using after you apply the changes

opaque mantle May 14, 2024, 8:14 PM

#

anaconda3, right?

spring field May 14, 2024, 8:15 PM

#

I actually don't know that

opaque mantle May 14, 2024, 8:16 PM

#

still this

spring field May 14, 2024, 8:16 PM

#

did you restart it

opaque mantle May 14, 2024, 8:16 PM

#

restart what?

spring field May 14, 2024, 8:16 PM

#

the shell

opaque mantle May 14, 2024, 8:16 PM

#

yeah i closed it and opened it again

spring field May 14, 2024, 8:16 PM

#

it also might be possible the executable is somewhere in a ./bin directory

opaque mantle May 14, 2024, 8:17 PM

#

😭 how do i check that?

#

oki

#

here

#

here

#

can i tell in simpler words? 😅

opaque mantle May 14, 2024, 8:20 PM

#

opaque mantle like this?

u mean in this?

#

which one?

#

sorry im not good at terminal stuff and all

#

ok

#

copy and what do i do with it?

#

oo alright

#

omg finally!

#

tysm lisan and matiiss!

#

now everything will workfine, right?

#

oh

spring field May 14, 2024, 9:21 PM

#

yk what

#

yeah, there might be data leakage

#

or not

fallow coyote May 14, 2024, 10:11 PM

#

I'm currently working on a mini project extracting and manipulating data from stock data (S&P500) to practice using numpys and pandas, while I learn the prereq maths for ML. Anyone recommend any features to add to this projects to help improve my skills (intermediate to advanced)?

opaque mantle May 14, 2024, 10:15 PM

#

is it normal to take like 4-5 seconds to get this output?

#

i even checked time like this but i dont think the creation of the tensor is taking the time

agile cobalt May 14, 2024, 10:21 PM

#

importing torch can take some time, ```py
import time
start = time.time()
import torch
end = time.time()

print(end - start)

#

you may want to use Jupyter Notebooks or IPython to avoid having to re-import libraries and re-load data

left tartan May 14, 2024, 10:22 PM

#

fallow coyote I'm currently working on a mini project extracting and manipulating data from st...

Predictions / forecasting, economic data, technical indicators, etc

fallow coyote May 14, 2024, 10:25 PM

#

left tartan Predictions / forecasting, economic data, technical indicators, etc

I've been thinking of making a SMA. I guess that may be a good start

#

Ill try to figure out myself, then get pissed off and find someone whos already done it XD

left tartan May 14, 2024, 10:27 PM

#

fallow coyote Ill try to figure out myself, then get pissed off and find someone whos already ...

Sma is like one pandas operation, don't stress it

#

The interesting question is about forecasting

#

For example; how accurate is SMA as a predictor

fallow coyote May 14, 2024, 10:29 PM

#

left tartan The interesting question is about forecasting

The forecasting I think is too difficult atm as my coding and maths skills arent to that level yet. Unless, you recommend I just go for it and see how it goes

left tartan May 14, 2024, 10:30 PM

#

fallow coyote The forecasting I think is too difficult atm as my coding and maths skills arent...

Many things aren't so complicated to 'do' because it's simply running some code. Like: a linear regression using sklearn is maybe a dozen lines of code. You don't have to know the math, but just how to assemble the code. Once you can do that, then lots of things are possible. But, it's daunting at first

fallow coyote May 14, 2024, 10:34 PM

#

Ill give it a go. So I'll need to learn how to use the sklearn libraries or are there other libraries I could use?

left tartan May 14, 2024, 10:37 PM

#

fallow coyote Ill give it a go. So I'll need to learn how to use the sklearn libraries or are ...

Sklearn is the thing to learn. Once you know that pattern, you can then use other libraries (ie: Xgboost)

#

But, being able to fit a linear regression model to a historical data set, and forecast the next N days is probably a good starting point

fallow coyote May 14, 2024, 10:41 PM

#

left tartan Sklearn is the thing to learn. Once you know that pattern, you can then use othe...

Ill have a go with it. Thing is with me, I have a bad habit of needing to know everything before I get onto a project when it'd be better if I just learn the 'basics' and then get on with it

#

I've only really in the past few months trying to develop that mindset

left tartan May 14, 2024, 10:42 PM

#

fallow coyote Ill have a go with it. Thing is with me, I have a bad habit of needing to know e...

Oh, coding is like cave diving; you sometimes have to dive in and figure out where you are

#

There's an art to finding projects that are hard enough but solvable.

#

What's nice is: once you make it work, you can then take a step back and ask: 'why did it work?'

opaque mantle May 14, 2024, 10:45 PM

#

agile cobalt you may want to use Jupyter Notebooks or IPython to avoid having to re-import li...

tysm for this suggestion! yeah it was taking 7.5s lol

fallow coyote May 14, 2024, 10:46 PM

#

left tartan There's an art to finding projects that are hard enough but solvable.

Should I do mini projects or focus on one at a time and use as many different functions and skills on that one project (if that makes sense)?

opaque mantle May 14, 2024, 10:46 PM

#

jupyter notebook is gamechanger!

left tartan May 14, 2024, 10:47 PM

#

fallow coyote Should I do mini projects or focus on one at a time and use as many different fu...

What ever you find more fun

#

As long as you're writing code, you're learning.

fallow coyote May 14, 2024, 10:47 PM

#

I always feel doing multiple projects overwhlems me as I feel a need to complete a project fully

left tartan May 14, 2024, 10:48 PM

#

fallow coyote I always feel doing multiple projects overwhlems me as I feel a need to complete...

Fully might be a step too far, at least for me. Nothing ever is fully done.

fallow coyote May 14, 2024, 10:49 PM

#

When I mean fully, I mean the program I created has fulfilled the criteria I wanted it to fulfill

left tartan May 14, 2024, 10:53 PM

#

fallow coyote When I mean fully, I mean the program I created has fulfilled the criteria I wan...

That's fair, I wouldn't worry about your method: you have so much to learn (we all do) all that matters is that you're learning something

fallow coyote May 14, 2024, 10:54 PM

#

left tartan That's fair, I wouldn't worry about your method: you have so much to learn (we a...

Thank you so much for your help

spring field May 14, 2024, 10:56 PM

#

it actually kiiinda seems to be pretty big yes, when running locally on the CPU it only took 200 samples yet the graphs looked normal at least

#

there are like 800 batches, each containing 32 sequences

#

though I use 256 sequences per batch to speed up the training

#

100

serene scaffold May 14, 2024, 11:00 PM

#

170 epochs? 🤯

spring field May 14, 2024, 11:03 PM

#

I did a 1000 epochs at some point...

#

lol

wary thistle May 14, 2024, 11:03 PM

#

could someone pls help me in my #1240058452647084103 discussion please, I am unsure about this normal distribution i am using

iron basalt May 14, 2024, 11:08 PM

#

left tartan Fully might be a step too far, at least for me. Nothing ever is fully done.

Software can be done, but it's only done when you decide it's done. You can extent it forever, and it will degrade over time, not improve, if it starts to run away from the original. Make a new project instead.

#

Or at least, it could be, if the platform it's built on did not constantly change things that break it (outside of your control and the real reason software is never considered complete).

iron basalt May 14, 2024, 11:30 PM

#

Unfortunately can't ever tell if Google is not just faking it. Their previous lying ruined it for themselves. With cherry picking I can make it seem as if I have solved any problem in the world.

#

They need to start showing a lot of cases in which it fails too.

#

Boston Dynamics started doing this and it helped them a lot.

#

(Robotics loves cherry picking, it's almost all faked)

#

(And when it's not, it's not impressive for those outside of robotics)

#

They showed that their demo videos are cherry picked, but since they are admitting it and showing how it fails most of the time, it's fine.

#

No need to be paranoid about showing off the failure cases.

#

Yeah, they went even further.

#

If you look at the old Google I/Os where they were presenting their AI assistent way back in like 2015, it made it look like ChatGPT-4o is now.

#

It was all faked though.

#

And that is why it was also never released.

#

They then made the excuse that it was because of AI safety and the public was not ready for it or some nonsense.

#

Yeah, and they give out the free version that you can try out, which is the most important step.

#

It's like with a lot of ML papers. I'm in the "show code" camp. Or it at least has to be very simple for me to implement myself either from scratch or with a small tweak in like Pytorch or something.

#

Also because bugs are easy in ML, and there have been cases where someone did show the code and it was just bugged and either did not work because of the reasons they thought, or when fixed worked even better

#

A funny thing about bugs in ML is that they can often introduce small (pseudo)random noise, which can often improve its results.

craggy agate May 14, 2024, 11:42 PM

#

Hey folks, I am in high school and have just completed my research paper which I will be publishing to github if everything checks out, it is my first time doing something like this and would like to know your thoughts! Here is its copy - https://docs.google.com/document/d/1RMx3emOys4p3OvgNvbzYJY-gE3c-1SshNoOMt4wqC6w/edit?usp=sharing
Would love for y'all to share your thoughts, thanks!

Google Docs

Automated Malaria Parasite Detection Using Convolutional Neural Net...

iron basalt May 14, 2024, 11:44 PM

#

Software really needs it because it's not tangible.

craggy agate May 14, 2024, 11:48 PM

#

I see, I will look into the rules of research papers, thanks 👍

#

I see

serene scaffold May 14, 2024, 11:53 PM

#

imagine reading the literature
LOL

craggy agate May 14, 2024, 11:56 PM

#

serene scaffold imagine reading the literature LOL

?

serene scaffold May 14, 2024, 11:56 PM

#

craggy agate ?

it's a shitpost

craggy agate May 14, 2024, 11:57 PM

#

serene scaffold it's a shitpost

What is? "review of literature"?

spring field May 14, 2024, 11:57 PM

#

craggy agate What is? "review of literature"?

imagine reading the literature
LOL

#

alright, this is rigged

craggy agate May 14, 2024, 11:58 PM

#

spring field alright, this is rigged

what are you on about 💀

serene scaffold May 14, 2024, 11:59 PM

#

craggy agate What is? "review of literature"?

it's where you read all (or many of) the academic papers that relate to a topic of interest, so that you know basically everything there is to know about the state of the art for that """art"""

#

my first contribution to ""the body of knowledge"" was about developing a system for automating it https://www.sciencedirect.com/science/article/pii/S1532046421002999

craggy agate May 15, 2024, 12:00 AM

#

serene scaffold it's where you read all (or many of) the academic papers that relate to a topic ...

Oh, so you are recommending I read a bunch of academic papers to get the style of writing and editing required for said papers?

serene scaffold May 15, 2024, 12:00 AM

#

craggy agate Oh, so you are recommending I read a bunch of academic papers to get the style o...

idk, I was just shitposting about literature reviews.

craggy agate May 15, 2024, 12:00 AM

#

serene scaffold idk, I was just shitposting about literature reviews.

alright

#

I see

serene scaffold May 15, 2024, 12:00 AM

#

when I said "imagine reading the literature", the subtext is that being up-to-date is cumbersome, and that academics secretly don't do it.

spring field May 15, 2024, 12:01 AM

#

I always read a paper before bed, helps me fall asleep /s

serene scaffold May 15, 2024, 12:02 AM

#

spring field I always read a paper before bed, helps me fall asleep /s

when you read a paper, do you speculate about how many of the authors and reviewers are native english speakers?

craggy agate May 15, 2024, 12:02 AM

#

serene scaffold when I said "imagine reading the literature", the subtext is that being up-to-da...

ohk i get it

spring field May 15, 2024, 12:03 AM

#

serene scaffold when you read a paper, do you speculate about how many of the authors and review...

(I am yet to actually properly read a paper 😬 I have a couple I know I should actually read because they're more less part of course content)

serene scaffold May 15, 2024, 12:04 AM

#

spring field (I am yet to actually properly read a paper 😬 I have a couple I know I should a...

my advice for reading papers: don't feel bad if none of it makes sense after your first pass. or seventh pass. a lot of academics are (a) writing for an audience that presumably knows more than you and (b) are profoundly unskilled writers.

spring field May 15, 2024, 12:11 AM

#

spring field alright, this is rigged

both run the same code, this is an RNN, not even a GRU
the dataset is basically a bunch of text
the first one, runs locally on my computer on the cpu, only uses like the first 200 samples from the dataset
the second one, runs on a gpu on cloud, using all, idk, like 25k samples from the dataset and it looks terrible

#

yep, literaly copy pasted it

#

and I was using pretty much (but like, literally the same in terms of reproducibility) RNN implementation on numerical time series data and it worked pretty ok
so maybe it is the loss function, maybe it is the metric

#

you mean batch size?

#

uh oh, the test loss in the first one is way way above

#

relatively speaking, the difference is much greater in the first graph

#

I know, I don't like it either, we're just given some template code and that's how it be there...

#

yeah yeah, I have changed it, I was just running the older code cuz I thought it was working somewhat at least.... 😐

iron basalt May 15, 2024, 12:22 AM

#

serene scaffold my advice for reading papers: don't feel bad if none of it makes sense after you...

I want more people to make papers like Schmidhuber, with the funky iconic diagrams / images. Not because it's useful, but because it makes me more willing to read it.

spring field May 15, 2024, 12:26 AM

#

yeahhh, it's just overfitting

iron basalt May 15, 2024, 12:33 AM

#

Simplicity and maintainability in software often fails to account for the fact that how simple, readable, or easy to maintain something is, depends also largely on the skill of person reading it. This is obvious for mathematics, for example, a novice looking at something like a differential equation might consider it complex nonsense, but to someone more skilled it's actually much easier to understand than via plain English. However, it seems that this same realization has not been made for software at large.

spring field May 15, 2024, 12:35 AM

#

it's as basic an RNN as an RNN can be basic

iron basalt May 15, 2024, 12:35 AM

#

You can optimize for simplicity for a novice, but like in mathematics, this can greatly hold it back.

#

(But if what you are making is simple software (e.g. mostly business logic stuff, just an app), then this totally makes sense)

spring field May 15, 2024, 12:36 AM

#

yeah, can do that

iron basalt May 15, 2024, 12:38 AM

#

iron basalt (But if what you are making is simple software (e.g. mostly business logic stuff...

(But there is also a lot of stuff in software that has never really been measured, just promised improvement of readability and maintainability (e.g. Java style OOP, where are all the supposed gains? (i'm in the mood for hot takes)))

spring field May 15, 2024, 12:40 AM

#

alright firNotes thanks for the help btw

#

ah, it's 4am here 😁 gn, thanks again

#

howwww is this happening sobbing
batch size: 8, embedding dimensions: 32, rnn hidden size: 64
how much simpler can it possibly get

#

yeah, nope, I tried with torch.nn.RNN, it just wouldn't budge, there's clearly an issue somewhere else then

#

will figure it out tomorrow
it's probably something to do with the text then, maybe it's just not what RNNs are good for... it's trying to predict text, training on quotes, tries to predict the next word

#

why text? I didn't pick the data, but now that I think back to my previous stuff working on that other data, that only had 3 inputs and 2 outputs and it was struggling with pretty much cyclic data, this has a ton of inputs and a couple thousand outputs technically, idk... kind of a multivariate time series

#

oh well, I'm fairly certain my implementation of the layers is correct though

vestal spruce May 15, 2024, 2:01 AM

#

is anyone familiar with python simple diarizer package? I'm getting speechbrain error

#

This is the error messge

SpeechBrain could not find any working torchaudio backend. Audio files may fail to load. Follow this link for instructions and troubleshooting: https://pytorch.org/audio/stable/index.html
torchvision is not available - cannot save figures
SpeechBrain could not find any working torchaudio backend. Audio files may fail to load. Follow this link for instructions and troubleshooting: https://pytorch.org/audio/stable/index.html
Traceback (most recent call last):
  File "C:\Users\HP\Pycharmprojects\proyek_ta\test.py", line 6, in <module>
    from simple_diarizer.diarizer import Diarizer
  File "C:\Users\HP\Pycharmprojects\proyek_ta\.venv\Lib\site-packages\simple_diarizer\diarizer.py", line 9, in <module>
    from speechbrain.pretrained import EncoderClassifier
ModuleNotFoundError: No module named 'speechbrain.pretrained'

Process finished with exit code 1

sage sparrow May 15, 2024, 2:50 AM

#

How can I test data for accuracy or significance of variable-to-variable dependency?

#

I have a weather dataset but I'm not sure if it's me or if there's data not showing significance to the temperature. Should I visualize it or use some algorithm? I'm lost so any guidance is greatly appreciated here

gritty vessel May 15, 2024, 3:00 AM

#

``
Model: "model_1"

Layer (type) Output Shape Param #

input_5 (InputLayer) [(None, 60, 55, 6)] 0

conv2d_17 (Conv2D) (None, 60, 55, 32) 1760

max_pooling2d_11 (MaxPooli (None, 30, 27, 32) 0
ng2D)

conv2d_18 (Conv2D) (None, 30, 27, 64) 18496

max_pooling2d_12 (MaxPooli (None, 15, 13, 64) 0
ng2D)

up_sampling2d_5 (UpSamplin (None, 30, 26, 64) 0
g2D)

up_sampling2d_6 (UpSamplin (None, 60, 52, 64) 0
g2D)

conv2d_19 (Conv2D) (None, 60, 52, 32) 18464

conv2d_20 (Conv2D) (None, 60, 52, 1) 33

=================================================================
Total params: 38753 (151.38 KB)
Trainable params: 38753 (151.38 KB)
Non-trainable params: 0 (0.00 Byte)
``
my input is 60,55,6 and output shape is 60,55,1 but why in end its coming to 60,52,1 how can i configure it?

#

I have faced this multiple times can you guys give me a direction so that i can figure out why there is shape mismatch

gritty vessel May 15, 2024, 3:12 AM

#

sage sparrow How can I test data for accuracy or significance of variable-to-variable depende...

try visualization, correlation analysis

jaunty helm May 15, 2024, 4:07 AM

#

sage sparrow How can I test data for accuracy or significance of variable-to-variable depende...

plotting and just looking is unironically very good (monke brain good at spot patterns)
there are also some statistical tests (pearson correlation being one of the most well-known), but dont just rely on them

bold timber May 15, 2024, 6:44 AM

#

The question box on my chat gpt account is very small. Does anyone know how to solve this issue?

spiral bay May 15, 2024, 6:59 AM

#

Does anybody know an fast way to check models on Huggingface for their context size?

spring field May 15, 2024, 8:03 AM

#

yes, accuracy is measured as number of correct predictions over number of all predictions made

#

there are 11k tokens

#

embedding (I'll assume at least 1 weight + bias, it's the built-in one) + RNN (3 params there (2 weights, 1 bias)) + linear/fc layer (so 1 weight, 1 bias)

#

I can get the state dict ig

#

or just len of .parameters()?

spring field May 15, 2024, 8:12 AM

#

spring field or just len of .parameters()?

it's 6

#

I am generating text along the graphs

#

love amy repository moonlight murders .happiness asses shakespeare companions hideous ephemeral tissue dent laziness stale cliche anxious below theoretical distract paul ago saja buys entry goats napoleon pets fireflies fireflies yous lang glows innovation stealing ,god dearly bind profanity freaking trained recalled races princesses claw copying visualize pulses deciphered ,all ,god begun begun picks instagram tricks praised traffic conclude crash introverts dogma
this is from the GRU network (which has like 2x the params), but it shows a similar graph anyway

rigid cape May 15, 2024, 8:17 AM

#

Hi guys, I would like to make a Facial Recognition system using tensorflow. I know basic statistics, Machine learning and data analysis. As a Beginner to Deep Learning , what am I supposed to do ? Do I see a Youtube Channel and implement it directly or do I have to learn the fundamentals? I also know Web backend for the application side of things .

spring field May 15, 2024, 8:19 AM

#

yeah, it's weighted too

#

wait wait, when you say parameters, what do you mean exactly?

#

alright, slightly confused, do you mean like features or like torch.nn.Parameter parameters?

#

alright, I was counting Parameter objects

#

there are like this many parameters then 2197498

spring field May 15, 2024, 8:25 AM

#

spring field howwww is this happening <:sobbing:1107868633390125066> batch size: 8, embeddin...

this one had 1098234

#

the GRU model is at 4.5M in its standard config if you will

#

alright

spring field May 15, 2024, 8:48 AM

#

idk, if I don't do that, it just tries really hard to predict mostly the ones that are most frequent in the dataset

#

alr, I'll find a paper

spring field May 15, 2024, 8:51 AM

#

spring field idk, if I don't do that, it just tries really hard to predict mostly the ones th...

although... I'll give it some time, at least it's progressing the right way 😁

#

this makes sense too

spring field May 15, 2024, 10:07 AM

#

found a paper, from what I can tell it suggests using a bunch of hidden layers, meanwhile I'm here using one ducky_skull

#

if anyone's interested: https://www.isca-archive.org/interspeech_2010/mikolov10_interspeech.html

dull flare May 15, 2024, 10:26 AM

#

hello

honest sierra May 15, 2024, 10:41 AM

#

Does anyone know some free API for accessing a LLM through Python? I took a look into ChatGPT but it costs money. I tried running Llama2 using Ollama in my computer but it's too slow. Is there any alternative solution?

jaunty helm May 15, 2024, 10:46 AM

#

honest sierra Does anyone know some free API for accessing a LLM through Python? I took a look...

ChatGPT but it costs money
actually it doesn't have to, there's a free tier according to here

honest sierra May 15, 2024, 10:47 AM

#

not really. you need tokens which they dont give you. you have those limits but you have to pay for the tokens

jaunty helm May 15, 2024, 10:50 AM

#

honest sierra not really. you need tokens which they dont give you. you have those limits but ...

ah I see, I stand corrected (twice in such a short amount of time too 😦 )
I doubt you'll find any online solution that doesn't cost you money, but you can try running a smaller model

honest sierra May 15, 2024, 10:50 AM

#

thank u!

jaunty helm May 15, 2024, 10:51 AM

#

how many parameters is the llama2 you're running?
it gets as small as 7b iirc

jaunty helm May 15, 2024, 10:51 AM

#

jaunty helm how many parameters is the llama2 you're running? it gets as small as 7b iirc

as small as 7b in the llama series anyway
the phi series have even fewer params

dull flare May 15, 2024, 10:52 AM

#

1.Collect and pre-process datasets of facial expressions captured in different contexts (e.g., cultural variations, lighting conditions).

2.Train separate models for each context or develop a multi-context model that adapts its predictions based on additional input data (e.g., location, cultural background).

these are 2/5 of the tasks i received from my internship provider

honest sierra May 15, 2024, 10:52 AM

#

jaunty helm how many parameters is the llama2 you're running? it gets as small as 7b iirc

how can I check this

jaunty helm May 15, 2024, 10:52 AM

#

honest sierra how can I check this

not sure, how did you download it?

dull flare May 15, 2024, 10:52 AM

#

how can i provide multiple context to an image

honest sierra May 15, 2024, 10:53 AM

#

jaunty helm not sure, how did you download it?

ollama pull llama2

#

first downloaded and set up ollama ofc

jaunty helm May 15, 2024, 10:55 AM

#

honest sierra `ollama pull llama2`

from here I'd assume you're already running 7b (the smallest one)
try the phi series ig

#

in their github in the examples, it shows the gemma model which has even less params than phi

#

I'm not sure if this actually helps, but there are other less mainstream architectures like rwkv, mamba, etc. which might use less computing power (really am not sure on this)

honest sierra May 15, 2024, 11:01 AM

#

im gonna see how phi3 goes. thank u for helping!

boreal nest May 15, 2024, 11:19 AM

#

Hello everyone, sharing a notebook on association rule mining . https://www.kaggle.com/code/jaepin/association-analysis-using-grocery-dataset
I'm curious as to what the minimum support is for large transactional datasets. I've seen other do as low as 1% for apriori, but it seems to be too low in my opinion and in consequence they get trivial or inexplicable associations. Does anyone here work in retail and or have performed association rule mining at work before? what are your thoughts/comments?

Association Analysis using Grocery Dataset

Explore and run machine learning code with Kaggle Notebooks | Using data from Retail Transactions Dataset

potent sky May 15, 2024, 11:40 AM

#

honest sierra Does anyone know some free API for accessing a LLM through Python? I took a look...

cohere!

sage sparrow May 15, 2024, 12:04 PM

#

jaunty helm plotting and just looking is unironically very good (monke brain good at spot pa...

Thank you guys @jaunty helm @gritty vessel

gritty vessel May 15, 2024, 12:04 PM

#

sage sparrow Thank you guys <@208918673178492929> <@463988925040558080>

Solved?

sage sparrow May 15, 2024, 12:05 PM

#

gritty vessel Solved?

Just woke up lol but I've got a better idea of how I'll go about it

honest sierra May 15, 2024, 12:37 PM

#

Would Google Colab work with this?

deep veldt May 15, 2024, 1:17 PM

#

the smaller dataset is the more epoch the model has to learn for high accuray right?

cerulean abyss May 15, 2024, 1:17 PM

#

How about aws to use ia

mild dirge May 15, 2024, 1:18 PM

#

deep veldt the smaller dataset is the more epoch the model has to learn for high accuray ri...

When you have less data, the model is more likely to overfit on this small dataset, as there is not enough data to generalize.

tepid rose May 15, 2024, 1:26 PM

#

Guys how knows about the asji server or something like that?

#

Sorry I checked the name it is asgi

warm trellis May 15, 2024, 1:50 PM

#

Hey everyone, why would you want to embed the tabular data when applying transfer learning for a regression task?

spring field May 15, 2024, 1:52 PM

#

so, after reading that paper (#data-science-and-ml message), which suggested between 30 and 500 hidden units, I picked 50, so now I have 50 GRU layers each with a hidden size of 256, embedding dims is only 128 though, in total it's ~24M parameters
either way, an epoch takes about 10 to 15 minutes now, but at least the graphs make sense (this is after epoch 10)
the sentences are maybe not the best ones yet, but at least the graphs...

consider . until until until until is . . . .
your sometimes , . . . . . . . . . . . . . . . . . . real point dark . . . . . . . . . . the . . a rather the the , , the is the affects the the rest his the
you . . . . . . .
all humor , . . . . strength , . . earth . . . . . the just the . , . . . . . . . . the nature nature and . . . . so that . . , , like [any] than day waiting before go it . . . . . .
when . . . . . , . . , . . . , . . . .
[any] . . . .
ah [any] a . . . . . . . . . . . of all . . . . no all well make
it . . the w , , [any] . . . . . the happy to person the [any] [any] person . . . . . and you has w w a . . . . . . . . . . . . . . . w a . . . is . . . . . . . . .
i . . . . . . . . .
[any] . . . . . . . . .
my work the an the . . . .
different a the . . the make w raised .

7Esy7Isy7Isy7Kskiqv0S3Lsiwrfiop94YkSUoQQRDQrl073n333aijSJIkSZKU0LxGlySp9HDGuiRJkiRJkiRJkiRJBbCxLkmSJEmSJEmSJElSAdwKXpIkSZIkSZIkSZKkArhiXZIkSZIkSZIkSZKkAthYlyRJkiRJkiRJkiSpADbWJUmSJEmSJEmSJEkqgI11SZIkSZIkSZIkSZIKYGNdkiRJkiRJkiRJkqQC2FiXJEmSJEmSJEmSJKkANtYlSZIkSZIkSZIkSSqAjXVJkiRJkiRJkiRJkgpgY12SJEmSJEmSJEmSpAL8P7Ho5k4HjNI5AAAAAElFTkSuQmCC.png

#

wdym? tbf, the tokens are just whole words here

#

mainly practice, we just had it as a homework

serene scaffold May 15, 2024, 1:58 PM

#

No, forever

spring field May 15, 2024, 1:59 PM

#

pretty sure google colab is gonna shoot the party down pretty soon bread_pensive

#

that's not the point

spring field May 15, 2024, 1:59 PM

#

spring field if anyone's interested: https://www.isca-archive.org/interspeech_2010/mikolov10_...

this is literally a 2010 paper as well, lol

#

dw, I'll move on to transformers soon

#

may I call it DGRUN: Deep Gated Recurrent Unit Network? lol

craggy agate May 15, 2024, 5:34 PM

#

Guys, I have achieved 61 val accuracy for my CNN human emotion classification model, I have around 3k images for each class. There are 8 classes in total, is this good? How can I potentially increase val accuracy and reduce loss values?

leaden narwhal May 15, 2024, 5:41 PM

#

Hey guys, got this end result on my LSTM and i wanted to code the prediction for the next days. Can anyone help me, im super lost

cedar tusk May 15, 2024, 6:30 PM

#

leaden narwhal Hey guys, got this end result on my LSTM and i wanted to code the prediction for...

uhh u dont have any non test predictions yet?

#

we dont even know which package you used for the predictions

#

need more context

leaden narwhal May 15, 2024, 6:37 PM

#

cedar tusk uhh u dont have any non test predictions yet?

No I dont

#

How do i make the future predictions for the next 7 days for instance

#

or maybe 15

boreal crescent May 15, 2024, 6:51 PM

#

leaden narwhal No I dont

def load_model_and_predict(self):
# Load the trained model
model = load_model('trading_model.keras')

    # Retrieve historical trading data
    rates = mt5.copy_rates_range(self.symbol, mt5.TIMEFRAME_M5, datetime.now() - timedelta(days = 30), datetime.now())
    if rates is None or len(rates) < 60:
        print("Not enough data for prediction")
        return

i have this for 30 days

cedar tusk May 15, 2024, 7:03 PM

#

boreal crescent def load_model_and_predict(self): # Load the trained model model...

uh this is not the prediction part of the code

#

there should be a part that calls model.predict

#

it should be something like this for future predictions:

n_future = 5  # Number of future time steps you want to predict
predictions = []

for _ in range(n_future):
    # Make a prediction for the next time step
    next_pred = model.predict(input_data)
    
    # Append the prediction to the list of predictions
    predictions.append(next_pred[0, 0])
    
    # Update the input data by removing the first element and appending the prediction
    input_data = np.append(input_data[:, 1:, :], [[next_pred[0]]], axis=1)

#

oh wrong reply xD

cedar tusk May 15, 2024, 7:08 PM

#

leaden narwhal No I dont

.

boreal crescent May 15, 2024, 7:22 PM

#

cedar tusk uh this is not the prediction part of the code

i have this predictions call

def load_model_and_predict(self):
    # Load the trained model
    model = load_model('trading_model.keras')

    # Retrieve historical trading data
    rates = mt5.copy_rates_range(self.symbol, mt5.TIMEFRAME_M5, datetime.now() - timedelta(days = 30), datetime.now())
    if rates is None or len(rates) < 60:
        print("Not enough data for prediction")
        return
    # Prepare the data for prediction.
    close_prices, ma5, rsi_values, cci_values = self.get_indicators(rates)
    features = np.column_stack((close_prices, ma5, rsi_values, cci_values))
    features_scaled = self.scaler.transform(features)  # Use the same scaler as during training.

    # Prepare input data for the LSTM model
    X, y = self.prepare_data(features_scaled)

    # Make predictions using the loaded model
    predictions = model.predict(X)

    # Here you can do whatever you need with the predictions.
    print("Predictions:", predictions)

#

And the results i have this,

177/177 ━━━━━━━━━━━━━━━━━━━━ 8s 45ms/step - loss: 0.7418 - val_loss: 3.4327
Epoch 25/30
177/177 ━━━━━━━━━━━━━━━━━━━━ 8s 44ms/step - loss: 0.7563 - val_loss: 3.4275
Epoch 26/30
177/177 ━━━━━━━━━━━━━━━━━━━━ 8s 44ms/step - loss: 0.7597 - val_loss: 3.4299
Epoch 27/30
177/177 ━━━━━━━━━━━━━━━━━━━━ 8s 45ms/step - loss: 0.7648 - val_loss: 3.4291
Epoch 28/30
177/177 ━━━━━━━━━━━━━━━━━━━━ 9s 51ms/step - loss: 0.7499 - val_loss: 3.4423
Epoch 29/30
177/177 ━━━━━━━━━━━━━━━━━━━━ 8s 45ms/step - loss: 0.7444 - val_loss: 3.4330
Epoch 30/30
177/177 ━━━━━━━━━━━━━━━━━━━━ 9s 50ms/step - loss: 0.7396 - val_loss: 3.4489
197/197 ━━━━━━━━━━━━━━━━━━━━ 5s 21ms/step
Predictions: [[-0.17073686]
[-0.17073686]
[-0.17073686]
...
[-0.17073686]
[-0.17073686]
[-0.17073686]]

but i know i have a mistake on my code i trying to fix, but its crazy because my trading bot make profits and predictions for the movement of trading signal and trend

#

Time: 2024-05-15 18:15:00, Open: 1.08653, High: 1.08666, Low: 1.08633, Close: 1.08658, Direction: buy
Time: 2024-05-15 18:20:00, Open: 1.08658, High: 1.08692, Low: 1.08657, Close: 1.0868, Direction: buy
Time: 2024-05-15 18:25:00, Open: 1.0868, High: 1.0872, Low: 1.08669, Close: 1.08715, Direction: buy
Time: 2024-05-15 18:30:00, Open: 1.08716, High: 1.08744, Low: 1.08708, Close: 1.08723, Direction: buy
Time: 2024-05-15 18:35:00, Open: 1.08723, High: 1.08724, Low: 1.08686, Close: 1.08692, Direction: sell
Time: 2024-05-15 18:40:00, Open: 1.08692, High: 1.08697, Low: 1.0867, Close: 1.08678, Direction: sell
Time: 2024-05-15 18:45:00, Open: 1.08678, High: 1.08687, Low: 1.08633, Close: 1.08633, Direction: sell
RSI: 57.673367175840006
last value CCI: 55.203492465835076
MA5: 1.104268
Current Price: 1.1042
Cjurrent Price EURUSD: Bid - 1.08751, Ask - 1.08751
Current Signal trade: SELL
current trend: DOWN
Trend Down, Recommend sell.
MA5: 1.104268
Current Price: 1.1042
Current Trend: UP
succefull order buy send
Stop Loss: 1.08733, Take Profit: 1.08798

steel hull May 15, 2024, 7:45 PM

#

Hey Guys, any suggestions on which would be the best resources to practice Python for Data Analysis? Some free alternatives for Leetcode and Stratascratch?

lapis sequoia May 15, 2024, 7:53 PM

#

Any of you feel that chatGTPjust makes you worse off, give you wrong answers and wastes time by saying you are wrong?

agile cobalt May 15, 2024, 7:54 PM

#

lapis sequoia Any of you feel that chatGTPjust makes you worse off, give you wrong answers and...

not sure about the last one, more often than not it should be the other way around (you wasting your time telling GPT that it is wrong), but for the first half yes

boreal crescent May 15, 2024, 7:55 PM

#

lapis sequoia Any of you feel that chatGTPjust makes you worse off, give you wrong answers and...

yes a chatgpt give always wrong answare

#

but you have to request a right answare showed the mistake

lapis sequoia May 15, 2024, 7:57 PM

#

I do not even use it much. I put my 3000 hours into this before even touching chatgtp.

#

Had the sickest two-part tariff ever. Also, people should use two-part tariffs and stuff for market segementation

#

that is money like on a insane level

pearl ocean May 15, 2024, 8:04 PM

#

bruv why use google colab

#

is it just because its like a virtual gpu or smthing

#

i never used it i just want to know

#

because so many ai tutorials on youtube use colab like why not just use vscode

wooden sail May 15, 2024, 8:05 PM

#

gpu usage indeed

pearl ocean May 15, 2024, 8:05 PM

#

ah ok

steel hull May 15, 2024, 8:05 PM

#

steel hull Hey Guys, any suggestions on which would be the best resources to practice Pytho...

anyone?

lapis sequoia May 15, 2024, 8:05 PM

#

I do not even use it. I question myself and I am like: "I would have been done if I just went to stack overflow for a quick answer"

#

Was Chatgtp always this bad?

stark flame May 15, 2024, 8:18 PM

#

hello guys
what would be the best way to predict a word by giving an incorrect input and from a premade dictionary the word gets predicted
eg: input word " BAD"
word from dictionary "BAT"
so basically we are getting some word prediction form an model but to increase its accuracy further we are trying this method

pls mention me whenever any of you responds

cedar tusk May 15, 2024, 8:46 PM

#

steel hull Hey Guys, any suggestions on which would be the best resources to practice Pytho...

using python and experimenting without looking at the answers

#

only look at documentation

#

u can just take data from kaggle and experiment on ur own

#

learn matplotlib, sns, plotly(maybe)

#

if u are trying to do data analysis only i would just ditch python altogether for R

#

R slaps python in anything not deep learning

#

and even then it has h2o which is industry standard level

#

tidymodels also work very good

#

but is not meant for deep learning

spring field May 15, 2024, 9:04 PM

#

stark flame hello guys what would be the best way to predict a word by giving an incorrect i...

~~gonna sound like a broken record, but you could try an RNN class network~~
~~but for real though, an RNN + DQN might do the trick here, where it learns to prioritize the words in the dictionary...~~
headbang

spring field May 15, 2024, 9:31 PM

#

eh, probably, attention and all, speaking of which, I gotta go and revise that, btw, colab shut the party down... I got until epoch 16, it wasn't making much sense in rollouts, but it was continuing to improve by the other metrics

#

actually, some of the stuff it was generating made a teeny tiny bit of sense

#

these were some of my favourites

- it every much meaning god want suffering suffering idea w first ideas element inclined . . . . long . [any] happy off knew still . . even same making even come [any] . . the you . you . is is . . . you . . . . . . . [any] . those the
- your believe the the . . lot is gives both give science believeth science science . out .
- your w against the is the the , . a . out . out . , your . been i . great nobody day day worst important how ever - look passion pleasurable alec times kisses aslan aslan alone . knows best found doesnt waffle waffle aslan st value something . . soul is . to . to to the the the
- different the is w w w fact . the the . the w determination isnt because today ?its wednesday end . end success world used the the better [any] between go

lofty terrace May 16, 2024, 3:49 AM

#

How do you guys work on solo projects, i.e finding huge data sets

tacit basin May 16, 2024, 4:39 AM

#

lofty terrace How do you guys work on solo projects, i.e finding huge data sets

Hugging face Fine Web dataset is huge

steel hull May 16, 2024, 4:51 AM

#

cedar tusk tidymodels also work very good

Thanks buddy

spring field May 16, 2024, 6:08 AM

#

how many epochs? (which ig is a silly way to compare them, but oh well...)

#

training loop as in the code for it?

leaden kayak May 16, 2024, 6:23 AM

#

Any idea of models trained on desktop and software interactions, capable of recognizing UI elements of OS, softwares and windows, without relying on HTML (ie. Electron App)

spring field May 16, 2024, 6:29 AM

#

for epoch in range(1, 1000 + 1):
    for dataloader in [dataloader_train, dataloader_test]:
        losses = []
        accs = []

        if dataloader is dataloader_train:
            model.train()
            torch.set_grad_enabled(True)
        else:
            model.eval()
            torch.set_grad_enabled(False)

        for x_padded, y_padded, x_length in tqdm(dataloader, file=sys.stdout):
            x_padded = x_padded.to(DEVICE)
            y_padded = y_padded.to(DEVICE)

            x_packed = pack_padded_sequence(
                x_padded, x_length, batch_first=True, enforce_sorted=False
            )
            y_packed = pack_padded_sequence(
                y_padded, x_length, batch_first=True, enforce_sorted=False
            )

            y_prim_packed, _ = model.forward(x_packed)

            idxes_batch = range(len(y_packed.data))
            idxes_y = y_packed.data  # (B * Seq, F)
            loss = -torch.mean(
                torch.log(y_prim_packed.data[idxes_batch, idxes_y] + 1e-8)
            )
            losses.append(loss.cpu().item())

            idxes_y_prim = y_prim_packed.data.argmax(dim=-1)
            acc = torch.mean((idxes_y_prim == idxes_y) * 1.0)
            accs.append(acc.cpu().item())

            if dataloader is dataloader_train:
                loss.backward()
                optimizer.step()
                optimizer.zero_grad()

past meteor May 16, 2024, 6:31 AM

#

spring field ```py for epoch in range(1, 1000 + 1): for dataloader in [dataloader_train, ...

Are you using lightning + tensorboard?

spring field May 16, 2024, 6:32 AM

#

I have no idea what either of those are

past meteor May 16, 2024, 6:32 AM

#

Aha, there's a whole load of boilerplate you can be saved from

past meteor May 16, 2024, 6:35 AM

#

spring field I have no idea what either of those are

https://lightning.ai/docs/pytorch/stable/starter/introduction.html Pytorch lightning reduces the amount of boilerplate you need to write for training models (just write the step (forward pass + how to compute the gradients) and it does the rest, including things you typically want like early stopping and so on.

Also integrates well with Tensorboard https://lightning.ai/docs/pytorch/stable/visualize/logging_basic.html, plots the val and train loss

spring field May 16, 2024, 6:35 AM

#

Very nice, thank you

past meteor May 16, 2024, 6:37 AM

#

I'm typically very skeptical for 3rd party deps but lightning automates all the things you don't want to be spending time on so you can just focus on getting the architecture right and you can forget the lest of the plumbing

spring field May 16, 2024, 6:37 AM

#

odd you say? you want the loader or the dataset? cuz the loader is torch...DataLoader

pure birch May 16, 2024, 6:38 AM

#

hello

spring field May 16, 2024, 6:40 AM

#

I assume this is what you want to see

def __getitem__(self, idx):
        x_raw = np.array(self.final_quotes_sentences[idx], dtype=np.int64)

        y = np.roll(x_raw, -1)
        # lag
        # [this is fun] => [is fun this]
        y = y[:-1]
        x = x_raw[:-1]
        x_length = len(x)

        pad_right = self.max_sentence_length - x_length
        x_padded = np.pad(x, (0, pad_right))
        y_padded = np.pad(y, (0, pad_right))

        return x_padded, y_padded, x_length

pure birch May 16, 2024, 6:41 AM

#

i cannot understand a single line of code in it--

spring field May 16, 2024, 6:42 AM

#

oh right, there's this thing where it's padded with a bunch of zeros

#

that actually might be throwing it off a bit

#

because the rnn cells just include those 0s during the recurrence

#

instead of one hot encoding and dotting it just takes out the values by index

#

I'm aware

#

there is, idxes_y is the ground truth

solar creek May 16, 2024, 6:49 AM

#

Question, will python help me become a Data Scientist?

spring field May 16, 2024, 6:49 AM

#

yeah

#

y_packed is the ground truth

#

look, I didn't write all of this code myself

#

we are given templates

#

the output of the model is not a token, it's a 1d tensor of size equivalent to the number of classes

spring field May 16, 2024, 6:55 AM

#

solar creek Question, will python help me become a Data Scientist?

it sure is a tool to be used in the field, but only you can help yourself

solar creek May 16, 2024, 6:56 AM

#

spring field it sure is a tool to be used in the field, but only you can help yourself

Alright, thanks.

#

I'm a teen learning Python, with the only ambition of mastering it, I just wanna know what benefit can I assert myself if I learn this language?

spring field May 16, 2024, 6:58 AM

#

so, you can either one hot encode the token and you get a one hot tensor that's the same size as the output of the model and then you dot them together or you can just index the output tensor

spring field May 16, 2024, 6:59 AM

#

solar creek I'm a teen learning Python, with the only ambition of mastering it, I just wanna...

Well, Python is widely used in the field and has a large ecosystem of libraries and such

solar creek May 16, 2024, 7:00 AM

#

spring field Well, Python is widely used in the field and has a large ecosystem of libraries ...

I'm certain it'll help me get a decent pay job as well?

#

I'll dedicate most of my time learning this language, to be fair.

spring field May 16, 2024, 7:02 AM

#

I agree 😄

spring field May 16, 2024, 7:03 AM

#

spring field so, after reading that paper (https://discord.com/channels/267624335836053506/36...

what did you find odd about this loss graph though?

spring field May 16, 2024, 7:38 AM

#

I would have, but I think I ran out of the free colab compute minutes or however they measure that
I usually run models in paperspace now, I'll give it another try and more epochs, yeah

#

it's a bit of a first that epochs take this long too 😄

spring field May 16, 2024, 8:17 AM

#

it was GRU specifically btw

#

they were SOTA token predictors in 2k10 waaaaaaaaaahhhhhh

#

though interestingly the paper did say it took them 6 hours to train the model 🤔

grand minnow May 16, 2024, 8:24 AM

#

https://ai.google.dev/competition if anyone is interested

Google for Developers

Gemini API | Join the Gemini API Developer Competition | Google A...

Integrate the Gemini API, quickly develop prompts, and transform ideas into code to build AI apps.

spring field May 16, 2024, 8:36 AM

#

lol

#

I will, dw

spring field May 16, 2024, 9:53 AM

#

btw, what is steps? is that how many batches have gone through? so like if you have say 10 batches per epoch and 5 epochs, that's 50 steps?

#

oh, right, so like, stepping opposite the gradient

#

yeah, gotcha, I see

past meteor May 16, 2024, 11:08 AM

#

this is something I'd do with tensorboard

#

For what it's worth, I like Matiiss' approach, even if the models they're trying suck it's a good ddactic experience

spring field May 16, 2024, 11:12 AM

#

hey

past meteor May 16, 2024, 11:12 AM

#

Tbf babysitting models and doing "grad student descent" isn't what you should be doing though

#

https://en.wiktionary.org/wiki/graduate_student_descent

Wiktionary

graduate student descent

spring field May 16, 2024, 11:13 AM

#

uh oh

past meteor May 16, 2024, 11:14 AM

#

I used to do this when I was a masters student myself 😂 . Now I just code up hyperparameter tuning stuff after doing 1-2 trial runs and let GPU go brrr

#

It's far better than tweaking things manually - it doesn't work nor scale

#

Unless you have intuitions for why a certain hyperparameters should be set to a specific value

#

I read papers, drink coffee or just idk sleep and monitor my logs on tensorboard (train/val curves), optuna (hyperparameter importance) and mlflow (hyperparam results obtained per model instance) every so often

spring field May 16, 2024, 11:18 AM

#

I don't think I used their hyperparams, it was just suggested to have between 30 to 500 layers of RNN hidden stuffs, they used 100 in their paper, I used 50, didn't touch other params

#

I'm not sure there were any hyperparams listed though pithink

past meteor May 16, 2024, 11:21 AM

#

I think there's actually funny insight from the models I'm running

spring field May 16, 2024, 11:22 AM

#

they're kiiinda scattered around, those hyperparams

#

they used a learning rate of 0.1 or 0.3, something along those lines

past meteor May 16, 2024, 11:22 AM

#

"worse" architectures have better results because they get to do more hyper parameter searches / hour

spring field May 16, 2024, 11:23 AM

#

I know

#

I had 1e-3

past meteor May 16, 2024, 11:23 AM

#

Do they even use ADAM?

#

Or regular sgd

spring field May 16, 2024, 11:24 AM

#

they used SGD

#

was ADAM invented at that time?

#

or rather, discovered?

past meteor May 16, 2024, 11:24 AM

#

Are you just trying to recreate their results?

spring field May 16, 2024, 11:24 AM

#

well, no, I just don't want my model to suck

past meteor May 16, 2024, 11:25 AM

#

What's your reasoning?

spring field May 16, 2024, 11:25 AM

#

I mean, reading their paper did help, cuz I went from having 1 hidden layer to 50... that like, severely improved performance

past meteor May 16, 2024, 11:26 AM

#

Oh, you said don't not use Adam

#

I think if you're willing to use SGD with a very low learning right and high patience on your early stopping it might stomp adam

#

At least, that's what I've seen empirically

#

But yeah it's annoying advice but the easiest way to improve models is to actually hyperparemeter tune them "automatically"

spring field May 16, 2024, 11:35 AM

#

alrighty, I'm confused, the paper mentions hidden/context size several times and I don't understand whether that means size of matrix or how many matrices there are

#

Size of context (hidden) layer s is usually 30 − 500 hidden units.
https://www.isca-archive.org/interspeech_2010/mikolov10_interspeech.pdf

#

I mean, given current empirical evidence of what I have seen, it seems to mean how many matrices there are (i.e., how many RNN cells), but that means that the hidden size of a single cell is not mentioned as far as I can tell

#

no

past meteor May 16, 2024, 11:39 AM

#

Are you using learning rate halving?

#

I think it's just 1 hidden layer with 30 to 500 neurons, could be wrong tho

#

Size of context (hidden) layer s is usually 30 − 500 hidden units.

spring field May 16, 2024, 11:46 AM

#

mmm, cuz I thought it basically meant this...

past meteor May 16, 2024, 11:46 AM

#

I don't think there's hidden-to-hidden connections here

spring field May 16, 2024, 11:48 AM

#

cool...
well, I did try 256, 512, 1024 (well, whatever they're called) for a single hidden layer, that yielded pretty bad results
chaining hidden layers immediately improved the model

#

I was gonna move to transformers, but then I noticed that transformers use batch norm... so, I decided to take a deeper look at that again and reading the original paper, idea being that it would speed up the model I had...

#

oh well, starting with the batch norm paper wasn't too bad of an idea either 😁

However, the effect of batch normalization is dependent on the mini-batch size and it is not obvious how to apply it to recurrent neural networks
https://arxiv.org/pdf/1607.06450 (Layer Normalization)

cedar tusk May 16, 2024, 12:05 PM

#

whenever i see ur name the dude comes to my mind, not the protagonist but the old person that shouts "As its written" xD

past meteor May 16, 2024, 12:09 PM

#

the thing that is sus about batch norm is that the authors seem to change their mind if it should be before or after the activation

cedar tusk May 16, 2024, 12:10 PM

#

how do u guys find the time to read papers xD

#

its beyond me

#

how the f

#

well if ur finances are ok i would do the same

orchid cobalt May 16, 2024, 2:52 PM

#

Hey folks

#

is it allowed to share projects one is working on?

#

or do we need permissions from mods?

agile anvil May 16, 2024, 3:19 PM

#

What are the best resources for meta-prompting?

cedar tusk May 16, 2024, 3:49 PM

#

orchid cobalt or do we need permissions from mods?

why would u send a whole project at once?

#

just ask the questions in snippet form

fair vigil May 16, 2024, 4:10 PM

#

hey guys new to the data science and ai subject basis any recommendations on what i should learn

#

first*

fallow coyote May 16, 2024, 4:26 PM

#

fair vigil hey guys new to the data science and ai subject basis any recommendations on wha...

Im in the same boat. Tbh the way I lve been learn changes a lot 😂. For the maths, read the pinned messages for the resources and for the coding, look up Python DataAnalysis by Wes Kinney

cedar tusk May 16, 2024, 4:41 PM

#

fair vigil hey guys new to the data science and ai subject basis any recommendations on wha...

get comfortable in either R (perfect for anything other than deep learning or deployment) or python (good for deep learning and deployment)

cedar tusk May 16, 2024, 4:41 PM

#

fallow coyote Im in the same boat. Tbh the way I lve been learn changes a lot 😂. For the math...

you too

#

deployment means making a server ready for the model with everything set up for the machine to work

#

then try to implement/use and learn the ml algorithms, no need to learn EVERY ml algo since there are bajillions of em, just learn the big ones (as in popularity) such as linear/logistic regression, svc, random forest

#

this should take u at least 1 month

#

after that come back here

cedar tusk May 16, 2024, 5:03 PM

#

but deployin models require more than necessary computer knowledge

#

thats the hard part of getting experience for data science

#

without proper equipment u cannot really see real world applications

#

like u gotta setup a server (requires networking and linux knowledge) or buy/rent a server, you gotta set up api (superficial webdev knowledge) etc etc

#

true tho if u work on these than u become what industry wants

#

but its too much for someone new

#

cuz if they dont then they most prob cant xD

#

heyyy im a student as well

#

tru

#

is that french?

#

or latin

#

oh its like saying thus my hypothesis is the truth

#

or maybe demonstration complete?

#

i dk

#

i will never create a proof

#

i hate proofs

#

yea but they are not presented to a person to grade

#

blegh

#

compiler is mah bitch

#

i would prob just remove all the safety from the compiler

#

i cannot have a compiler say what i can and cannot do

#

yea that was counterintuitive

#

actually, i need for the compiler tell me what i can do

#

now that i think about it

#

since if it doesnt, then im not using that language at all

#

hmm

#

i need to eat, all this thinking made me hungry

#

do u like trying new food?

#

or maybe i should dm u xD

fallow coyote May 16, 2024, 5:17 PM

#

I swear to god, i fucking hate matplotlib. im trying to set the x axis to be labelled with the years from a dataframe and have it formatted so its clear to read but every solution i find is not working

cedar tusk May 16, 2024, 5:18 PM

#

fallow coyote I swear to god, i fucking hate matplotlib. im trying to set the x axis to be lab...

bro just switch to R

#

ggplot is like heaven compared to matplotlib

#

or do cell magic to just run r in one cell of the notebook

#

if u ever want to change dm me, ill teach u

#

the ways of R

fallow coyote May 16, 2024, 5:21 PM

#

ill be alright XD. ive briefly used r before. simple af to use but sticking to python for now.

cedar tusk May 16, 2024, 5:22 PM

#

suit urself, R is heaven

river cape May 16, 2024, 5:23 PM

#

AgglomerativeClustering.init() got an unexpected keyword argument 'affinity'

#

Any idea why can't i type affiiniy

cedar tusk May 16, 2024, 5:23 PM

#

river cape AgglomerativeClustering.__init__() got an unexpected keyword argument 'affinity'

cuz init method of the class doesnt take that argument?

#

which package is it?

river cape May 16, 2024, 5:24 PM

#

hc = AgglomerativeClustering(n_clusters = 5, affinity='euclidean', linkage = 'ward')

cedar tusk May 16, 2024, 5:25 PM

#

i found this stackoverflow post https://stackoverflow.com/questions/53849107/sklearn-agglomerative-clustering-custom-affinity

Stack Overflow

Sklearn Agglomerative Clustering Custom Affinity

I'm trying to use agglomerative clustering with a custom distance metric (ie affinity) since I'd like to cluster a sequence of integers by sequence similarity and not something like the euclidean

river cape May 16, 2024, 5:25 PM

#

So I should change affinity to metric?

cedar tusk May 16, 2024, 5:30 PM

#

never used the method so i have no idea

fallow coyote May 16, 2024, 5:37 PM

#

cedar tusk bro just switch to R

Manage to fucking solve the issue. I may switch to R if matplotlib makes me want to commit an act of terrorism

cedar tusk May 16, 2024, 5:38 PM

#

fallow coyote Manage to fucking solve the issue. I may switch to R if matplotlib makes me want...

it did to me, multiple times, now if i am to do visualisation i just dont use python at all

#

dplyr is superior to pandas anyways

fallow coyote May 16, 2024, 6:17 PM

#

Ive only just started using it matplotlib and already it feels fucking frustrating to use. Going to start a few guided ML projects just to get grasp on what ML progrmaming is like

#

Ill see how it goes as I advance. Im still very much a beginner. Tbh ive only been practicing with pandas plus a bit of matplotlib; so really just doing the data prep and cleaning of data if thats what its called

spring field May 16, 2024, 6:38 PM

#

pandas is pretty much just numpy so it's really all the same

lofty terrace May 16, 2024, 7:01 PM

#

tacit basin Hugging face Fine Web dataset is huge

huge thanks!

covert finch May 16, 2024, 7:04 PM

#

what does this mean, instead of calling it a dataframe you call it an n-dimensional array? haha

cedar tusk May 16, 2024, 7:04 PM

#

kaggle has a huge dataset db

spring field May 16, 2024, 7:20 PM

#

is that one weird exception when dtype is a list of named types?

#

which I assume they still probably sort of organize it such that it's multiple arrays with same type of data

#

ah

covert finch May 16, 2024, 7:27 PM

#

this got me looking at the pandas source code

#

and

#

they have a class called Extensionarray which is basically what all their other "arrays" are inheriting

#

which is why it can support so many different types of data

#

exactly yeah

#

oh I absolutely agree

#

I'm saying this is why they are slower

#

I don't know why we have so many fancy ds libs when pretty much everything can be done with pure numpy

#

patsy is cool too

#

https://patsy.readthedocs.io/en/latest/

#

lower level stats modeling used a lot with numpy

#

hahahaha yeah

#

what that means to me is it defines the models but doesnt execute the computations, but its mostly used for like linear modeling

desert dew May 16, 2024, 7:39 PM

#

Can anyone help me find a better road map for ai and ml

river cape May 16, 2024, 7:40 PM

#

Hey guys so I just finished studying about regression , classification and clustering . I did do some projects also .. What should be my next focus?
Neural Networks or NLP or Computer Vision or Genrative AI?

covert finch May 16, 2024, 7:41 PM

#

good question, i think by declaring the models it makes the calculations faster

#

I just know its used in statsmodels which I used to use a lot

#

yeah the use cuda python right?

#

right right

#

https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html

CUDA 12.4 Update 1 Release Notes

The Release Notes for the CUDA Toolkit.

#

The worst api ever is pyspark man

#

just be glad you don't have to use that haha

#

well they function very differently than pandas or R dataframes, which like you said makes it more reliable and faster

#

I'm just complaining because it took my a while to get used to... its basically a wrapper for apache spark so it was kinda like learning a new language

#

But I'm pretty used to it now. and I love the concept of cluster compute

#

really? on a single machine?

#

I guess yeah if you don't force it to infer the schema maybe, but without a cluster polars has been the fastest library for general EDA

#

for me

warm trellis May 16, 2024, 8:10 PM

#

Hey everyone, I know it's really simple question but why I am having this error?
stack expects each tensor to be equal size, but got [30432, 72, 8] at entry 0 and [30432, 1, 1] at entry 1?
Cannot I've different shapes for the X and y?

spring field May 16, 2024, 8:12 PM

#

you can

spring field May 16, 2024, 8:14 PM

#

warm trellis Hey everyone, I know it's really simple question but why I am having this error?...

!paste could you share your code?

arctic wedgeBOT May 16, 2024, 8:14 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

spring field May 16, 2024, 8:14 PM

#

and the entire traceback as well

arctic wedgeBOT May 16, 2024, 8:15 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

warm trellis May 16, 2024, 8:17 PM

#

spring field !paste could you share your code?

Here is the code and traceback..
https://paste.pythondiscord.com/FMJA

#

Basically what is does that, it creates X => which is shape of the 30432, 72, 8 and y normally it should be 30432, 1 and then my neural network learns from this

spring field May 16, 2024, 8:26 PM

#

alright, well, that's the data, but how are you processing it? like, what's the model/network?

covert finch May 16, 2024, 8:27 PM

#

pyspark definitely has better support for non tabular data, which seems to be your use case

warm trellis May 16, 2024, 8:28 PM

#

spring field alright, well, that's the data, but how are you processing it? like, what's the ...

I'm pretty sure problem isn't in the network, previously when I using pytorch dataset it was working charm. Now when trying lightening I got errors...

spring field May 16, 2024, 8:29 PM

#

well, does the model's input and output match your data?

warm trellis May 16, 2024, 8:30 PM

#

spring field well, does the model's input and output match your data?

yeap, it's. I think it's due to the L.LightningDataModule. Cuz, when I comment out the y part and create an empty shaped numpy array same as X it starts to work..

spring field May 16, 2024, 8:42 PM

#

can you still show the model

warm trellis May 16, 2024, 8:47 PM

#

spring field can you still show the model

I just added them here. Model is a little bit not really correct now because of the data shape this data loader model spits out. But with pytorch normal dataset, I used to be able to run it with X tensor size (32,72,9) and Y tensor (32, 1)
https://paste.pythondiscord.com/5Z4A

#

I think the error is there because, method val_dataloader() spits tensor size torch.Size([2, 30432, 72, 9]), that's what it should not spit 😄

unreal scarab May 16, 2024, 9:55 PM

#

I need help with reading data from a CSV file.

Here's the code:

import pandas as pd

file_path = "Train.csv"
df = pd.read_csv(file_path)

here's the error:
DtypeWarning: Columns (1,7,16,35) have mixed types. Specify dtype option on import or set low_memory=False.
df = pd.read_csv(file_path)

about the CSV file:
it has around 98K lines and 40 columns.
some data values are missing, and data consist of several types int, float, string, etc.

warm trellis May 16, 2024, 10:04 PM

#

unreal scarab I need help with reading data from a CSV file. Here's the code: ```python impo...

that should not be an error, but more warning for you that the some columns has mixed data types

spring field May 17, 2024, 1:19 AM

#

warm trellis I just added them here. Model is a little bit not really correct now because of ...

sorry, I have no idea at this moment 😁

#

all these norms are kinda interesting btw, batch, instance, group, layer..., well, tbf, ig group norm could be considered a generalization of instance and layer norms, but still

spare briar May 17, 2024, 1:45 AM

#

whats interesting about them @spring field ?

spring field May 17, 2024, 1:47 AM

#

that they normalize stuff 😄
idk, they speed up training and such

weary sedge May 17, 2024, 2:01 AM

#

Does anyone have a good understanding about Generative Adversarial Neural Networks, especially Super-Resolutions GANNs and Enhanced Super-Resolutions GANNs?

spring field May 17, 2024, 2:05 AM

#

weary sedge Does anyone have a good understanding about Generative Adversarial Neural Networ...

it's best to just ask your question directly instead of asking whether someone can answer the question first, it's just an unnecessary step

weary sedge May 17, 2024, 2:06 AM

#

spring field it's best to just ask your question directly instead of asking whether someone c...

I actually want to have a conversation around this topic?

Which is the reason to my post above.

If anyone can best explain what is a GANN, and how it works (preferably in basic english).

spring field May 17, 2024, 2:08 AM

#

Well, then you may want to begin that conversation, otherwise someone will have to ask you what you want to talk about which again is an unnecessary step, just begin.

spring field May 17, 2024, 2:09 AM

#

weary sedge I actually want to have a conversation around this topic? Which is the reason t...

might I suggest https://www.youtube.com/watch?v=Sw9r8CL98N0

YouTube

Computerphile

Generative Adversarial Networks (GANs) - Computerphile

Artificial Intelligence where neural nets play against each other and improve enough to generate something new. Rob Miles explains GANs

One of the papers Rob referenced: http://bit.ly/C_GANs

More from Rob Miles: http://bit.ly/Rob_Miles_YouTube

https://www.facebook.com/computerphile
https://twitter.com/computer_phile

This video was filmed...

▶ Play video

weary sedge May 17, 2024, 2:09 AM

#

I have seen this video before 😄

#

I think there is a way to make a thread, so that way, I don't have a conversation about this on this channel.

#

But I am not sure how to do that @spring field

spring field May 17, 2024, 2:11 AM

#

You can create a post in #1035199133436354600 but I don't think that's what you want, I suggest just sticking to this channel

#

it's not thaaat active anyway

weary sedge May 17, 2024, 2:14 AM

#

spring field You can create a post in <#1035199133436354600> but I don't think that's what yo...

Thank you...

Now, do you know much about the subject asked: #data-science-and-ml message

Have you had any personal experience about this?

What are your thoughts?

spring field May 17, 2024, 2:17 AM

#

Not yet, in the coming weeks I will actually learn about them, so, I'll certainly be able to chime in then, maybe a bit sooner 😁

weary sedge May 17, 2024, 2:18 AM

#

If that is the case, would you permit me to collaborate with you?

I would like to embark on this journey with you and we can both share knowledge together.

#

What do you say? @spring field?

spring field May 17, 2024, 2:30 AM

#

Thank you for the invitation, but I'll have to decline it for now 🙂

weary sedge May 17, 2024, 2:32 AM

#

spring field Thank you for the invitation, but I'll have to decline it for now 🙂

Ok 👍🏾

vestal spruce May 17, 2024, 2:49 AM

#

Do tensor turn into list if I slice it?
I got this traceback error message

Traceback (most recent call last):
  File "C:\Users\HP\Pycharmprojects\proyek_ta\test.py", line 42, in <module>
    result = model.transcribe(speech)
             ^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\HP\Pycharmprojects\proyek_ta\.venv\Lib\site-packages\whisper\transcribe.py", line 135, in transcribe
    decode_options["language"] = max(probs, key=probs.get)
                                                ^^^^^^^^^
AttributeError: 'list' object has no attribute 'get'

#

Here's a snippet of the code in question

signal, fs = torchaudio.load(WAV_FILE)
model = whisper.load_model("base")
segments = diar.diarize(WAV_FILE, num_speakers=NUM_SPEAKERS, outfile="segments.txt", silence_tolerance=2)
speeches = []
for segment in segments:
    speech = signal[:, segment['start_sample']:segment['end_sample']]
    result = model.transcribe(speech)
    speeches.append(f"speaker_{segment['label']}: {result}")

spring field May 17, 2024, 2:53 AM

#

vestal spruce Here's a snippet of the code in question ```py signal, fs = torchaudio.load(WAV_...

you can just print(type(speech)) before passing it to model.transcribe

vestal spruce May 17, 2024, 2:55 AM

#

oh right

orchid forge May 17, 2024, 5:21 AM

#

i just got a new laptop and im using anaconda after like 1 year, from last 1 year i was coding on online platforms like google colab and kaggle now im gonna use anaconda

#

and now i dont need online link of data to connect to my python platform

#

would anaconda be good for my every data analysis project?

#

i mean i wont need to install anything different for any other new library of python for data analysis right?

lapis sequoia May 17, 2024, 5:27 AM

#

anyone using vLLM to host the LLM as an webserver API to interact with? The question is how do I send multiple prompts via a post API request to generate in parallel? (as I can with the offline inference of vllm llm class)

twin garden May 17, 2024, 5:42 AM

#

can someone give me a roadmap to become to get a job in data science(currently learning basic python)

#

Like what are the types of tech stack to become a good at data science

jaunty helm May 17, 2024, 5:43 AM

#

twin garden Like what are the types of tech stack to become a good at data science

https://roadmap.sh/ is one I see people recommending

roadmap.sh

Developer Roadmaps - roadmap.sh

Community driven roadmaps, articles and guides for developers to grow in their career.

twin garden May 17, 2024, 5:45 AM

#

jaunty helm https://roadmap.sh/ is one I see people recommending

there is a roadmap for data analyst. Is it the same as data science?

jaunty helm May 17, 2024, 5:46 AM

#

twin garden there is a roadmap for data analyst. Is it the same as data science?

well next to that is AI and Data Scientist

twin garden May 17, 2024, 6:20 AM

#

is learning django framework is a good idea for data science?

spring field May 17, 2024, 6:21 AM

#

probably not

#

it's a massive framework on its own

#

I mean, ig you could learn just enough for whatever you need, but I'd maybe suggest first picking up flask if you really need to build a backend for some DS project or sth, it's much more lightweight

#

but generally, that's backend related, probably not that much of a concern for you in the DS field, especially not now, since now I assume you're just starting, so, just focus on DS

twin garden May 17, 2024, 6:32 AM

#

okie

cedar tusk May 17, 2024, 6:57 AM

#

twin garden is learning django framework is a good idea for data science?

not needed at all, just use fastapi

orchid forge May 17, 2024, 7:12 AM

#

guys

untold marsh May 17, 2024, 7:12 AM

#

twin garden is learning django framework is a good idea for data science?

Not required

#

Did you get the laptop @orchid forge

orchid forge May 17, 2024, 7:13 AM

#

Creating a Series by passing a list of values, letting pandas create a default RangeIndex.
what does the RangeIndex means here and why it creates a default RangeIndex?

orchid forge May 17, 2024, 7:13 AM

#

untold marsh Did you get the laptop <@1127458312938598460>

ya

untold marsh May 17, 2024, 7:14 AM

#

orchid forge Creating a Series by passing a list of values, letting pandas create a default R...

Its a range startinf from 0 to no. of rows you have in a DF

orchid forge May 17, 2024, 7:15 AM

#

oh

#

thanks kev

untold marsh May 17, 2024, 7:15 AM

#

orchid forge thanks kev

No isoos

orchid forge May 17, 2024, 7:15 AM

#

im studying from the beginning

untold marsh May 17, 2024, 7:15 AM

#

Which laptop is it btw?

untold marsh May 17, 2024, 7:16 AM

#

orchid forge im studying from the beginning

Its alright

#

I suggested a HP i7 that day

lapis sequoia May 17, 2024, 8:06 AM

#

How would u guys show off a scraping project? I have one where I thought the process of getting the data and formatting it was pretty interesting and I wanna show the process, not just the code.. Ive been writing up a word document but idk if there's like a standard way to do this or what

#

True

river cape May 17, 2024, 9:30 AM

#

Hey guys so I just finished studying about regression , classification and clustering . I did do some projects also .. What should be my next focus?
Neural Networks or NLP or Computer Vision or Genrative AI?

cedar tusk May 17, 2024, 9:53 AM

#

river cape Hey guys so I just finished studying about regression , classification and clust...

have you implemented ml algos in python?

#

from scratch just using numpy?

#

do that

lapis sequoia May 17, 2024, 10:13 AM

#

lol im scraping video thats y i kinda wanna focus more on the process

#

data isnt useful besides for training data i just wanna show that I was able to use the hidden api and construct the footage from raw .ts

#

maybe project just sucks idk

#

im a student just trying to get some scraping gigs on upwork rn

#

want to show ppl that i can do anything

#

prolly will stick to the word document and copy some of it to the github readme, and link to the repository on the document

#

that way I can have the document to show on upwork and the github for my actual resume applying for internships and stuff

#

well the readme at least I'd have the github regardless ig

frail heart May 17, 2024, 11:41 AM

#

If I was training a network to recognize bus, bike and car in an image, would the output neurons look something like this?

rn_image_picker_lib_temp_252dc8d6-65d4-4513-b1f0-1ed4168fd593.jpg

lapis sequoia May 17, 2024, 11:47 AM

#

yeah was considering that but thought it might be overkill.. once I have a website for some other projects I'll throw it up there but it's not really worth for just this one for me

frail heart May 17, 2024, 12:12 PM

#

I guess I have to learn about CNN.

#

Can you recommend a video for me about CNN?

deep zealot May 17, 2024, 12:54 PM

#

anyone familiar with regression models?

left tartan May 17, 2024, 1:23 PM

#

deep zealot anyone familiar with regression models?

Just ask the question plz.

remote mountain May 17, 2024, 1:24 PM

#

Henlo?

#

I’m running into some problems I’m trying to use Jupyter ….

#

#

I tried downloading the module from terminal via pip but it’s still showing the error

#

Any help is appreciated

bold snow May 17, 2024, 1:38 PM

#

hi guys does anyone know if there's a pretrained model from huggingface that classify tone/pitch, avg. decibels, max. decibel from audio?

deep zealot May 17, 2024, 1:38 PM

#

left tartan Just ask the question plz.

#1240995766403862568 message

#

basically its in here

bold snow May 17, 2024, 2:14 PM

#

is it easier than training my own model to do classification task?

#

i only have 10 hours of training data without classification label yet only transcription

deep zealot May 17, 2024, 2:48 PM

#

#importing libraries
import numpy as np
import pandas as pd

#pull the data
dataset = pd.read_stata("eitc.dta")

#preparing dummy variables
dataset['post93'] = np.where(dataset['year'] > 1993, 1, 0)
dataset['mom'] = np.where(dataset['children'] > 0, 1, 0)
dataset['mom_post93'] = dataset['mom'] * dataset['post93']

#Isolate X and Y variables
Y = dataset.loc[:, 'work'].values
X = dataset.loc[:, ['post93', 'mom', 'mom_post93']].values

#Do logistic regression
import statsmodels.api as sm
X = sm.add_constant(X)
model1 = sm.Logit(Y, X).fit()
model1.summary(yname = "Work",
               xname = ("intercept", "After 1993", "Is mom",
                        "Mom after 1993"),
               title = "Impact of tax credit on employment - model1")

#

Can anyone explain why a constant/intercept is needed to perform the regression

agile cobalt May 17, 2024, 2:55 PM

#

deep zealot Can anyone explain why a constant/intercept is needed to perform the regression

try to create a line passing through these points using only y = a*x, without an intercept b

past meteor May 17, 2024, 2:56 PM

#

deep zealot ```py #importing libraries import numpy as np import pandas as pd #pull the dat...

Without an intercept your regression line would always go through (0,0) for instance

past meteor May 17, 2024, 2:58 PM

#

deep zealot Can anyone explain why a constant/intercept is needed to perform the regression

Think about modelling time spent studying versus your grade. Let's assume the relationship is linear..if you study 0 hours your grade isn't 0/100 right?

deep zealot May 17, 2024, 3:11 PM

#

agile cobalt try to create a line passing through these points using only `y = a*x`, without ...

impossible

#

so shouldn't the program return an error if i don't pass an constant/intercept with that logic?

#

because the program should perceive something impossible as an error

deep zealot May 17, 2024, 3:14 PM

#

past meteor Think about modelling time spent studying versus your grade. Let's assume the re...

I see

#

but can you kindly inform me how this constant/intercept is made

#

and how this constant/intercept affects my other values

past meteor May 17, 2024, 3:18 PM

#

deep zealot but can you kindly inform me how this constant/intercept is made

By solving an optimization problem. Others call it curve fitting.

deep zealot May 17, 2024, 3:19 PM

#

i think i got it. ty

past meteor May 17, 2024, 3:20 PM

#

Yeah, you can look up maximum likelihood estimation (MLE) and you'll find tons of resources explaining it in detail

deep zealot May 17, 2024, 3:20 PM

#

deep zealot so shouldn't the program return an error if i don't pass an constant/intercept w...

now *coughs aggressively

past meteor May 17, 2024, 3:20 PM

#

I don't know statsmodels' (clunky) API well but it adds an intercept by default

deep zealot May 17, 2024, 3:20 PM

#

lol

#

why did you say its clunky

past meteor May 17, 2024, 3:21 PM

#

Using sci-kit learn feels a lot more natural

deep zealot May 17, 2024, 3:22 PM

#

bias coming in clutch

#

tysm

fringe fossil May 17, 2024, 3:28 PM

#

Does anybody here have any experience working with Yolov8?

#

Trying to get it to read some documents, hoping I could get a question or two answered.

past meteor May 17, 2024, 3:29 PM

#

Ask your question directly please

fringe fossil May 17, 2024, 3:29 PM

#

Sure thing.

#

Right now, I'm trying to use it as an OMR for detecting checkmarks/unfilled checkboxes in a series of documents. I've fed it ~50 yes boxes and no boxes and gave it ~250 epochs of training, and it hasn't detected a single one, which leads me to think somethin's wrong with the dataset I gave it.

#

For a simple black and white image, where the checks/boxes are relatively small, about how many do you think would be necessary for it to start identifying them correctly?

heady spoke May 17, 2024, 3:31 PM

#

hello, how is everyone? I have a quick question about implementing GPT into a python project. I´ve tried implementing the basic project that the documentation has, but I have an error that basically says that I ran out of free trials, but I never used it before. Am I missing any configurations for it to work?

#

the error is 429 I think

analog bobcat May 17, 2024, 3:31 PM

#

Can someone help I’m having problem with my visual studio code My visual studio code is stating numpy not accessed.
Import numpy could not be resolved

fringe fossil May 17, 2024, 3:31 PM

#

fringe fossil For a simple black and white image, where the checks/boxes are relatively small,...

For reference, below is the 'ideal' image for the form. They wind up getting printed, filled, then scanned, so there's some noise/orientation stuff goin' on there.

trim saddle May 17, 2024, 3:34 PM

#

analog bobcat Can someone help I’m having problem with my visual studio code My visual studio ...

Do you have the right python interpreter selected, where numpy is installed?

agile cobalt May 17, 2024, 3:41 PM

#

heady spoke hello, how is everyone? I have a quick question about implementing GPT into a py...

OpenAI API free trial credits expire a few months after you create your API Key iirc, have you created it a while ago?

heady spoke May 17, 2024, 3:44 PM

#

yeah, I guess I did a couple of months ago

agile cobalt May 17, 2024, 3:44 PM

#

you could try using Google's Gemini instead, it has a fairly generous free tier, though is not as good

heady spoke May 17, 2024, 3:44 PM

#

got it

#

thank you!

analog bobcat May 17, 2024, 3:59 PM

#

trim saddle Do you have the right python interpreter selected, where numpy is installed?

I have python 3.12 interpreter selected on my vscode but haven’t downloaded numpy i thought it comes with the python interpreter

agile cobalt May 17, 2024, 4:01 PM

#

analog bobcat I have python 3.12 interpreter selected on my vscode but haven’t downloaded nump...

nope, you have to pip install it - it is not part of the standard library.

analog bobcat May 17, 2024, 4:06 PM

#

agile cobalt nope, you have to pip install it - it is not part of the standard library.

How do i go by it 🙏🙏

agile cobalt May 17, 2024, 4:09 PM

#

assuming you have selected the correct interpreter, pip install numpy

tiny venture May 17, 2024, 4:38 PM

#

Hi, can anyone please help with this. I believe there's something very simple here which I am missing. Thanks!

#

Just trying to make sure that the age input is within the given range and that it is an integer but it doesn't seem to be working

spring field May 17, 2024, 4:42 PM

#

tiny venture Hi, can anyone please help with this. I believe there's something very simple he...

input always returns a string

tiny venture May 17, 2024, 4:42 PM

#

spring field `input` always returns a `str`ing

I thought I fixed that by doing int(age)

#

But maybe not

spring field May 17, 2024, 4:44 PM

#

what do you think isinstance does?

#

redirect for further inquiries as you're being helped in pygen already #python-discussion message

lapis sequoia May 17, 2024, 5:00 PM

#

i was wrong

#

scipy conjugate gradient is better

#

its just using finite differences to calculate the gradient and i thought thats part of conjugate gradients

river cape May 17, 2024, 5:49 PM

#

Guys what do you mean by a pipeline?

past meteor May 17, 2024, 5:51 PM

#

river cape Guys what do you mean by a pipeline?

It can mean several different things. It can mean like an sklearn pipeline which is encapsulating a model with its preprocessing or it could mean something like data processing pipelines

wooden sail May 17, 2024, 5:52 PM

#

lapis sequoia its just using finite differences to calculate the gradient and i thought thats ...

you should be able to specify an analytic gradient too, but yeah, by default most optimizers will use stochastic approximation with finite differences, and then use the gradient to approximate the hessian if 2nd order

vast goblet May 17, 2024, 6:22 PM

#

I have 2 YoE currently working as a data scientist, I can't decide whether I pursue a Master's degree in DS. I've read a lot of blogs but still confused, I'd love if someone gives me a hint or something?

river cape May 17, 2024, 6:28 PM

#

past meteor It can mean several different things. It can mean like an sklearn pipeline which...

What about an nlp pipeline?

past meteor May 17, 2024, 6:29 PM

#

river cape What about an nlp pipeline?

Same as the sklearn, preprocessing + predictions in one. Look at spacy for an example.

river cape May 17, 2024, 8:42 PM

#

#

I cant install a package in my virtual env any idea as to why is that?

spring field May 17, 2024, 8:58 PM

#

pithink interesting, but did you try what the error message suggested? .venv/bin/pip install textblob

hollow escarp May 17, 2024, 11:00 PM

#

How can i speed up my object detection solution/model?

#

Im currently getting 11FPS on my computer and i'm looking for ways to getting more because thats unfortunetly not enough

#

I need to deploy that object detection code to raspberry pi thats why im using onnxruntime for that task

#

Any ideas?

#

Also idk if thats even possible to use GPU on raspberry pi so im currently using my CPU for such tasks

spring field May 17, 2024, 11:11 PM

#

that might be part of the issue

#

actually I was coming here to ask, cuz I just read about how thunderbolt lets you connect to an external GPU
does it mean, I could buy a CUDA-supported GPU and get models running on that through my laptop?

long canopy May 17, 2024, 11:57 PM

#

what stack are you guys using for RAG?

#

llamaindex?

lost beacon May 18, 2024, 2:09 AM

#

Hi everyone good morning from mumbai. Can anyone in the chat recommend me good sources like YouTube channels and video links or PDFs to learn how to build an LLM?

wooden sail May 18, 2024, 3:34 AM

#

river cape

you might have python aliased in your .bashrc, so the python from the venv is not being called

bold snow May 18, 2024, 4:49 AM

#

lost beacon Hi everyone good morning from mumbai. Can anyone in the chat recommend me good s...

huggingface

lost beacon May 18, 2024, 4:53 AM

#

Hugging face ??

#

Is it a YouTube channel ?

chrome salmon May 18, 2024, 8:53 AM

#

Is data science more programming or math? In reference to ML clearly being more math and theory, would data science also be like that? Or is it about the same amount of math used in software/website development (definitely not)

twin garden May 18, 2024, 8:54 AM

#

any good IDE for SQL?

lapis sequoia May 18, 2024, 9:00 AM

#

chrome salmon Is data science more programming or math? In reference to ML clearly being more ...

you can do it with a lot of math or with barely any math

chrome salmon May 18, 2024, 9:01 AM

#

data science?

lapis sequoia May 18, 2024, 9:03 AM

#

chrome salmon data science?

yes

#

you can derive theoretical proofs of convergence of various data science algorithms, or you can just tinker around

chrome salmon May 18, 2024, 9:04 AM

#

Is doing it with barely any math really data science at all? Or is it like majority of YT ML courses which use pre-trained models and claim it to be ML

lapis sequoia May 18, 2024, 9:05 AM

#

chrome salmon Is doing it with barely any math really data science at all? Or is it like major...

I'd say it is, automl often wins data science competitions

#

but if no one does math there will be less progress in the field

hollow escarp May 18, 2024, 9:13 AM

#

hollow escarp How can i speed up my object detection solution/model?

So If there is anyone with some knowladge about it would be cool if you give me some tips how to do it

spring field May 18, 2024, 9:31 AM

#

Anyone got experience with external GPUs? I just yesterday came across this article mentioning how Thunderbolt 3 sort of had support for that. Would that be an okay alternative to like having to buy/build an entire PC if I could hypothetically use it through my laptop. Which I'm realising doesn't have a Thunderbolt connection... Can regular USB ports be used for this? say USB 3 (some gen). Idea being that I could practice ML locally much easier. Is it worth the cost anyway? Currently the cheapest option I have found is like $0.55 an hour and sometimes there are free options as well, they are like A4000, P5000, RTX4000 GPUs which I assume are more powerful than anything I'd be willing or able to buy anyway. So yeah, thoughts?

chrome salmon May 18, 2024, 9:32 AM

#

chrome salmon Is data science more programming or math? In reference to ML clearly being more ...

Okay so, It seems to be majorly statistics

#

I need to pick something for my integrated MSc if I do take Sc, and data science isnt looking so bad

devout sail May 18, 2024, 9:44 AM

#

spring field Anyone got experience with external GPUs? I just yesterday came across this arti...

This page has some info, but I'm guessing they also sell this stuff and you might want a more technical review. Don't have personal experience with it.

Have you checked Google colab prices?

Everything You Need To Know About External GPU

#

You can also fully go the VM route, on Azure or somewhere else. If you pick a spot VM with a less popular GPU you can probably get a decent price

spring field May 18, 2024, 9:54 AM

#

I checked Colab plans first, yes, they used some mysterious compute units, so I had no clue how much computing I'd actually get, I read on reddit that it wasn't that much and that it was just an opaque pricing model, it also seemed to be a limited amount of those units a month, soo, that's even worse. So I found paperspace which apparently was recently acquired by DigitalOcean and I already use DO for hosting, so they seemed like a good choice and their pricing is transparent, you know how much you're gonna pay per hour and there is no monthly limit on how many hours you can run their containers per month afaik. So yeah, that's what I use rn, they also occasionally have some free containers available for running, so that's also great.

devout sail May 18, 2024, 9:59 AM

#

Hmm yeah it looks decent. If you go for a spot VM someplace else you might be able to save some money if you're willing to let the VM go down occasionally

#

Unless paperspace also has spot

#

Although depanding on the price of the GPU and how much usage you're getting out of it it might make sense to buy one

#

Anyway to your question external GPUs might be ok but anything that isn't physically connected to your computer will have latency

past meteor May 18, 2024, 10:04 AM

#

Google Colab is decent. There's a GPU that costs 1.8 compute units per hour and 100 of them is €11

past meteor May 18, 2024, 10:04 AM

#

spring field Anyone got experience with external GPUs? I just yesterday came across this arti...

But, this is a question for @final kiln

spring field May 18, 2024, 10:05 AM

#

That's what I'm considering yes, but at this point I'd probably have to buy/build a PC for that, so, not fun in terms of money 😁
I'm not using them thaaaat much right now and as I said, occasionally they are available for free for some time.

spring field May 18, 2024, 10:06 AM

#

devout sail Anyway to your question external GPUs might be ok but anything that isn't physic...

makes sense yes, but I'm also realising I probably don't have a computer at this time that would even support it

#

What are you using?

twin garden May 18, 2024, 10:08 AM

#

Anyone... does Excel is also needed to become data scientist/analyst?

spring field May 18, 2024, 10:08 AM

#

yeah, that's what I'm doing right now and yes, it's in bursts at least for now

past meteor May 18, 2024, 10:09 AM

#

spring field That's what I'm considering yes, but at this point I'd probably have to buy/buil...

Honestly, unless you need it for gaming I wouldn't invest in getting a big GPU

#

You're better off with a desktop because laptops get chunky really fast. You can SSH into your desktop anyway if it's for model training.

#

Source: someone with a laptop with a 3070 card

spring field May 18, 2024, 10:11 AM

#

alright, in that case I'll get a new laptop cuz my current one's display broke (second time a display has gone out for a laptop of mine), and I kinda want a bit of mobility

#

(I could also try replacing the display, but... idk)

#

and nvm, didn't read zestar's message...

past meteor May 18, 2024, 10:12 AM

#

spring field alright, in that case I'll get a new laptop cuz my current one's display broke (...

Honestly if I could go back I'd get an M2/M3

spring field May 18, 2024, 10:13 AM

#

past meteor You're better off with a desktop because laptops get chunky really fast. You can...

yeah, that was my exact thought process as well, I could get a PC and use my current laptop to SSH into it and such, but yeah... laptop's display broke and such... not fun

wooden sail May 18, 2024, 10:15 AM

#

for the price of a framework motherboard (if you wanna change cpu) you can get an elitebook or a thinkpad

#

those also have all user-replaceable components

spring field May 18, 2024, 10:15 AM

#

Alright, thanks y'all for the advice, appreciate it heartowo

wooden sail May 18, 2024, 10:18 AM

#

u can't with framework either other than buying a new gpu module or a new mobo

#

both as expensive as a whole new laptop

spring field May 18, 2024, 10:18 AM

#

twin garden Anyone... does Excel is also needed to become data scientist/analyst?

Needed? Probably not. Is it used in the field? Probably yes. It's not that difficult to do the basic stuffs with it though and it's somewhat intuitive, especially if you will come from a programming background IMO, so yeah, I wouldn't worry about it too much maybe, but I'll let others with more experience elaborate.

Also Excel added (limited) TypeScript and Python support, so yeah, that's a thing as well.

past meteor May 18, 2024, 10:19 AM

#

They rebranded a lot of jobs that don't have an iota of science to data scientist and for some of those you need Excel 🥴

wooden sail May 18, 2024, 10:31 AM

#

ok this is cheaper than i remembered, but by no means cheap still

#

here's the mobos too

#

only the old ones are reasonable~ish

#

the new ones are on the same scale as a thonkpad

#

i agree with their philosophy, but you pay a premium for it

#

at any rate, even if you had a desktop computer, you would never want to pay for an A100 or H100. i agree with you and zestar that using an online service is best

#

or if you're lucky, your company/university has specialized compute hardware for you to use

#

that'd require laptop cpu makers to change how they make cpus

#

not something framework can tackle alone

#

yes lmao

#

yeah but you have no way of dissipating that much heat in a laptop chassis. what does exist is "DTR" (desktop replacement)"mobile computers". they are massive because they bring a desktop cpu, which also means you now need massive cooling

#

(gpus on laptops are also different from desktop ones, to the extent that 4050 and 4090 are almost the same thing on mobile)

#

that's something to take up with intel, amd, and nvidia, not with framework

#

essentially

#

if you're not playing, just forget about gpu and dock your laptop into 2~3 monitors and nice peripherals

#

i've derailed us way off topic from DS and AI by now, but yeah, tangentially related since it means you can never do big scale AI tasks on a laptop, and hardly on a desktop as well

spring field May 18, 2024, 10:58 AM

#

keep us posted, please

steady yacht May 18, 2024, 11:53 AM

#

Let P be a language of polynomial definitions
p(x) = a0x^0 + · · · + anx^n
I have 2 requirements I am sort of struggling with rn, I am not really sure if I have solved them.
If the exponent for a variable is 0 the variable is allowed to be completely omitted and if the exponenet is 1 then the variable is allowed to be written without an exponent.
Here is my take on it:

<T> -> <Coefficient> <VariablePart> | <Coefficient> 
<VariablePart> -> <Variable> <ExponentPart> | <Variable>
<ExponentPart> -> ^ <Power> | “” 

<Coefficient> -> <CoeffTerm> + <Coefficient> | <CoeffTerm>
<CoeffTerm> -> <polVariable> × <Coefficient> | <polVariable>
<polVariable> -> [a-w][yz] | <”(“Coefficient”)”>

<Power> -> [2-9] <[0-9]>* ```

pine peak May 18, 2024, 3:31 PM

#

hello everyone,
needed some guidance in getting started with learning more on Data Science and AI, I'm 17 just finished high school and i'll be joining college studying for a degree on AIML in roughly 3 months and i'll be trying to use that free time in just knowing the basics and getting things brushed.
I'm done with the basics of python but confused on what to get started with next.
would be great if someone would be able to guide me via DM for the next 3 months, just need help with knowing what to do next not doubts.

serene scaffold May 18, 2024, 3:33 PM

#

@pine peak what is the name of the degree program? Computer science, or artificial intelligence?

pine peak May 18, 2024, 3:37 PM

#

serene scaffold <@881733550825672715> what is the name of the degree program? Computer science, ...

i actually have a choice its either,
Artificial intelligence and data science
or
Bachelore in technology with specialization in AI and machine learning

#

depends on what college i get into really (results come out in 2 days)

serene scaffold May 18, 2024, 3:38 PM

#

pine peak i actually have a choice its either, Artificial intelligence and data science o...

The one that has more theoretical math requirements is probably "better"

pine peak May 18, 2024, 3:39 PM

#

serene scaffold The one that has more theoretical math requirements is probably "better"

that would be first one, the second degree is more of using the applications of AI and machine learning

serene scaffold May 18, 2024, 3:40 PM

#

pine peak that would be first one, the second degree is more of using the applications of ...

What you'd learn in the second one would become outdated more quickly

pine peak May 18, 2024, 3:40 PM

#

makes sense

#

i'm done with python (basics) for now what do you think i should go with next?

craggy patio May 18, 2024, 3:54 PM

#

Can somebody please explain why my graph looks like this?

#

In theory shouldent the F1 score have a positive relationship with confidence?

#

Or does the confidence variable just mean "don't accept predictions with confidence ratings less than x"

sand herald May 18, 2024, 3:58 PM

#

what... is confidence?

craggy patio May 18, 2024, 3:59 PM

#

Chance the AI thinks the classification it gave the object is correct

#

basically "how confident" it is that it's correct

sand herald May 18, 2024, 4:05 PM

#

hmm

#

i struggle to see why the two should have a linear r/s**

#

ur F1 is precision + recall, right?

#

precision = TP / (TP+FP), recall = TP/(TP+FN) if i rmb correctly, but the confidence is basically saying how 'sure' the model is in its prediction, which could be either 1 (TP) or 0 (TN), right? (the numerators are both TP, there's no TN in either the numerator nor denominator of both precision & recall)

perhaps another way to say this would be, high confidence doesn't mean high TP. it actually means high TP and TN. whereas f1-score/precision/recall are more focused on TP and not TN?

so if u had another 'score' that measures how sure the model is in its prediction that the target is 1 (positive), then yes perhaps that 'score' (whatever its called) might show a positive r/s with f1-score

sand herald May 18, 2024, 4:35 PM

#

also, i guess these are with the assumptions that your model is actually tuned properly and having good evaluation scores -> otherwise if your model is tuned poorly it'll prolly throw out high confidence scores for predictions that turn out to be mostly wrong?

molten elk May 18, 2024, 5:09 PM

#

Does anyone here have any idea what this means? How do they get the values of p and q here

tidal bough May 18, 2024, 5:33 PM

#

molten elk Does anyone here have any idea what this means? How do they get the values of `p...

It reads to me that they are plotted on Figure 3-4 which you didn't post.

wooden sail May 18, 2024, 7:14 PM

#

looks like p and q are given to you, not something to be computed. it's just an illustrative example. as reptile says, the vectors should be in the figure. that being said, you are given T and you are also given the coordinates of p and q in the basis S (the canonical basis), so you also know what p and q are in that basis

wooden sail May 18, 2024, 7:15 PM

#

molten elk Does anyone here have any idea what this means? How do they get the values of `p...

the basis S is just the identity matrix, so p and q are exactly the vectors mentioned there: p = [3, 1] and q = [-6, 2]

leaden rock May 18, 2024, 7:16 PM

#

What's up guys

#

I just made my first AI with the digit classification MNIST dataset. How far behind am I?

serene scaffold May 18, 2024, 8:12 PM

#

leaden rock I just made my first AI with the digit classification MNIST dataset. How far beh...

That's a good first project.

hollow escarp May 18, 2024, 9:26 PM

#

Him im running my onxx model using onnxruntime on my CPU and im wondering how can i speed it up

#

With out using GPU

#

Also thats object detection model, and dynamic quantization seams to not working, it gives me worse results. Before that i had like 11 FPS and after, something like 7FPS

warm trellis May 18, 2024, 9:30 PM

#

Hey guys how can someone find teams to work in for the kaggle competitions?

agile cobalt May 18, 2024, 9:31 PM

#

warm trellis Hey guys how can someone find teams to work in for the kaggle competitions?

maybe try Forums/Discussions

warm trellis May 18, 2024, 9:36 PM

#

agile cobalt maybe try Forums/Discussions

apprechiate it. Do you know any active/popular forums? (i can google though peer advice is more valuable)

agile cobalt May 18, 2024, 9:39 PM

#

I meant mostly Kaggle's own, e.g.

https://www.kaggle.com/discussions
https://www.kaggle.com/competitions/ai-mathematical-olympiad-prize/discussion
but looks like they have a feature for that?
https://www.kaggle.com/discussions/product-feedback/341195

[Product Update] Improvements to the Competition Team Tab and Findi...

[Product Update] Improvements to the Competition Team Tab and Finding Teammates.

haughty silo May 19, 2024, 1:56 AM

#

Hello guys, how's going?

sand herald May 19, 2024, 2:17 AM

#

I recently acquired a very powerful GPU but my training times only went down by half compared to training locally on my laptop. im using pytorch lightning and i see that GPU usage is very low (< 5%) and 100% cpu usage. training till early stopping still takes about 1.5 hours per set of hyperparams im testing which is a little too slow for me.

can i have a bit of intuition/suggestions as to what might be the potential issues? thanks!

edit: im training a bi-directional LSTM - does anyone have insights on if this model is not as efficient on a GPU (i've heard some comments about this) - dataset is relatively small tho

serene scaffold May 19, 2024, 3:11 AM

#

sand herald I recently acquired a very powerful GPU but my training times only went down by ...

It might help to put the code in the pastebin and indicate which parts you changed after migrating to the better GPU

#

!paste

arctic wedgeBOT May 19, 2024, 3:11 AM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

lapis sequoia May 19, 2024, 3:50 AM

#

It's like integrating y-axis to x-axis to find the median

sand herald May 19, 2024, 3:58 AM

#

serene scaffold It might help to put the code in the pastebin and indicate which parts you chang...

i dont have access to my code right now but from what i can recall, i changed the accelerator='gpu' and devices=[1] (we have multiple GPUs) in the Trainer object of pytorch lightning. also some misc stuff such as calling .numpy() on the Tensor objects but i think that's mostly it.

i kinda thought one of the points about using lightning was the easy transition from cpu to gpu :/

#

ill paste my code when i have access. but honestly it's... super long

burnt pond May 19, 2024, 6:33 AM

#

what kind of maths like which topics in maths are required for datascience and like ml and ai

gritty vessel May 19, 2024, 7:58 AM

#

I'm trying to predict the lightning events

#

Using cnn

#

But my model is not able to learn lightning events instead it's giving high accuracy in predicting non lightning events

#

I can share the whole code like I did to prepare feature data and target data but I'm pretty sure I have to explain it first as it is little messy

#

Lightning that happens in cloud

#

Data consists of 6 variables Tir1, tir2 ,swir, vis, latitude and longitude and in target I have lighting occurred or not at that point

#

Coordinate where lightning will occur

#

Wait I m writing

#

Each variable is a 2d array of 1536,1392

#

So images yeah

#

6 images as features and in output one image

#

Sure

#

I am following unet arch

#

Yup

#

Oh sorry Auto correct

#

Model architecture is like this. Input = 1536,1392,6
32 conv2d
Maxpool
64 conv2d
Maxpool
128 conv2d
Maxpool
256 conv2d
Upscale
128 conv2d
Upscale
64 conv2d
Upscale
32 conv2d
1 output = 1536,1392,1

#

Arch is like this

#

The basic one only

#

I curated data myself

#

I am following a paper

burnt pond May 19, 2024, 8:13 AM

#

Kk

gritty vessel May 19, 2024, 8:13 AM

#

But there is little bit diff

#

Something like this only I was following I am following that lightning cast one

#

ProbSevere LightningCast: A deep-learning model for satellite-based lightning nowcasting

#

Oh I can't share pdf here

#

The problem is lightning events are less than 1%

#

Non lightning events are 99.9%

#

Of data

#

Can that be the problem ?

#

The paper I am following?

past meteor May 19, 2024, 8:22 AM

#

@wooden sail how do you feel about Einops at a glance? https://github.com/arogozhnikov/einops

gritty vessel May 19, 2024, 8:22 AM

#

i saw there code they didnt do anything

past meteor May 19, 2024, 8:22 AM

#

output_tensor = rearrange(input_tensor, 't b c -> b c t') is interesting

gritty vessel May 19, 2024, 8:23 AM

#

wait let me share the link

#

https://colab.research.google.com/drive/1PKTltCfGqE_UjeRRfA1k0vxztiYPfpsg?usp=sharing

Google Colab

wooden sail May 19, 2024, 8:25 AM

#

past meteor <@467435887236612106> how do you feel about Einops at a glance? <https://github....

i love it, though at face value, rearrange is not needed

gritty vessel May 19, 2024, 8:26 AM

#

yeah touching it would be actual is a bad idea

past meteor May 19, 2024, 8:26 AM

#

wooden sail i love it, though at face value, rearrange is not needed

Compared to just transposing?

gritty vessel May 19, 2024, 8:26 AM

#

as in real case scenerio we will have a faulty model

wooden sail May 19, 2024, 8:26 AM

#

technically transposing is also never needed

#

the whole point of einstein notation is that reshaping is not necessary

past meteor May 19, 2024, 8:26 AM

#

Now I'm curious

#

I see what you mean

wooden sail May 19, 2024, 8:26 AM

#

just careful treatment of the indices suffices to define any contraction

past meteor May 19, 2024, 8:27 AM

#

You can express the susbequent expressions in einstein notation on the original tensor?

gritty vessel May 19, 2024, 8:27 AM

#

they have just gridden the data like data is in different scales

past meteor May 19, 2024, 8:27 AM

#

Is that what you're getting at?

gritty vessel May 19, 2024, 8:27 AM

#

so to bring dat to similar resolution they did those steps

wooden sail May 19, 2024, 8:27 AM

#

the bigger problem is that all of the modules they list compatibility with already have einsum

#

they probably just wrap the underlying einsum implementation (is my guess)

past meteor May 19, 2024, 8:27 AM

#

I feel like this is what I needed all along because I'm going around commenting tensor shapes like a madman

wooden sail May 19, 2024, 8:27 AM

#

you should do that anyway imo

past meteor May 19, 2024, 8:28 AM

#

wooden sail you should do that anyway imo

That's fair, but I suppose with this the intent matches the output more

wooden sail May 19, 2024, 8:29 AM

#

how do you mean?

past meteor May 19, 2024, 8:29 AM

#

To use programmer buzzwords: this looks self-documenting

gritty vessel May 19, 2024, 8:29 AM

#

generally where i was working they didnt talked about testing the model they talked about validation more

past meteor May 19, 2024, 8:29 AM

#

while with Torch I do a bunch of operations and really have to comment why to save the reader (myself in a week) the trouble of trying to figure it out

gritty vessel May 19, 2024, 8:30 AM

#

so it is possible from validation they mean testing

#

nup they have test on new data

past meteor May 19, 2024, 8:31 AM

#

Anyhow, I'll try it out. Or do you recommend just using einsum itself in Jax, Torch and numpy?

wooden sail May 19, 2024, 8:32 AM

#

past meteor To use programmer buzzwords: this looks self-documenting

i have some amount of beef with this, in the direction of turing machines being able to represent the same thing as lambda calculus but in different syntax. the problem comes from using a representation of the math that is just not really suitable for "self documentation"

#

and even that aside, in papers using only math and text, you still reiterate the dimensions whenever confusion might arise. i wouldn't expect imperative code to be able to circumvent this

gritty vessel May 19, 2024, 8:33 AM

#

so from 1536,1392,1 i.e 2138112 total events only 30k are lightning events

wooden sail May 19, 2024, 8:33 AM

#

past meteor Anyhow, I'll try it out. Or do you recommend just using `einsum` itself in Jax, ...

i would suggest just using einsum from the respective module unless you plan on making your pipelines uniform by always using einops

gritty vessel May 19, 2024, 8:33 AM

#

gritty vessel so from 1536,1392,1 i.e 2138112 total events only 30k are lightning events

in total then when we divide it based on time it gets even lesser

past meteor May 19, 2024, 8:36 AM

#

wooden sail i would suggest just using einsum from the respective module unless you plan on ...

I'll start from there then yes

gritty vessel May 19, 2024, 8:37 AM

#

i did but model than started predciting lightning values in excess

#

then i talked here and they told that making changes in data is not advisable as in real life scenerio this will be the case that lightning is less

#

would you like to see the code and data ? i can stream itand explain what i did

#

till now

#

alright in voice caht 1

#

damn i dont have streaming perms

#

of model ?

#

this is X_data

#

https://paste.pythondiscord.com/UIAQ

#

i didnt used exactly same arch they used used litte simpler one

#

still that model is also pretty complex

#

my point is it should predict atleast something why only predcit non lightning events

#

https://paste.pythondiscord.com/BNPQ

#

this is y_data

#

model

#

https://paste.pythondiscord.com/V52A

#

one more thing should i feed latitude and logitude as features or not?

#

yeah just number of layers is less

#

like they used 2-3 32 layers than 2-3 64 layers

#

do they?

#

they have 4 features i believe

#

didnt made any instead i i plotted it on map

#

and compared it with original

#

some of predicted looked pretty similar

#

Epoch 1/10 22/22 [==============================] - 245s 11s/step - loss: 23.8827 - accuracy: 0.9090 Epoch 2/10 22/22 [==============================] - 240s 11s/step - loss: 0.1823 - accuracy: 0.9974 Epoch 3/10 22/22 [==============================] - 237s 11s/step - loss: 0.1417 - accuracy: 0.9891 Epoch 4/10 22/22 [==============================] - 233s 11s/step - loss: 0.1220 - accuracy: 0.9966 Epoch 5/10 22/22 [==============================] - 246s 11s/step - loss: 0.1100 - accuracy: 0.9995 Epoch 6/10 22/22 [==============================] - 245s 11s/step - loss: 0.0749 - accuracy: 0.9989 Epoch 7/10 22/22 [==============================] - 240s 11s/step - loss: 0.0743 - accuracy: 0.9995 Epoch 8/10 22/22 [==============================] - 237s 11s/step - loss: 0.0447 - accuracy: 0.9991 Epoch 9/10 22/22 [==============================] - 244s 11s/step - loss: 0.0458 - accuracy: 0.9990 Epoch 10/10 22/22 [==============================] - 235s 11s/step - loss: 0.0425 - accuracy: 0.9991

#

it is going down

#

that i dont have currently

#

its not overfitting i checked it

#

in my lab i did plotted loss val loss

#

but when i completed the work they didnt allowed me to take the work outside the lab

#

can i omehow make this run on my system?

#

i have 16gb ram and rtx 3050 laptop gpu it crashes down everytime

#

i try to run

#

yeah i need to create a loss func that monitors how good it predicts the lightning events

#

now i just have to figure out how can i run this on my laptop lol

#

ok

#

that is because it is predicting non lightning cases accurately \

#

alright i will check it

#

that might be the case

#

ok

native narwhal May 19, 2024, 9:29 AM

#

how to segregate images based on their features?

#

I want to segregate all the shapes with curved edges in them and segregate them into the circular category, there are 3 in total, circular, intersecting and overlapping

hollow escarp May 19, 2024, 11:16 AM

#

Hi, im using easyocr as my main OCR and im running it on device with out GPU and im wondering if it's possible to install only libs needed for CPU

left tartan May 19, 2024, 11:49 AM

#

hollow escarp Hi, im using easyocr as my main OCR and im running it on device with out GPU and...

It's PyTorch based right? PyTorch has a cpu install option.

hollow escarp May 19, 2024, 11:52 AM

#

left tartan It's PyTorch based right? PyTorch has a cpu install option.

Ye

#

Your right

#

I forgot about It, thx

river cape May 19, 2024, 12:26 PM

#

import string
exclude = string.punctuation
def remove_punc(text):
return text.translate(str.maketrans(' ',' ',exclude))

#

Could anyone explain what does this code do?

#

I got a part of it understood . It is used to replace characters in the string

autumn gull May 19, 2024, 1:03 PM

#

hey i'm new here and i need help regarding a project of a virtual assistant. My problem is im using tokenize,removing stop words and then using stemming on the question, after that these two sentences "What is your name" and "what is my name" is only left with "name" to work upon which then will respond as a single answer for both of the questions . how can i fix that?anyone can help me?

jaunty helm May 19, 2024, 1:07 PM

#

river cape Could anyone explain what does this code do?

str.maketrans(...)
If there are two arguments, they must be strings of equal length, and
in the resulting dictionary, each character in x will be mapped to the
character at the same position in y. If there is a third argument, it
must be a string, whose characters will be mapped to None in the result.
so basically the first 2 are just dummy elements (mapping ' ' to ' ', basically doing nothing), and everything in exclude will be mapped to None... so be excluded from the result

left tartan May 19, 2024, 1:08 PM

#

river cape Could anyone explain what does this code do?

!e ```py
print("abc".translate(str.maketrans('a', 'z')))

arctic wedgeBOT May 19, 2024, 1:08 PM

#

@left tartan :white_check_mark: Your 3.12 eval job has completed with return code 0.

zbc

left tartan May 19, 2024, 1:09 PM

#

autumn gull hey i'm new here and i need help regarding a project of a virtual assistant. My ...

Could you provide more details? Share your code perhaps. A help thread might be better: #❓｜how-to-get-help

jaunty helm May 19, 2024, 1:29 PM

#

playing with this dataset, trying to do multiclass classification on Target(3 possible values, Dropout Enrolled Graduated)
I'm find that models have a hard time discerning Enrolled: (using SVC as example)

-----SVC-----
              precision    recall  f1-score   support

     Dropout       0.84      0.73      0.78       284
    Enrolled       0.52      0.33      0.40       159
    Graduate       0.77      0.94      0.84       442
```the data's imbalanced,

ALL['Target'].value_counts()
Graduate 2209
Dropout 1421
Enrolled 794
```I tried class_weight='balanced' on SVC which improved it a little (f1 of Enrolled sits around 0.47), then I tried down sampling to get

down_sampled = pd.concat([
    ALL.loc[ALL['Target'] == 'Graduate'].iloc[:800], 
    ALL.loc[ALL['Target'] == 'Dropout'].iloc[:800],
    ALL[ALL['Target'] == 'Enrolled'].iloc[:800]])

-----SVC-----
              precision    recall  f1-score   support

     Dropout       0.78      0.67      0.72       200
    Enrolled       0.62      0.66      0.64       199
    Graduate       0.75      0.81      0.78       200
```which (I think) made it overall better, but it's still having a hard time classifying `Enrolled`, so I think the bad performance isn't *only* due to imbalance
are there any other techniques I can use to improve this?

cedar tusk May 19, 2024, 1:32 PM

#

jaunty helm playing with [this](https://www.kaggle.com/datasets/mikhail1681/student-performa...

have you tried using another model type?

#

randomforest or tree based models could work better

jaunty helm May 19, 2024, 1:33 PM

#

cedar tusk have you tried using another model type?

yea, I get a similar trend using trees, I just used SVC as example cause discord character limit

cedar tusk May 19, 2024, 1:33 PM

#

u can just use pastebin

#

to paste longer stuff

jaunty helm May 19, 2024, 1:34 PM

#

jaunty helm yea, I get a similar trend using trees, I just used `SVC` as example cause disco...

e.g. lightgbm
original dataset

-----LGBMClassifier-----
              precision    recall  f1-score   support

     Dropout       0.79      0.74      0.76       284
    Enrolled       0.54      0.38      0.44       159
    Graduate       0.79      0.91      0.85       442

    accuracy                           0.76       885
   macro avg       0.71      0.68      0.68       885
weighted avg       0.75      0.76      0.75       885

after down sample

-----LGBMClassifier-----
              precision    recall  f1-score   support

     Dropout       0.77      0.69      0.73       200
    Enrolled       0.64      0.70      0.67       199
    Graduate       0.80      0.81      0.80       200

    accuracy                           0.73       599
   macro avg       0.74      0.73      0.73       599
weighted avg       0.74      0.73      0.73       599

jaunty helm May 19, 2024, 1:34 PM

#

cedar tusk to paste longer stuff

it's basically the same anyways so I thought it's not important

cedar tusk May 19, 2024, 1:35 PM

#

instead of doing downsampling u can try giving weights to the classes

#

cuz downsampling makes u loose data

#

or u can try deep learning as well

#

just let the neural network figure it out

jaunty helm May 19, 2024, 1:36 PM

#

cedar tusk instead of doing downsampling u can try giving weights to the classes

I'm not sure how to do that manually (manually with reason and not just "this feels ok"), and I already tried SVC(class_weight='balanced') or LGBMClassifier(is_unbalance=True), etc. (basically using the models' parameters that signal the data being imbalanced) and it didn't seem to have too much of an effect

cedar tusk May 19, 2024, 1:37 PM

#

what are the variable names?

jaunty helm May 19, 2024, 1:38 PM

#

cedar tusk what are the variable names?

wdym by that

cedar tusk May 19, 2024, 1:38 PM

#

i am trying to see if there is anything we can do to make data more meaningful

#

cuz this issue needs more context

jaunty helm May 19, 2024, 1:40 PM

#

cedar tusk i am trying to see if there is anything we can do to make data more meaningful

here's the dataset ig
it has 37 columns, and rn I've just been looking around, the SVC and lightgbm result you see above had minimal feature engineering

cedar tusk May 19, 2024, 1:42 PM

#

let me work on this dataset a bit