#data-science-and-ml | Python | Page 166

charred estuary May 16, 2025, 3:30 AM

#

dude

#

read the documentation

#

you need to edit the code

#

as is, it will just make a generic dataset

#

if you are interested just look into the project

limpid dew May 16, 2025, 3:33 AM

#

Sorry, but I had to ask because I didn't think the documentation was clear.

charred estuary May 16, 2025, 3:35 AM

#

limpid dew Sorry, but I had to ask because I didn't think the documentation was clear.

I don't really see how it's unclear. It says:

```
"The Synthetic Conversations dataset is a set made up of inputs and outputs that was completely automated and generated by AI language models. I used AI models such as DeepSeek R1 Llama 70B Distil, Google's Gemini 2.0 Flash, Microsoft's Phi 3, and Qwen3-0.6B."
```
and:

```
DeepSeek R1 Llama 70B Distil
Gemini 2.0 Flash
Phi 4 Reasoning
Qwen3 0.6B

Only the best responses are selected and added to the dataset. This is done by having all of the AI models voting on which output they think is the best without being able to vote for their own output."
```

limpid dew May 16, 2025, 3:39 AM

#

I asked a specific question about generating math data, and you told me to read the docs but the docs don’t explain how to do that. If your project depends on users editing the code to guide the output, that should be clearly explained. Saying “it’s in the README” doesn’t work if it isn’t.

charred estuary May 16, 2025, 3:40 AM

#

^

#

"You can modify the script to ask the cluster to only generate data that will help train an AI on python debugging or math or whatever you want."

limpid dew May 16, 2025, 3:41 AM

#

Right — but saying “you can modify the script” isn’t the same as explaining how to do it. That sentence is a claim, not documentation. If customizing the prompts is essential, the README should walk through it clearly. Otherwise, pointing to it doesn’t help.

charred estuary May 16, 2025, 4:45 AM

#

limpid dew Right — but saying “you can modify the script” isn’t the same as explaining how ...

I don’t know what to tell you man. This is a Python discord server I posted thinking that you would understand basic python. Beyond that this is the AI channel within the server. If you don’t know how to modify a prompt in a script learn how to do that first. As stated in the documentation I used the OpenAI and Google-GenAI SDKs so maybe look into that

rich moth May 16, 2025, 5:13 AM

#

I joined that Kaggle comp for Stanford RNA Folding. I followed their outline but I added the UCF to extract features, and it actually makes the predictions way more accurate by finding hidden patterns in the RNA that regular models miss. The UCF lets us see how "mathematically complex" different parts of the RNA are, which helps guide the 3D folding. Here are some images. Notice how the data points sit in the Complex/Chaotic region: it's telling us RNA has intricate folding patterns. We know that, but the AI does too. And the 3D visuals show the actual predicted structures with each nucleotide color-coded (A=green, U=red, C=blue, G=yellow). Here are a few samples

lapis sequoia May 16, 2025, 7:16 AM

#

Guys I need help with Python Pandas

#

```py
from subprocess import call
import pandas as pd
import time

#func for opening files on command
def openfile(x:str):
    call(["python", x])


#setting up the dataframe and email variables
df = pd.read_csv("CSV_Files/logindata.csv")
df.set_index('Email',inplace=True)
emp_email_end = "@can.emp"
adm_email_end = "@can.adm"


#signup and login
print("***** Welcome To  Login Page *****")
choice_signin = input("Would you like to login or signup?: ")

if choice_signin == "login":
    mail = input("Enter your email:- ")
    pwd = input("Enter your password: ")

    act_pwd = str(df.loc[mail][0])

    if pwd == act_pwd:
        if str(mail).endswith(emp_email_end):
            print("Welcome Employee!")
            time.sleep(3)
            openfile("Python_Code/employee.py")
        elif str(mail).endswith(adm_email_end):
            print("Welcome, Admin")
            time.sleep(3)
            openfile("Python_Code/admin.py")
        else:
            print("Welcome To The Canopy!!")
            time.sleep(3)
            openfile("Python_Code/customer.py")

    elif pwd != act_pwd:
        print("Password is incorrect")   

elif choice_signin == "signup":
    new_mail = input("Enter Your Email ID:- ")
    new_pwd =  input("Enter Your Password:- ")

    df.loc[new_mail] = [new_pwd]
    
    if str(new_mail).endswith(emp_email_end):
        print("You cannot register with the company email!")
    elif str(new_mail).endswith(adm_email_end):
        print("You cannot register with the company email!")
    else:
        print("Welcome To The Canopy!!")
        time.sleep(3)
        openfile("Python_Code/customer.py")



#updating the csv file
df.to_csv("CSV_Files/logindata.csv")```

#

I'm getting this exception when I use the login choice as ''login''

#

FutureWarning: Series.__getitem__ treating keys as positions is deprecated. In a future version, integer keys will always be treated as labels (consistent with DataFrame behavior). To access a value by position, use `ser.iloc[pos]`
  act_pwd = str(df.loc[mail][0])

#

For reference, I set the index of the csv file to the email names

#

not an integer index

#

I want to get rid of the exception, since I need to show this as a school project, and I'm trying to use the try/except method, but it's not working

#

Please help

final jolt May 16, 2025, 2:09 PM

#

lapis sequoia I want to get rid of the exception, since need to show this as a school project,...

Not entirely sure about your question, since you say you are trying to use the try/except method but I don't see a try anywhere in that code. Can you clarify what you mean by "I'm trying to use the try, except method, but it's not working"? It looks to me like it is interpreting your column name as something other than a string. Are you sure [mail][0] is actually coming back as a single column label string?

lapis sequoia May 16, 2025, 4:02 PM

#

final jolt not entirely sure on your question since you say you are trying to use the try/e...

```py
    mail = input("Enter your email:- ")
    pwd = input("Enter your password:- ")

    act_pwd = str(df.loc[str(mail)][0])

    try:
        act_pwd = str(df.loc[str(mail)][0])
    except FutureWarning:
        print("test")

    if pwd == act_pwd:
        if str(mail).endswith(emp_email_end):
            print("Welcome Employee!")
            time.sleep(3)
            openfile("Python_Code/employee.py")
        elif str(mail).endswith(adm_email_end):
            print("Welcome, Admin")
            time.sleep(3)
            openfile("Python_Code/admin.py")
        else:
            print("Welcome To The Canopy!!")
            time.sleep(3)
            openfile("Python_Code/customer.py")

    elif pwd != act_pwd:
        print("Password is incorrect")```

#

here

#

This isn't working either. I'm still getting the same exception, but twice now

#

FutureWarning: Series.__getitem__ treating keys as positions is deprecated. In a future version, integer keys will always be treated as labels (consistent with DataFrame behavior). To access a value by position, use `ser.iloc[pos]`
  act_pwd = str(df.loc[str(mail)][0])
/home/jon/Desktop/IP_Project_Hotel-Management/Python_Code/logins.py:30: FutureWarning: Series.__getitem__ treating keys as positions is deprecated. In a future version, integer keys will always be treated as labels (consistent with DataFrame behavior). To access a value by position, use `ser.iloc[pos]`
  act_pwd = str(df.loc[str(mail)][0])
Welcome To The Canopy!!

lapis sequoia May 16, 2025, 4:13 PM

#

final jolt not entirely sure on your question since you say you are trying to use the try/e...

and yes, the code is working as it's supposed to, it's only the warning I want to get rid of

#

How do I do that, using try and except, or sys module?

#

Or any other fix that you know of

final jolt May 16, 2025, 5:47 PM

#

lapis sequoia and yes, the code is working as it's supposed to, it's only the warning I want t...

So my assumption is still the same in that the argument you are passing to df.loc() is in fact not seen as a string but an integer or otherwise. you could try setting the value outside of the function like

```py
mail_id = str(mail)[0]
act_pwd = str(df.loc[mail_id])```
or try using the df.iloc like the message suggests.
Also, the reason your try/except technically did not work is that you are calling the same function outside of the try statement (5th line), which is what actually throws the exception.

lapis sequoia May 16, 2025, 5:49 PM

#

final jolt So my assumption is still the same in that the argument you are passing to df.lo...

What should I do to fix the try thing

final jolt May 16, 2025, 5:50 PM

#

try changing this:

```py
    act_pwd = str(df.loc[str(mail)][0])

    try:
        act_pwd = str(df.loc[str(mail)][0])
    except FutureWarning:
        print("test")```
to this:
```py
    mail_id = str(mail)[0]

    try:
        act_pwd = str(df.loc[mail_id])
    except FutureWarning:
        print("test")```

lapis sequoia May 16, 2025, 5:57 PM

#

final jolt try changing this: ```py act_pwd = str(df.loc[str(mail)][0]) try: ...

not working lol

final jolt May 16, 2025, 5:59 PM

#

same error or doesnt work at all?

lapis sequoia May 16, 2025, 6:30 PM

#

same error
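For reference, a minimal sketch of one way to make the warning go away at its source. Two points it relies on: `df.loc[mail]` returns a single row as a Series, and `[0]` on that Series is the positional lookup the message is complaining about, so spelling it `.iloc[0]` (or reading the cell by label with `df.at`) avoids it. A `FutureWarning` is also not raised as an exception by default, so `except FutureWarning:` never runs; `try`/`except` fits better for an unknown email, which raises `KeyError`. The sample rows, the column name `Password`, and the helper `lookup_password` are assumptions made for illustration, since the real CSV layout is not shown.

```python
import pandas as pd

# Hypothetical stand-in for CSV_Files/logindata.csv; the real column names are not shown in the thread.
df = pd.DataFrame(
    {
        "Email": ["a@example.com", "b@can.emp"],
        "Password": ["hunter2", "s3cret"],
    }
).set_index("Email")


def lookup_password(frame, mail):
    """Return the stored password for `mail`, or None if the email is not registered."""
    try:
        # .loc selects the row by label; .iloc[0] takes its first column explicitly by position.
        return str(frame.loc[mail].iloc[0])
    except KeyError:
        # Raised when the email is not in the index.
        return None


print(lookup_password(df, "a@example.com"))
print(lookup_password(df, "nobody@example.com"))

# Label-based alternative when the column name is known (assumed to be "Password" here).
print(df.at["b@can.emp", "Password"])
```

The `warnings` module in the standard library can silence the message instead, but changing the lookup keeps the code working once pandas drops the old behaviour.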

tight dune May 16, 2025, 8:19 PM

#

Newbie here. Does anyone know how to delineate echogram data (draw a red line on it)? x = depth, y = time

#

https://www.kaggle.com/code/robikscube/all-python-data-visualization-libraries-in-2022/notebook#Seaborn After seeing these, I don't think any of them fits, because I can't find a way to add the red line.
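A rough sketch of how a red line could be overlaid on an echogram with plain matplotlib, assuming the echogram is a 2D array with one row per time step and one column per depth bin (matching x = depth, y = time above). The synthetic data, and the choice to delineate by taking the strongest return in each row, are assumptions for illustration; real data would likely need a more careful detection step.

```python
import matplotlib

matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display

import matplotlib.pyplot as plt
import numpy as np

# Synthetic echogram (assumption): rows = time steps, columns = depth bins.
rng = np.random.default_rng(0)
n_time, n_depth = 200, 120
echo = rng.normal(0.0, 1.0, size=(n_time, n_depth))

# Plant a strong echo along a wavy depth profile so there is something to trace.
true_line = (60 + 20 * np.sin(np.linspace(0, 3 * np.pi, n_time))).astype(int)
echo[np.arange(n_time), true_line] += 10.0

# Delineate: for each time step, take the depth bin with the strongest return.
line_depth = echo.argmax(axis=1)

fig, ax = plt.subplots()
# imshow puts columns on x (depth) and rows on y (time).
ax.imshow(echo, aspect="auto", origin="lower", cmap="viridis")
# The overlay is an ordinary line plot drawn on the same axes.
ax.plot(line_depth, np.arange(n_time), color="red", linewidth=1.5)
ax.set_xlabel("depth bin")
ax.set_ylabel("time step")
# Call fig.savefig("echogram.png") or plt.show() to view the result.
```

The same `ax.plot(..., color="red")` call works on top of a seaborn heatmap as well, since seaborn draws onto matplotlib axes.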

fallow coyote May 16, 2025, 8:25 PM

#

Need to ask, should I use statsmodels to help refine my model whilst using sklearn as my main library for linear regression? Also, is there any difference in how sklearn and statsmodels handle linear regression? If I can get some clarification, that should help remove the frustration about which library to use for my model

verbal oar May 16, 2025, 9:18 PM

#

in statsmodels you have OLS (ordinary least squares)

#

difference in writing

#

also you have statistical summary (p-values, confidence intervals etc)

obtuse acorn May 17, 2025, 3:18 AM

#

would it make sense to set the index to be the target variable in a pandas dataframe or should i just leave it as a column?

#

like that

#

im guessing just leave it as a column

rich moth May 17, 2025, 7:05 AM

#

Its a work in progress but Ive updated my paper on the UCF if anyone is interested . Ive never done anything like this so feedback is a plus https://docs.google.com/document/d/1Bey9Qt6dcif0r4--rE3BP3GupnAw306EJGRgkZG7N3s/edit?usp=sharing

#

Still working on incorporating all the visuals though

#

dont be scared just say it. Im putting myself out there. dont be shy

deep anchor May 17, 2025, 7:10 AM

#

rich moth Its a work in progress but Ive updated my paper on the UCF if anyone is interest...

lol brooo just checked it out, and honestly?? not bad at all for ur first time 😮‍💨👏 like fr I wasn’t expectin it to be that detailed lmao. Some parts were a lil dense hhhmmmm maybe simplify a bit? but overall it lowkey makes sense 😭 visuals gonna help a lot once u throw em in tho frfr. Keep grindin, this got potential 🤙 let me kno when u update it again lol I’ll def take another look!

rich moth May 17, 2025, 7:10 AM

#

there ya go

#

thank you

deep anchor May 17, 2025, 7:11 AM

#

ok

#

thats good

tawdry sundial May 17, 2025, 2:26 PM

#

need help in #1373301621253210244

fallow coyote May 17, 2025, 4:49 PM

#

whats the difference between sklearn and statsmodel? Do I use either one or do I use a mixture of both modules?

sudden wyvern May 17, 2025, 4:49 PM

#

Hi all, I am Aakash and I want help creating a SQL agent that uses a local LLM served through something like Ollama, together with LangChain. Currently I am not able to build an efficient agent with these tools. Can anyone please suggest how I can create a good SQL agent that can talk to the database and answer the user's queries accordingly ❓

I have tried the llama3, deepseekr1 and llama3.2 models but I am getting some OutputParse exception.

rich moth May 17, 2025, 6:55 PM

#

The last image is the RNA structure of Covid 19. It even predicted the pseudoknot. The first one is 363 nt long. All these tests ran in under a minute. The speed at which this thing performs is bonkers

#

Its a straight web of connections hahaha

#

#

Look how all the RNA structure line up in the 135 degree region. It's detecting something for sure

#

In all the intial UCF test, the RNA tests, the Tade bot tests, all integrated with parts of the UCF.. it seems they all point to the same conclusion. There's something underlying in data we've been overlooking for a while. The consistent ~135 degree phase angle appearing across completely different types of data suggests a fundamental mathematical principle that seems universal.

#

There are different complexity spaces from crypto coins too.

past meteor May 17, 2025, 7:08 PM

#

fallow coyote whats the difference between sklearn and statsmodel? Do I use either one or do I...

Statsmodels has models sklearn doesn't have, specifically things related to time series. When there is an overlap I'd say that sklearn is about prediction and statsmodels is … about statistics, inference, interpretation etc.

slim tundra May 17, 2025, 8:31 PM

#

Hi,
I am having a hard time getting bipedalwalker v3 with PPO agent to walk. The reward seems to be stuck around 10 to 20. I am trying to get at least 200.
I have tried changing the parameters and the architectures nothing worked

I want to know if the issue lies in the architecture or in the training parameters

the script uses

ActorNet (actor policy)
Criticnet to compute state value and
ActorCriticNet combining both networks and adds helper methods to act and evaluate samples

Does anyone have experience or know something about deep reinforcement learning and can help?

fallow coyote May 17, 2025, 9:23 PM

#

past meteor Statsmodel has models sklearn doesn’t have, specifically things related to time ...

When should I use either modules? Im getting confused which to use

slim tundra May 17, 2025, 9:30 PM

#

fallow coyote When should I use either modules? Im getting confused which to use

you use both

fallow coyote May 17, 2025, 9:32 PM

#

In what way? I was thinking at first, use sklearn as my main library for the actual ML part with statsmodel to refine it. Im getting confused with this shit

slim tundra May 17, 2025, 9:41 PM

#

fallow coyote In what way? I was thinking at first, use sklearn as my main library for the act...

the actor makes action choices and the critic evaluates those actions until the action is good enough so that the agent walks

#

teaching a kid how to walk

#

basically

fallow coyote May 17, 2025, 9:50 PM

#

Is there any significant difference in how statsmodel uses linear regression techniques compared to sklearn?

past meteor May 17, 2025, 9:53 PM

#

fallow coyote When should I use either modules? Im getting confused which to use

I can't really answer that for you, what are you trying to do

#

Are you trying to just do predictions or are you doing data analysis with linear regression?

#

If you're "just" trying to predict a value --> sklearn
If you're doing data analysis, statistics, are interested in interpreting coefficients etc. --> statsmodels

toxic pilot May 17, 2025, 10:30 PM

#

fallow coyote Need to ask, should I use statsmodel to help refine my model whilst using sklear...

linear regression is linear regression.

#

also it depends on what you're trying to accomplish; if sklearn has what you need, then use sklearn. if it doesn't, then see if statsmodels does

#

in my experience, sklearn's strength is more machine learning while statsmodels is used primarily for classical & descriptive statistics

fallow coyote May 17, 2025, 10:51 PM

#

That puts it into perspective. I guess what Im trying to do is first do some data analysis on how the price of gold is affected by certain factors and then make a simple price prediction program based on said factors

abstract wasp May 18, 2025, 12:16 AM

#

Hi can someone explain to me this diagram? It’s in regards to L1 vs L2, I don’t understand the circle/diamond and the ellipses

calm thicket May 18, 2025, 1:03 AM

#

abstract wasp Hi can someone explain to me this diagram? It’s in regards to L1 vs L2, I don’t ...

those are the points which are less than 1 away from the origin

#

using those norms
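A short numeric check of that description, as a sketch: the diamond is the set of points whose L1 norm is at most 1, and the circle is the set whose L2 norm is at most 1. The helper name `in_unit_ball` and the sample point are made up for illustration.

```python
import numpy as np


def in_unit_ball(point, order):
    """True if `point` lies within distance 1 of the origin under the given norm order."""
    return np.linalg.norm(point, ord=order) <= 1.0


p = np.array([0.7, 0.7])

# L1 norm: |0.7| + |0.7| = 1.4, so the point is outside the diamond.
print(in_unit_ball(p, 1))

# L2 norm: sqrt(0.49 + 0.49) is about 0.99, so the point is inside the circle.
print(in_unit_ball(p, 2))

# The corners of the diamond sit on the axes, where both norms equal 1.
corner = np.array([1.0, 0.0])
print(in_unit_ball(corner, 1), in_unit_ball(corner, 2))
```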

rich moth May 18, 2025, 1:30 AM

#

It predicts RNA binding sites with pretty good accuracy, i built it with a validation dataset using PDB structures with experimentally verified binding sites, measuring distances between RNA residues and bound ligands to identify ground truth

75% precision / 60% recall on the biotin aptamer (1F27_A) and 41% precision / 54% recall on the FMN riboswitch (1FMN_A)

For SARS-CoV-2 RNA frameshifting element, my algorithm identified a key binding pocket at positions 10-16 (sequence GGGUUU) with a phase angle of 133.7° - precisely matching the universal pattern.

RNA structures consistently align near and around 135°, while cryptocurrency price data appears to align near 90°. Both show strong phase alignment, just at different characteristic angles.

verbal oar May 18, 2025, 10:27 AM

#

yes as I said statsmodels has OLS

#

and statsmodels has summary statistics related to hypothesis testing etc

plush kettle May 18, 2025, 11:41 AM

#

Guys, I want to ask suppose I want to train my object detection model with resnet fpn backbone on 640 x 640 images but no augmentations whatsoever, I use 80:10:10 split so I use 40 images for training and 5 for validation, which resnet backbone is the best

#

I know the dataset size is not enough but I can only work with the available data for now because my project manager told me not to make any augmentation/s first

obtuse acorn May 18, 2025, 11:52 AM

#

if im using scikit learn do i need to always use TimeSeriesSplit if my data has a date column?

#

or is TimeSeriesSplit only for if your trying to predict what will happen in the future?

#

versus like categorising something that has a date column?

calm thicket May 18, 2025, 12:17 PM

#

obtuse acorn or is TimeSeriesSplit only for if your trying to predict what will happen in the...

correct
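As a sketch of what `TimeSeriesSplit` actually does differently: every fold trains on earlier rows and tests on later ones, which is what matters when the task is forecasting. The toy array below is an assumption standing in for data already sorted by date.

```python
import numpy as np
from sklearn.model_selection import TimeSeriesSplit

# Twelve rows assumed to be sorted by date, oldest first.
X = np.arange(12).reshape(-1, 1)

tscv = TimeSeriesSplit(n_splits=3)
splits = list(tscv.split(X))

for train_idx, test_idx in splits:
    # The training window grows each fold; the test window always comes after it.
    print("train:", train_idx, "test:", test_idx)
```

For a task that only classifies rows which happen to carry a date column, an ordinary shuffled split can be fine, provided neighbouring rows in time are not near-duplicates of each other.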

thick heron May 18, 2025, 12:20 PM

#

is it possible to make some ai that is well optimized for deployments and be very good at it?

#

i am finding it hard to optimize

serene scaffold May 18, 2025, 1:29 PM

#

thick heron is it possible to make some ai that is well optimized for deployemnts and be ver...

You're talking about a neural network?

thick heron May 18, 2025, 1:29 PM

#

yes

serene scaffold May 18, 2025, 1:30 PM

#

What hardware are you using?

thick heron May 18, 2025, 1:42 PM

#

raspberry pi

serene scaffold May 18, 2025, 1:51 PM

#

thick heron raspberry pi

There's probably nothing you can do to get good performance.

thick heron May 18, 2025, 2:01 PM

#

great then

serene scaffold May 18, 2025, 2:08 PM

#

thick heron great then

Raspberry Pis aren't intended to be very powerful. If you're trying to run a neural network, you'd need to run the neural network on a different machine that can communicate with the pi

thick heron May 18, 2025, 2:11 PM

#

🙂‍↕️ tysm

verbal oar May 18, 2025, 2:22 PM

#

is there sth like resource of popular papers from arxiv and other sources?

#

similar to arxiv sanity vanity

obtuse acorn May 18, 2025, 2:22 PM

#

any idea when to use minmax scaler vs standard scaler in scikit learn?

verbal oar May 18, 2025, 2:25 PM

#

sth like
stable diffusion model, attention is all you need, variational bayes (actually dont remember title some related to vae), ...

#

or I must compile it?

#

I need just most popular or popular dont need all of them

fallow coyote May 18, 2025, 3:21 PM

#

thick heron great then

there's the AI hat you can buy that allows you to run ai applications on it, but I'll be honest: either buy an expensive ass pc or use google colab notebooks and run everything off the cloud

thick heron May 18, 2025, 3:24 PM

#

fallow coyote theres the AI hat you can buy that allows you to run ai applications on it but I...

🤧 okay 👍 Custom PCBs, do they work? An expensive ass pc is too heavy and bulky for my project, I need something small. Cloud is a second priority since it's an offline based system that I designed

#

Pi is useless even with that usb coral tpu thing

fringe jay May 18, 2025, 4:23 PM

#

first time learning and practicing neural networks and ai, if any of yall could help that'd be great #1373696202398240909

green pilot May 18, 2025, 4:39 PM

#

Any suggestions on the fastest way to convert csv to txt files ? I was thinking of just using pandas but i think it might be slower than the base csv to txt converter. Any suggestions?

serene scaffold May 18, 2025, 5:02 PM

#

green pilot Any suggestions on the fastest way to convert csv to txt files ? I was thinking ...

CSV files are already plain text. What's the issue?

crimson raft May 18, 2025, 5:39 PM

#

Hey guys.
Could someone please help me and look at a python code I'm working on? I'm not programmer nor have degree in IT, so, I'm not a pro. I posted the code two weeks ago and nobody answered, so I figured I ask here. Any help is much appreciated.

serene scaffold May 18, 2025, 5:45 PM

#

crimson raft Hey guys. Could someone please help me and look at a python code I'm working on?...

You always have to post the code before anyone can look at it, so it's always best to post it right away.

crimson raft May 18, 2025, 5:57 PM

#

serene scaffold You always have to post the code before anyone can look at it, so it's always be...

Yes, you're correct. I posted it and waited for a couple of days, however, nobody commented or anything. I just wanted to ask here first if someone would have a look.

serene scaffold May 18, 2025, 5:59 PM

#

crimson raft Yes, you're correct. I posted it and waited for a couple of days, however, nobod...

People won't commit to looking at the code and providing feedback if they don't know anything about the code, or how long it is, or what it's intended to do. So it saves work for everyone, including yourself, if you always give people the information they need to do what you're asking of them right away.

#

It might be that people won't look at your code even if you do post it, and that would be unfortunate, but they certainly won't if you don't post it.

#

And if people look at it after telling you that they will look at it, they would have done that even if you hadn't made them ask you to post it.

lapis sequoia May 18, 2025, 6:40 PM

#

https://www.youtube.com/watch?v=jxlTQUkpz3Y

YouTube

the data janitor

The Truth About Machine Learning Engineers It's Not Just Math #dat...

Machine learning is about data, not math.


#

how about this

rich moth May 18, 2025, 6:41 PM

#

crimson raft Hey guys. Could someone please help me and look at a python code I'm working on?...

Post it again and ask.

cloud zenith May 18, 2025, 7:34 PM

#

Hello! Can someone please help me with ONNX exporting? I'm trying to export an ELM custom model into ONNX format, but keep running into this mysterious error:

```
Cell In[1], line 4
      1 import numpy as np
      3 from onnx import helper
----> 4 from skl2onnx import convert_sklearn
      5 from skl2onnx.common.data_types import FloatTensorType
      6 from skl2onnx.common.utils import check_input_and_output_numbers

File ~\AppData\Local\Programs\Python\Python313\Lib\site-packages\skl2onnx\__init__.py:16
     12 __model_version__ = 0
     13 __max_supported_opset__ = 21  # Converters are tested up to this version.
---> 16 from .convert import convert_sklearn, to_onnx, wrap_as_onnx_mixin
     17 from ._supported_operators import update_registered_converter, get_model_alias
     18 from ._parse import update_registered_parser

File ~\AppData\Local\Programs\Python\Python313\Lib\site-packages\skl2onnx\convert.py:8
      6 import numpy as np
      7 import sklearn.base
----> 8 from .proto import get_latest_tested_opset_version
      9 from .common._topology import convert_topology
     10 from .common.utils_sklearn import _process_options

File ~\AppData\Local\Programs\Python\Python313\Lib\site-packages\skl2onnx\proto\__init__.py:22
     18 except ImportError:
     19     # onnx is too old.
     20     pass
---> 22 from onnx.helper import split_complex_to_pairs
     25 def make_tensor_fixed(name, data_type, dims, vals, raw=False):
     26     """
     27     Make a TensorProto with specified arguments.  If raw is False, this
     28     function will choose the corresponding proto field to store the
   (...)     31     this case.
     32     """

ImportError: cannot import name 'split_complex_to_pairs' from 'onnx.helper' (C:\Users\Admin\AppData\Local\Programs\Python\Python313\Lib\site-packages\onnx\helper.py)```

#

I'm using Python 3.13.2.

verbal oar May 18, 2025, 8:21 PM

#

# onnx is too old

#

check version of onnx
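A quick way to do that check, sketched with only the standard library. The package names listed are the ones appearing in the traceback and later messages; the helper `installed_version` is made up for illustration. An `ImportError` on a specific name can come from two installed packages being out of step with each other, so comparing the reported versions against each project's release notes is a reasonable first step, though that cause is an assumption here.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package):
    """Return the installed version string for `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


# Packages named in the traceback and follow-up messages.
for name in ("onnx", "skl2onnx", "onnxconverter-common"):
    print(f"{name}: {installed_version(name)}")
```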

rich moth May 18, 2025, 9:27 PM

#

I plugged the UCF into a three body problem simulation.

cloud zenith May 18, 2025, 9:53 PM

#

verbal oar `# onnx is too old`

I have the latest version of ONNX though

#

Also that's not what the error says...

rich moth May 18, 2025, 9:59 PM

#

#

Aligns in the chaotic region just like my other research

lapis sequoia May 19, 2025, 6:00 AM

#

rich moth

which library do you use for such stuff?

rich moth May 19, 2025, 6:13 AM

#

lapis sequoia which library do you use for such stuff?

This was all matplotlib

lapis sequoia May 19, 2025, 6:14 AM

#

nice

#

I assume a lot of coding to get such shapes

rich moth May 19, 2025, 6:15 AM

#

lapis sequoia I assume a lot of coding to get such shapes

For the RNA stuff?

lapis sequoia May 19, 2025, 6:16 AM

#

generally speaking

rich moth May 19, 2025, 6:16 AM

#

300 lines for the three body problem stuff

lapis sequoia May 19, 2025, 6:16 AM

#

oh dear PepeSuit

rich moth May 19, 2025, 6:16 AM

#

just for visual

lapis sequoia May 19, 2025, 6:16 AM

#

didn't use chatgpt to get it faster?

rich moth May 19, 2025, 6:16 AM

#

i dont use chatgpt, but i do utilize AI

#

way I see it, my times limited on this planet. I got things todo

lapis sequoia May 19, 2025, 6:18 AM

#

fair enough

rich moth May 19, 2025, 6:19 AM

#

This one's plotly and matplotlib

#

R1136 makes my computer lag lol

#

you dont even wanna see the 700 nt one

lapis sequoia May 19, 2025, 6:42 AM

#

bloody lots of code for graphs

#

are you working on a research?

arctic wedgeBOT May 19, 2025, 6:43 AM

#

:incoming_envelope: :ok_hand: applied timeout to @rich moth until <t:1747637582:f> (10 minutes) (reason: attachments spam - sent 7 attachments).

The <@&831776746206265384> have been alerted for review.

sudden canyon May 19, 2025, 6:58 AM

#

!unmute @rich moth

arctic wedgeBOT May 19, 2025, 6:58 AM

#

:x: There's no active timeout infraction for user @rich moth.

sudden canyon May 19, 2025, 6:58 AM

#

Huh

#

Oh

obtuse acorn May 19, 2025, 7:51 AM

#

so if im using scikit learns gridsearch and ive got unbalanced categories, which score function should i use?

#

im currently using f1_macro

#

but idk if i should be using roc_auc_ovo or one of the other ones

verbal oar May 19, 2025, 8:39 AM

#

ok so not issue with onnx

obtuse acorn May 19, 2025, 9:23 AM

#

any idea what it means if a model has like 99% accuracy on both test and train data?

#

like is that overfitting or is it just really accurate?

#

i dont think theres any data leakage or anything

waxen kindle May 19, 2025, 10:05 AM

#

It is really accurate OR your testing dataset is included in your train dataset

obtuse acorn May 19, 2025, 10:13 AM

#

waxen kindle It is really accurate OR yout testing dataset is including into your train datas...

i dont see any way it could be

#

im using a pipeline in scikit learn

verbal oar May 19, 2025, 10:17 AM

#

show graph please

#

of these curves

past meteor May 19, 2025, 10:52 AM

#

obtuse acorn so if im using scikit learns gridsearch and ive got unbalanced categories, which...

Depends on what you're doing

#

f1_macro and so on all make the assumption that the cost of misclassification is the same

obtuse acorn May 19, 2025, 10:53 AM

#

oh yeah

#

thats a good point

past meteor May 19, 2025, 10:53 AM

#

All classification problems I've worked on in the past month had asymmetric costs. I really needed to optimize for precision or recall

#

People have probably gotten tired of me asking "Do we care more about false positives or false negatives"

#

But that's the reflex you need 🙂

obtuse acorn May 19, 2025, 10:54 AM

#

hmmm

past meteor May 19, 2025, 10:54 AM

#

(even if your dataset is balanced)

obtuse acorn May 19, 2025, 10:54 AM

#

its involving attack types

#

so like theres categories like ddos and normal etc

past meteor May 19, 2025, 10:55 AM

#

So it's multiclass?

#

Or even multilabel?

obtuse acorn May 19, 2025, 10:56 AM

#

ignore number 5, its not in the version im using

#

tho i could recreate number 5 from number 6

#

its for a uni assignment and they removed a column

past meteor May 19, 2025, 10:57 AM

#

So you're predicting #6

obtuse acorn May 19, 2025, 10:58 AM

#

yeah

past meteor May 19, 2025, 10:58 AM

#

Each record belongs to just 1 attack

obtuse acorn May 19, 2025, 10:58 AM

#

yeah

past meteor May 19, 2025, 10:58 AM

#

Exactly 1, not 0 not 1+?

obtuse acorn May 19, 2025, 10:58 AM

#

yeah

#

past meteor May 19, 2025, 10:59 AM

#

If it's a school assignment and not a "real life" problem then f1_macro or similar is probably fine

agile cobalt May 19, 2025, 10:59 AM

#

obtuse acorn

how did you split the train and test data?

obtuse acorn May 19, 2025, 10:59 AM

#

tho im guessing it would probably be better if it miscategorised something thats normal as an attack than an attack as normal

past meteor May 19, 2025, 11:00 AM

#

Yeah in the wild a false negative is worse

#

You'd want to flag more things and have that as a starting point to investigate

obtuse acorn May 19, 2025, 11:00 AM

#

agile cobalt how did you split the train and test data?

i used train_test_split from scikit learn?

past meteor May 19, 2025, 11:00 AM

#

And I'd sell it to "business people" as possible attacks

obtuse acorn May 19, 2025, 11:00 AM

#

do i need to use TimeSeriesSplit?

past meteor May 19, 2025, 11:00 AM

#

Hence recall > precision here
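One way to express "recall matters more than precision" in a grid search, as a sketch: score with a macro-averaged F-beta where beta is above 1, which weights recall more heavily than F1 does. The synthetic imbalanced dataset, the classifier, the parameter grid, and beta = 2 are all assumptions for illustration; `scoring="recall_macro"` would be a built-in alternative.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import fbeta_score, make_scorer
from sklearn.model_selection import GridSearchCV

# Synthetic imbalanced three-class problem (assumption standing in for the attack data).
X, y = make_classification(
    n_samples=600,
    n_classes=3,
    n_informative=4,
    weights=[0.8, 0.15, 0.05],
    random_state=0,
)

# beta > 1 makes recall count more than precision; macro averaging treats every class equally.
f2_macro = make_scorer(fbeta_score, beta=2, average="macro")

search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid={"C": [0.1, 1.0, 10.0]},
    scoring=f2_macro,
    cv=3,
)
search.fit(X, y)

print(search.best_params_, search.best_score_)
```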

agile cobalt May 19, 2025, 11:01 AM

#

let me rephrase: Did you shuffle it before splitting or take the tail as the test data?

obtuse acorn May 19, 2025, 11:01 AM

#

shuffle im pretty sure

#

yeah it shuffles by default

agile cobalt May 19, 2025, 11:02 AM

#

interpolating is a lot easier (and arguably less useful) than extrapolating

if you included the records for 18:33:31 and 18:33:41 for a given day, then it should be easy for the model to guess that everything in between those two timestamps has the same label

past meteor May 19, 2025, 11:02 AM

#

not sure if you need a time series split

#

But maybe yes

#

Look at the data and see if you have correlations along the time axis yeah

agile cobalt May 19, 2025, 11:03 AM

#

in contrast, if you ask for the model to predict a label for a day that was not present in your data the chances for it to get it wrong are much, much higher
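The interpolation point can be reproduced on toy data, as a sketch. The setup is an assumption: the timestamp is the only feature, and labels come in contiguous blocks, like attack episodes. With a shuffled split the model only has to fill in gaps between training rows it has already seen; with a chronological split it has to guess labels for a period it never saw.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Toy log (assumption): 1000 time steps, the label changes every 50 steps and cycles over 3 classes.
t = np.arange(1000)
X = t.reshape(-1, 1)  # time is the only feature
y = (t // 50) % 3

# Shuffled split: test rows are surrounded by training rows with the same label.
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
random_acc = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr).score(X_te, y_te)

# Chronological split: train on the first 80 %, test on the unseen final 20 %.
cut = 800
chrono_acc = (
    DecisionTreeClassifier(random_state=0).fit(X[:cut], y[:cut]).score(X[cut:], y[cut:])
)

print(f"shuffled split accuracy:      {random_acc:.2f}")
print(f"chronological split accuracy: {chrono_acc:.2f}")
```

A large gap between the two numbers on real data is a hint that the high score comes from neighbouring rows in time rather than from the other features.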

obtuse acorn May 19, 2025, 11:03 AM

#

agile cobalt interpolating is a lot easier (and arguably less useful) than extrapolating if ...

what do you mean by interpolating?

past meteor May 19, 2025, 11:03 AM

#

It's a very strange case the more that I look at it

obtuse acorn May 19, 2025, 11:03 AM

#

yeah i have no idea how the dataset actually works

past meteor May 19, 2025, 11:03 AM

#

If you random split your accuracy will be near 100 %

obtuse acorn May 19, 2025, 11:04 AM

#

its like gps data or something

past meteor May 19, 2025, 11:04 AM

#

Due to what Etrotta is talking about

obtuse acorn May 19, 2025, 11:04 AM

#

i could switch to one of the other datasets

#

it wouldnt be very difficult

past meteor May 19, 2025, 11:04 AM

#

You have N data points from each attack

agile cobalt May 19, 2025, 11:05 AM

#

obtuse acorn yeah i have no idea how the dataset actually works

understanding your data is the very first step you should take before trying to do anything with it whatsoever

past meteor May 19, 2025, 11:05 AM

#

If you drop all features except time and do a random split you have near 100 % accuracy
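
A rough way to see this effect, assuming synthetic data where each attack type occupies its own contiguous time window (the records, block size, and helper names below are all hypothetical). A 1-nearest-neighbour lookup on the timestamp alone looks excellent on a random split and falls apart when the test set is the tail:

```python
import random

random.seed(0)

# Synthetic log: 1000 records, one per second; each 100-second block is a
# different (hypothetical) attack type, so the label depends only on time.
records = [(t, f"type{t // 100}") for t in range(1000)]

def nearest_time_label(train, t):
    """1-nearest-neighbour on the timestamp alone."""
    return min(train, key=lambda rec: abs(rec[0] - t))[1]

def accuracy(train, test):
    hits = sum(nearest_time_label(train, t) == label for t, label in test)
    return hits / len(test)

# Random split: test timestamps sit between training timestamps.
shuffled = records[:]
random.shuffle(shuffled)
random_acc = accuracy(shuffled[:800], shuffled[800:])

# Chronological split: the test set is the tail, which holds unseen blocks.
tail_acc = accuracy(records[:800], records[800:])

print(f"random split: {random_acc:.2f}, tail split: {tail_acc:.2f}")
```

The toy setup is deliberately extreme (the tail contains only attack types the model never saw), but it shows why a high score on a shuffled split says little about future data.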

#

"Oh it's around 18:30, what attack happened there? I see, that's when we had the ddos"

agile cobalt May 19, 2025, 11:06 AM

#

obtuse acorn what do you mean by interpolating?

if I tell you that something costs 10$ on the day 1, 20$ on the day 3, then 10$ on the day 5, what would you guess it costs on the days 2, 4 and 6?
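
Worked out as a small sketch (the `interpolate` helper is an illustration written for this example, not a library function): days 2 and 4 sit between known points, so a straight line between neighbours gives an answer, while day 6 is past the last known point and the data alone cannot settle it.

```python
from typing import Dict, Optional

def interpolate(known: Dict[int, float], day: int) -> Optional[float]:
    """Linear interpolation between the two known days around `day`.

    Returns None when `day` lies outside the known range, because that
    would be extrapolation and the data gives no direct answer.
    """
    days = sorted(known)
    if day < days[0] or day > days[-1]:
        return None
    for left, right in zip(days, days[1:]):
        if left <= day <= right:
            frac = (day - left) / (right - left)
            return known[left] + frac * (known[right] - known[left])
    return known[day]  # only reached when there is a single known day

prices = {1: 10.0, 3: 20.0, 5: 10.0}
for day in (2, 4, 6):
    print(day, interpolate(prices, day))
```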

obtuse acorn May 19, 2025, 11:06 AM

#

agile cobalt understanding your data is the very first step you should take before trying to ...

like i get what the data is

#

i dont get how the date and latitude and longitude works to tell what the attack type is

past meteor May 19, 2025, 11:07 AM

#

Could be that they're using a specific data centre for ddos

obtuse acorn May 19, 2025, 11:08 AM

#

cloud zenith May 19, 2025, 11:08 AM

#

Hey! I've narrowed down my error to being unable to install onnxconverter_common for some weird reason. I have CMake installed, I have Visual Studio installed, the PATH variables are updated, long file names in Windows are enabled. Whenever I try to install that module, latest version for Python 3.13.2, it tries to build something called "wheels", waits for like 5 minutes, and then gives me this monstrosity of an error many thousands of lines long that ends with this:

#

Does anyone know what could possibly be causing this?

agile cobalt May 19, 2025, 11:09 AM

#

obtuse acorn

lol what, what are those points in the ocean? islands or it's normalized/scaled in some way

past meteor May 19, 2025, 11:10 AM

#

or mock data

obtuse acorn May 19, 2025, 11:10 AM

#

agile cobalt lol what, what are those points in the ocean? islands or it's normalized/scaled ...

it goes from like 0 to 500 lat long

agile cobalt May 19, 2025, 11:11 AM

#

uhhh usually long goes -180 +180 and lat goes -90 +90

#

there are some different scales and other special ways of measuring, but still seems very weird

agile cobalt May 19, 2025, 11:14 AM

#

cloud zenith Hey! I've narrowed down my error to being unable to install onnxconverter_common...

there might be a more detailed error message further up
personally I would probably just try installing it via conda instead

cloud zenith May 19, 2025, 11:17 AM

#

agile cobalt there might be a more detailed error message further up personally I would proba...

So, conda wouldn't give the same error message?

#

I've just never used conda so I don't understand how it'd be different as to what I'm doing here

obtuse acorn May 19, 2025, 11:19 AM

#

agile cobalt uhhh usually long goes -180 +180 and lat goes -90 +90

it might be that going around the world multiple times just keeps going higher?

final jolt May 19, 2025, 1:05 PM

#

obtuse acorn it might be that going around the world multiple times just keeps going higher?

I mean that functionally isnt how long/lat works. Unless in the codes' case it is using some other coordinates to denote it like rotational. But that wouldnt really make sense as a data list of source locations. Which reading that table you posted doesnt seem to be the case so perhaps an error

obtuse acorn May 19, 2025, 1:16 PM

#

final jolt I mean that functionally isnt how long/lat works. Unless in the codes' case it ...

the code from this worked https://gis.stackexchange.com/questions/303300/calculating-correct-longitude-when-its-over-180

#

longitude = (longitude % 360 + 540) % 360 - 180 turns it into -180 to 180
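
As a quick sanity check of what that formula does, here is a minimal sketch. It only demonstrates the arithmetic, and it assumes the values are plain degrees that ran past 180, which is not confirmed for this dataset. The `wrap_longitude` name is made up for the example:

```python
def wrap_longitude(lon: float) -> float:
    """Map any longitude in degrees onto the interval [-180, 180)."""
    return (lon % 360 + 540) % 360 - 180

for raw in (10, 190, 360, 500, -190):
    print(raw, "->", wrap_longitude(raw))
```

In Python `%` with a positive divisor never returns a negative number, so `(lon + 180) % 360 - 180` gives the same result; the double modulo matters in languages where `%` can go negative.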

agile cobalt May 19, 2025, 1:21 PM

#

obtuse acorn the code from this worked https://gis.stackexchange.com/questions/303300/calcula...

unless you find some documentation explicitly saying that this is indeed how they constructed the dataset, there is no guarantee it is correct

that question is specifically adjusting it given the way vue-leaflet works, a dataset created using different tool may have a different logic

unless you plot it and see the coordinates make perfect sense (e.g. all points are in cities with datacenters) I wouldn't rely on it

obtuse acorn May 19, 2025, 1:24 PM

#

agile cobalt unless you find some documentation explicitly saying that this is indeed how the...

i mean its literally the exact same on the map

#

well i guess that would make sense

agile cobalt May 19, 2025, 1:25 PM

#

there is a chance the dataset is just completely senseless I guess
(random mocked data)

obtuse acorn May 19, 2025, 1:33 PM

#

i emailed my lecturer to ask about it

#

the website for the dataset is here if it helps https://research.unsw.edu.au/projects/toniot-datasets

agile cobalt May 19, 2025, 1:39 PM

#

in first place, did you include any rows with no ongoing attack or you always predict some kind of attack?

obtuse acorn May 19, 2025, 1:40 PM

#

theres a category called normal if thats what you mean?

agile cobalt May 19, 2025, 1:40 PM

#

oh

obtuse acorn May 19, 2025, 1:43 PM

#

right i checked the original source, everything that isnt normal is an attack

#

because i wasnt sure if password was a type of attack

#

but it is apparently

final jolt May 19, 2025, 2:19 PM

#

which type of data in this were you using? Also, possible theory: is your globe view backwards (mirrored) by chance? since so many end up in the ocean I wonder if its inverted.

#

my guess is in the processed_network_dataset based on the contents

final jolt May 19, 2025, 2:27 PM

#

obtuse acorn

which dataset in that site are you using for this? Trying to look at its formatting but lots of files here. certainly one of the processed ones it seems

obtuse acorn May 19, 2025, 2:45 PM

#

final jolt which dataset in that site are you using for this? Trying to look at its format...

the iot gps tracker one

#

except its not actually the one from that site

#

its like a small section of it

#

basically my uni said heres 6 datasets we modified and links to where the originals and information about them is

#

so its this one but no column 5

#

also yeah i think its the preprocessed folder

final jolt May 19, 2025, 2:47 PM

#

Yea I dont think the column labels for this sheet are correct

#

Well I should state that I dont know why the lat/lon numbers are so high but simply doing the calculation you posted should result in correct data. Though no idea why the plotting doesnt land on actual, well, land

obtuse acorn May 19, 2025, 2:56 PM

#

final jolt Well I should state that I dont know why the lat/lon numbers are so high but sim...

well gps works if your not on land too i guess

#

could be weather balloons or something i guess

final jolt May 19, 2025, 2:56 PM

#

Well yea I do know that it works regardless generally speaking.

#

I mean weather balloons or similar would certainly align with the inherent issue in IoT device security in general

obtuse acorn May 19, 2025, 3:19 PM

#

that looks neat

cobalt rover May 19, 2025, 3:21 PM

#

Hey there, I ran into hardware constraints while trying to finetune 3B and 8B variants of qwen2.5 with fp16 and bf16 precision (Bzzt, OOM errors). I have access to a total of 48(24+24) GB of VRAM but this is clearly not enough to train them in full precision so I have reverted to using 8-bit quantized models for the same. For some reason on the internet, everyone seems to be training their quantized models with LoRA and I wished to know if it will be possible to train these quants with SFT/RL without relying on LoRA as I do want to change the base model's weights.

final jolt May 19, 2025, 3:25 PM

#

obtuse acorn that looks neat

Tie-dye Bowties

thick heron May 19, 2025, 3:55 PM

#

Is custom pcb worth money?

#

Just to run a mid size ml

final jolt May 19, 2025, 4:02 PM

#

thick heron Is custom pcb worth money?

huh? A custom PCB for what exactly

thick heron May 19, 2025, 4:02 PM

#

Running a multi model yolo cc and then ocr together with a base ml models

final jolt May 19, 2025, 4:04 PM

#

nevermind, I think I was thinking of a different definition of 'custom pcb'

thick heron May 19, 2025, 4:04 PM

#

Oh

#

No not that one

river cape May 19, 2025, 4:20 PM

#

cobalt rover Hey there, I ran into hardware constraints while trying to finetune 3B and 8B va...

8bit is only for inference right , not training and even if you train using it , i doubt any changes will be made to the model (updates)

#

Have you used QAT?

#

It's kinda complex so my suggestion would be LoRA unless you have massive power

cobalt rover May 19, 2025, 5:32 PM

#

river cape Have you used QAT?

came across it while researching more training methods but QAT seems to be useful primarily for training models that have to be quantized by the end of the training process.

cobalt rover May 19, 2025, 5:33 PM

#

river cape 8bit is only for inference right , not training and even if you train using it ,...

gotcha, I honestly wasn't aware that quants were inference only to be honest. i guess i have to stick with LoRA/QLoRA since i am on a deadline lol

river cape May 19, 2025, 5:34 PM

#

cobalt rover gotcha, I honestly wasn't aware that quants were inference only to be honest. i ...

Tbh LoRA gets the job done in most cases , unless you working specifically for some task-specific cases

cobalt rover May 19, 2025, 5:35 PM

#

yeah it's a coding task for a particular language

#

hope it's going to be enough- my earlier misunderstanding of ignoring that loading into the memory =/= VRAM consumed during training will cost me some days of progress welp
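
A back-of-the-envelope sketch of that gap, using a common rule of thumb for mixed-precision training with Adam (about 16 bytes per parameter). The byte counts, the nominal 3B/8B parameter counts, and the helper names are assumptions for illustration; activations and framework overhead come on top:

```python
GB = 1e9  # decimal gigabytes

def full_finetune_gb(n_params: float) -> float:
    """Rough weight + gradient + optimizer memory for mixed-precision Adam.

    Assumed rule of thumb (bytes per parameter):
      2  bf16/fp16 weights
      2  bf16/fp16 gradients
      4  fp32 master copy of the weights
      8  fp32 Adam first and second moments
    Activations and framework overhead are NOT included.
    """
    bytes_per_param = 2 + 2 + 4 + 8
    return n_params * bytes_per_param / GB

def load_only_gb(n_params: float, bytes_per_param: float = 2) -> float:
    """Memory just to hold the weights, e.g. for inference in bf16."""
    return n_params * bytes_per_param / GB

for name, n in (("3B", 3e9), ("8B", 8e9)):
    print(f"{name}: load ~{load_only_gb(n):.0f} GB, "
          f"full fine-tune ~{full_finetune_gb(n):.0f} GB before activations")
```

Under these assumptions a 3B model already needs roughly the whole 48 GB before any activations are stored, which fits the OOM errors described above. LoRA avoids most of this because gradients and optimizer state exist only for the small adapter matrices.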

obtuse acorn May 19, 2025, 5:36 PM

#

am i not supposed to do dataFiltered.latitude = (dataFiltered.latitude % 360 + 540) % 360 - 180 to overwrite the latitude in the dataframe?

#

i got a chained assignment warning
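
That warning usually shows up when the frame being written to is itself a filtered slice of another frame. A minimal sketch of one common way around it, assuming `dataFiltered` was produced by filtering a larger frame (the toy `data` frame and the `wrap_longitude` helper are hypothetical stand-ins): take an explicit copy, then assign with bracket syntax. Note that the quoted formula was written for longitude; whether it is appropriate for a latitude column is not established here.

```python
import pandas as pd

def wrap_longitude(lon):
    """Wrap degrees into [-180, 180); works element-wise on a Series."""
    return (lon % 360 + 540) % 360 - 180

# Hypothetical stand-in for the real data.
data = pd.DataFrame({
    "longitude": [10.0, 190.0, 500.0],
    "type": ["normal", "ddos", "normal"],
})

# .copy() makes dataFiltered its own frame instead of a slice of `data`,
# so the assignment below is unambiguous and pandas has nothing to warn about.
dataFiltered = data[data["type"] == "normal"].copy()
dataFiltered["longitude"] = wrap_longitude(dataFiltered["longitude"])

print(dataFiltered)
```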

lapis sequoia May 19, 2025, 5:42 PM

#

what should the sequence length for a LSTM be over a very long period of time?

river cape May 19, 2025, 5:50 PM

#

cobalt rover hope it's going to be enough- my earlier misunderstanding of ignoring that loadi...

Trueeee

river cape May 19, 2025, 5:50 PM

#

lapis sequoia what should the sequence length for a LSTM be over a very long period of time?

Depends try different sequence length

cobalt rover May 19, 2025, 5:57 PM

#

lapis sequoia what should the sequence length for a LSTM be over a very long period of time?

what's your use case?

lapis sequoia May 19, 2025, 6:06 PM

#

river cape Depends try different sequence length

honestly, I just do that until the MSE loss is as small as possible

lapis sequoia May 19, 2025, 6:07 PM

#

cobalt rover what's your use case?

from the 40's until now

river cape May 19, 2025, 6:08 PM

#

lapis sequoia from the 40's until now

LSTMs can handle long sequences up to a certain limit , try using bidirectional

#

Helps understanding context better

lapis sequoia May 19, 2025, 6:09 PM

#

river cape LSTMs can handle long sequences up to a certain limit , try using bidirectional

I did in tensorflow, and I have not touched an RNN in forever, I did it with natural language processing until I realized they were useless compared to the transformer and everything else

river cape May 19, 2025, 6:10 PM

#

lapis sequoia I did in tensorflow, and I have not touched an RNN in forever, I did it with natural l...

RNNs were like the 1st milestone , then LSTMs , then Encoder-Decoder, then Attention and lastly Transformers

#

and nowadays most tasks in nlp are done by transformers , so there are far fewer use cases for the previous networks

#

but its good to know

lapis sequoia May 19, 2025, 6:12 PM

#

river cape RNNs were like the 1st milestone , then LSTMs , then Encoder-Decoder, then Atten...

I know, I am using it for time series

river cape May 19, 2025, 6:13 PM

#

lapis sequoia I know, I am using it for time series

Yea lstms can also be used there

cobalt rover May 19, 2025, 6:26 PM

#

lapis sequoia from the 40's until now

i actually meant to ask what kind of data you are working with here. Anyways, a good suggestion would be to begin with 7 as that can cover a lot of time(week) and should serve as a good starting point. Besides this, you can experiment with various values and pick the one which fits your loss expectations the best! You might have some problems if its the first time working with LSTMs directly, but that's also how i started and i'm sure gpt/gemini/claude can help a lot here!
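
For reference, "sequence length" here is the number of past steps the model sees per training example. A minimal sketch of how a series gets cut into windows for a given length (the `make_windows` helper and the toy series are made up for illustration); trying different lengths just means rebuilding these pairs and comparing validation loss:

```python
def make_windows(series, seq_len):
    """Split a 1-D series into (window, next_value) training pairs."""
    pairs = []
    for start in range(len(series) - seq_len):
        window = series[start:start + seq_len]
        target = series[start + seq_len]
        pairs.append((window, target))
    return pairs

series = list(range(1, 11))  # toy data: 1..10
for seq_len in (3, 7):
    pairs = make_windows(series, seq_len)
    print(seq_len, len(pairs), pairs[0])
```

Longer windows give the model more context but leave fewer training pairs from the same series, which is part of why the best value depends on the data.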

obtuse acorn May 19, 2025, 7:20 PM

#

obtuse acorn that looks neat

oops, i mapped it wrong

#

verbal oar May 19, 2025, 7:42 PM

#

is it possible to train llm on laptop instead of on cloud and it just will take much time?
so money saved but time not?

#

how it goes?

#

1-2 weeks of training on some A100 or the like

#

so it would take few months, estimated, not on A100, but some pc gpu

#

I have iris xe

serene scaffold May 19, 2025, 7:47 PM

#

verbal oar is it possible to train llm on laptop instead of on cloud and it just will take ...

Fine Tuning an LLM requires more compute power than can fit in a laptop.

verbal oar May 19, 2025, 7:48 PM

#

yes but with some weights checkpointing

serene scaffold May 19, 2025, 7:48 PM

#

And if you spent a few months trying it anyway, you'd fry the laptop

verbal oar May 19, 2025, 7:48 PM

#

ah finetuning not training read wrongly sorry

serene scaffold May 19, 2025, 7:48 PM

#

You can't train an LLM from scratch on any consumer hardware

#

And if you mean "training but not from scratch", that's what fine tuning is

verbal oar May 19, 2025, 7:50 PM

#

so tldr gen ai must be done only on cloud?

#

sorry I thought this way train llm on some compute powerful, cost few million of $
but training llm on laptop would be free (not considering power consumption)
but would just takes longer

#

but this not working like this

#

because then companies would train for months for free

#

so in short I thought can just split compute

#

but training some language model is possible on laptop? (not llm)

#

I remember I used colab for resnet50 and vgg16 so they too are not possible to train on laptop?

#

so question is from what number of parameters its not possible to train from scratch on laptop?

#

ok also I remember resnet and vgg were pretrained and it was about transfer learning

agile cobalt May 19, 2025, 8:15 PM

#

verbal oar sorry I thought this way train llm on some compute powerful, cost few million of...

it costs them a few USD per hour per GPU, with each GPU being many times more powerful than a laptop's

taking https://huggingface.co/meta-llama/Llama-4-Scout-17B-16E for example, it required millions of hours worth of compute to train from scratch (divided in parallel amongst a lot of GPUs)

#

fine tuning is possible on high end consumer hardware, but not laptop level hardware

You can use Google Colab or Kaggle to borrow GPUs from google for free

verbal oar May 19, 2025, 8:18 PM

#

yes but for noncommercial use, for learning

#

if it would be for commercial, then meta would use just colab 😂

agile cobalt May 19, 2025, 8:20 PM

#

verbal oar if it would be for commercial, then meta would use just colab 😂

I don't think that you understand the sheer scale of the data and number of GPUs they are using to train LLMs derp

#

a single training run (*from scratch) costs millions of dollars worth of computing power
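
Rough arithmetic behind that, as a sketch. All three inputs are illustrative assumptions picked to match "millions of hours" and "a few USD per hour per GPU"; they are not quoted figures, and perfect parallel scaling is assumed:

```python
HOURS_PER_YEAR = 24 * 365

# Illustrative assumptions only, not quoted figures.
gpu_hours = 5_000_000
usd_per_gpu_hour = 2.0
n_gpus = 10_000

cost_usd = gpu_hours * usd_per_gpu_hour
days_on_cluster = gpu_hours / n_gpus / 24
years_on_one_gpu = gpu_hours / HOURS_PER_YEAR

print(f"cost: ${cost_usd:,.0f}")
print(f"wall-clock on {n_gpus:,} GPUs: {days_on_cluster:.1f} days")
print(f"wall-clock on one such GPU: {years_on_one_gpu:.0f} years")
```

So "just let it run longer on one machine" means centuries even on a single datacenter GPU under these assumptions, and a laptop is slower again.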

verbal oar May 19, 2025, 8:20 PM

#

some petabytes (scale of data)?

#

ah I have just 1TB

final jolt May 19, 2025, 8:26 PM

#

its more like the point is, commercial LLM cost money to train. There are not free versions at that scale even for "smaller" sets of data.

#

Because of the sheer amount of GPU power required to actually do it in an amount of time that is not absurd

fringe jay May 20, 2025, 3:56 AM

#

could someone help me in #1374233576706408458 ? I've been tweaking it for hours and I cant seem to fix it

unkempt apex May 20, 2025, 5:23 AM

#

fringe jay could someone help me in <#1374233576706408458> ? I've been tweaking it for hou...

why are you using neuralnetwork package?

river cape May 20, 2025, 9:12 AM

#

fringe jay could someone help me in <#1374233576706408458> ? I've been tweaking it for hou...

Use tensorflow or torch

hexed yew May 20, 2025, 12:48 PM

#

Any advice for imputing missing categorical data ? None of my variables appear to cluster well or have relationships with the categorical variable

polar hornet May 20, 2025, 4:04 PM

#

Hi guys, so i have an assignment at school that requires an expert in the field of artificial intelligence to be interviewed for my scientific article assignment. I really hope someone could help me here

serene scaffold May 20, 2025, 4:05 PM

#

polar hornet Hi guys, so i have an assignment at school that requires an expert in the field ...

can you direct people to your thread on this and post the list of questions?

polar hornet May 20, 2025, 4:24 PM

#

Can someone help me answer some questions for my scientific article assignment here?
https://discord.com/channels/267624335836053506/1374415687870845049

heady pivot May 20, 2025, 5:24 PM

#

Is this chat a good place to ask about data engineering stuff?

heady pivot May 20, 2025, 5:41 PM

#

I'm a data engineer with 2 years of experience. Currently, I'm looking to start an AWS certification, but after studying through AWS Skill Builder, it seems more like a marketing stunt than a real certification. Based on my experience, most AWS services feel like auto-managed versions of open-source tools. At my startup, cost is a huge concern, so aside from Redshift, Lambda, and RDS, we avoid other AWS services. Am I wrong for sticking with hosting everything on EC2 (e.g., Kafka, Airflow, dbt for ETL) and using Lambda for code execution? This is how I’m handling things now. Any advice would be much appreciated!

Basically, all my problems are solved with SQL on RedShift and relatively simple Python scripts in Lambda (serverless). This setup handles everything we need right now!

steel spindle May 20, 2025, 7:57 PM

#

How do you create AI in python?

#

I am new to it

serene scaffold May 20, 2025, 8:00 PM

#

steel spindle How do you create AI in python?

what does "AI" mean to you?

steel spindle May 20, 2025, 8:43 PM

#

Artificial intelligence, something that can talk to you, like a person

serene scaffold May 20, 2025, 8:48 PM

#

steel spindle Artificial intelligence, something that can talk to you, like a person

So you wouldn't consider a self-driving car to be AI?

steel spindle May 20, 2025, 8:48 PM

#

No, I would consider it, but a different type of AI

serene scaffold May 20, 2025, 8:49 PM

#

Okay, so if you say that you want to "create AI", you have to be specific about what kind you're talking about

#

What you're describing is probably an interactive language model. You can't create those from scratch.

#

They cost millions of dollars to create

#

There are other things you can do with AI that are attainable

verbal oar May 20, 2025, 9:03 PM

#

how can I get know even a little about llamaindex,langchain,crewai?

#

what is best option official docs?

viscid urchin May 20, 2025, 9:56 PM

#

I'm only really familiar with the LangChain part of it, and dang there is a lot of surface area to cover. I used the official docs myself, which seem pretty nice.. it's just a lot to take in before you might understand the "idiomatic" way to do something with it.

lapis sequoia May 21, 2025, 12:07 AM

#

Does anyone know how to run the langchain repository locally on Windows?

I get lots of errors and a whole mess when I run the make test command in both paths
https://python.langchain.com/docs/contributing/how_to/code/setup/

viscid urchin May 21, 2025, 12:07 AM

#

I'll have to give it a shot, but not every project's actual test suite works on Windows sadly.. lemme see if it's obvious whether that's the case here

#

Not a great sign, they do not appear to have automated Windows builds in their GitHub action setup.

regal bane May 21, 2025, 12:10 AM

#

wsl is an option

lapis sequoia May 21, 2025, 12:10 AM

#

viscid urchin I'll have to give it a shot, but not every project's actual test suite works on ...

I have tried multiple things and it's going crazy

lapis sequoia May 21, 2025, 12:11 AM

#

regal bane wsl is an option

yeah but that means I will work on Linux! right? but besides windows. I want to work on Windows

viscid urchin May 21, 2025, 12:17 AM

#

lapis sequoia yeah but that means I will work on Linux! right? but besides windows. I want to ...

Sorry for the delay, this is what I get in my Windows env:

=================== 2 failed, 564 passed, 87 skipped, 1 xfailed, 172 warnings, 63 errors in 20.06s ====================
mingw32-make: *** [Makefile:25: test] Error 1

#

That's after uv sync etc like their docs suggest.

#

I guess all I can say is that this is an obvious place where a new contributor could make a positive impact on the project.

#

It just needs some stuff set up, like cross-platform in their CI config instead of just Linux

#

I'm sure these are just tests that aren't perfect yet etc rather than the lib being massively broken on Windows.

#

I actually do not love the style of this test suite implementation

regal bane May 21, 2025, 12:19 AM

#

lapis sequoia yeah but that means I will work on Linux! right? but besides windows. I want to ...

you could use both

#

all your work can be done on windows and you pop up a wsl terminal to use lang chain

lapis sequoia May 21, 2025, 12:20 AM

#

@viscid urchin Thanks for trying and it's okay don't worry about the time of response. yeah, I got this amount of errors as well before, is it okay to ignore or what then? like is it safe to ignore them and do the work

regal bane May 21, 2025, 12:21 AM

#

definitely not optimal but it works

viscid urchin May 21, 2025, 12:26 AM

#

lapis sequoia @viscid urchin Thanks for trying and it's okay don't worry about the time...

If you plan to contribute to langchain, it's probably worth it to at least also set up a WSL environment so you can have an 'all green' test run to compare against. If you just plan to use it, I'd say simply using it on Windows and expecting it to work is fine.. If you find something that doesn't work on Windows, you can open a github issue etc.

lapis sequoia May 21, 2025, 12:27 AM

#

viscid urchin If you plan to contribute to langchain, it's probably worth it to at least also ...

thanks, it's quite weird that a big thing like them didn't sort such a thing

#

anyway thanks to all of you @viscid urchin @regal bane

viscid urchin May 21, 2025, 12:31 AM

#

Honestly you might consider filing an issue for "please add Windows to your CI build"

#

Somebody might come along and do it

#

(I might even do it)

#

If I used LC "in anger" I 100% would.

lapis sequoia May 21, 2025, 12:38 AM

#

okay mate, I will see thanks

viscid urchin May 21, 2025, 12:47 AM

#

(I did just look pretty hard on their Issues list and there are a lot of things mentioning Windows, but nothing that seems to be asking to enhance the automated tests that get run.)

lapis sequoia May 21, 2025, 12:56 AM

#

viscid urchin (I did just look pretty hard on their Issues list and there are a lot of things ...

sorry for being late, probably because most of people just use wsl or linux on VBox or Linux as a main OS

#

who contribute a lot, idk. this is just my guess

viscid urchin May 21, 2025, 12:57 AM

#

Yeah, I'm one of those weirdos who runs a "Windows native zsh" env

lapis sequoia May 21, 2025, 12:58 AM

#

did you find an issue about this case or it's better to open one?

viscid urchin May 21, 2025, 12:58 AM

#

I'd open one; didn't find one that looked good to jump on.

lapis sequoia May 21, 2025, 12:58 AM

#

I am a windows lover tbh, I used linux for quite good time but didn't like it. although I studied it and so on

viscid urchin May 21, 2025, 12:58 AM

#

Just be super clear/polite/etc and describe the problem + proposed next step etc.

#

Yeah I've never come to love Linux.. (I do love FreeBSD though)

lapis sequoia May 21, 2025, 12:59 AM

#

viscid urchin I'd open one; didn't find one that looked good to jump on.

okay so, I am not going to open one as long as you will do

viscid urchin May 21, 2025, 1:01 AM

#

Go ahead if you've got the inclination; I'm feeling lazy, just catching up on MotoGP 🙂

#

I'll gladly star/react/etc it if you do though 🍹

lapis sequoia May 21, 2025, 1:02 AM

#

for now, I am feeling lazy too. lol maybe another time

#

Do you contribute in Langchain?

viscid urchin May 21, 2025, 1:09 AM

#

No, but I've been toying with the idea to learn it better

#

and honestly you've found some low-hanging fruit that I might work on

lapis sequoia May 21, 2025, 1:23 AM

#

nice, good luck

obtuse acorn May 21, 2025, 1:33 AM

#

any idea why my MLPClassifier from scikit learn performs better when i do a gridsearch cv but worse when i just fit it with the pipe?

#

i think it might be something to do with the cross validation?

#

im using skf = StratifiedKFold(n_splits=5, shuffle=False) because ive got time series data and i figured it would be best to keep it in order

torpid mirage May 21, 2025, 1:34 AM

#

viscid urchin Go ahead if you've got the inclination; I'm feeling lazy, just catching up on Mo...

!!!!

#

You watch MotoGP?

#

My homie

#

🫂

obtuse acorn May 21, 2025, 1:35 AM

#

obtuse acorn im using `skf = StratifiedKFold(n_splits=5, shuffle=False)` because ive got time...

so i split it once at the beginning to get a test and train split

skf = StratifiedKFold(n_splits=5, shuffle=False)
skf.get_n_splits(X, y)
groups = dataFiltered[target].values

for train_index, val_index in skf.split(X, y):
    train_set = dataFiltered.iloc[train_index]
    test_set = dataFiltered.iloc[val_index]
    X_train, y_train = train_set.drop(columns=[target]), train_set[target]
    X_test, y_test = test_set.drop(columns=[target]), test_set[target]

#

then i ran

gridSearch = GridSearchCV(pipe, param_grid=param_grid, scoring='f1_macro', cv=skf, n_jobs=-1)

gridSearch.fit(X_train, y_train)

#

is it because i set cv=skf?

#

and somehow its matching the gridsearch results now

#

no idea whats going on

#

it was like 100% train accuracy and 85% test accuracy after the grid search

#

and then it was like 20% for both when i just fitted the pipe
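
Two things in the snippet above may be worth checking. The `for` loop reassigns `X_train`/`X_test` on every iteration, so only the last fold survives as the "split". And `StratifiedKFold(shuffle=False)` keeps row order inside each fold but still trains on rows that come after the validation rows. A minimal sketch of a time-ordered alternative, assuming the rows are already sorted by time (the toy arrays and the `split_at` cut point are hypothetical):

```python
import numpy as np
from sklearn.model_selection import TimeSeriesSplit

# Toy stand-in: 20 rows already sorted by time.
X = np.arange(20).reshape(-1, 1)
y = np.array([0, 1] * 10)

# Hold out the most recent rows once, and never touch them during the search.
split_at = 16
X_train, X_test = X[:split_at], X[split_at:]
y_train, y_test = y[:split_at], y[split_at:]

# Each fold trains on the past and validates on the block right after it.
tscv = TimeSeriesSplit(n_splits=3)
folds = list(tscv.split(X_train))
for train_idx, val_idx in folds:
    print("train", train_idx.min(), "-", train_idx.max(),
          "| validate", val_idx.min(), "-", val_idx.max())
```

A splitter like `tscv` can be passed as `cv=` to `GridSearchCV` in place of `skf`. Whether this matches how the data was collected is something only a look at the timestamps can confirm.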

viscid urchin May 21, 2025, 3:53 AM

#

torpid mirage You watch MotoGP?

Yeah, I watch MotoGP, WEC, WRC, and F1 currently. I miss WRX but don't have an easy way to get it it seems 😦

jaunty helm May 21, 2025, 4:33 AM

#

people familiar with sktime: how do I use parallel processing with transformations like Catch22?

#

I think I've tracked it down to

```py
c22 = Catch22().set_config( ... )
```
but nothing I put in `set_config` seems to do anything,
```py
cfg = { "backend:parallel": "loky" }
cfg = { "backend": "joblib" }
```
etc, cpu usage is about the same

polar hornet May 21, 2025, 4:47 AM

#

https://discord.com/channels/267624335836053506/1374609166747828305 Hello, can someone help me answer some questions here for my scientific article assignment?

coral sage May 21, 2025, 6:28 AM

#

Hi, I'm trying to train a yolov11n model (to run on mobile devices) and I'm trying to train it using the entire COCO dataset (for real-time object detection). Problem is I vastly underestimated how long it was going to take to train and I wanted to know if there's anything I'm doing wrong or anything I can do to speed up the process.

Here's my code below (I haven't even changed much, it's mostly just straight from the ultralytics documentation except the dropout, patience and device (because I'm using an M1 Pro Macbook))

from ultralytics import YOLO

# Load a model
model = YOLO("yolo11n.pt")  # load a pretrained model (recommended for training)

# Train the model
results = model.train(
    data="coco.yaml", 
    epochs=100, 
    imgsz=640, 
    patience=10, 
    device="mps", 
    dropout=0.01
)

#

The dataset is already installed and I had left it to train overnight but it didn't even complete one epoch

#

I estimated that it would complete at least two but I think the time per iteration increased significantly overnight

#

and it didn't even save a last.pt or best.pt model when I interrupted the block

#

I played around with the batch size, and it started taking upwards of 40 GB of Memory at one point (I only have 16 GB of RAM so the rest was SWAP), so I just left it back to the default.

obtuse acorn May 21, 2025, 6:41 AM

#

any idea if i should drop day of the week or just leave it?

coral sage May 21, 2025, 6:42 AM

#

obtuse acorn any idea if i should drop day of the week or just leave it?

What's the label you're training for?

obtuse acorn May 21, 2025, 6:42 AM

#

type

coral sage May 21, 2025, 6:44 AM

#

I'm no expert, but I'd leave it in probably. What's the difference between day and day of the week?

coral sage May 21, 2025, 6:45 AM

#

coral sage I'm no expert, but I'd leave it in probably. What's the difference between day a...

I say so it could just be a non-linear relationship

#

and it's not like the other features have a high correlation either with respect to day of the week which makes it less significant

charred ferry May 21, 2025, 8:11 AM

#

can i ask about data analytics, big data, data lakes and data warehouse here?

#

I assume this is the correct channel but just wanna be sure.

#

Basically, I am deciding between building a data warehouse project or a project that involves big data concepts, data lake, machine learning and basically data analytics for real-time recommendations. I'm unsure which to go for. Is there anyone who worked on either and can share their opinion on what their experience was like while working on their work/project?

#

I am asking this because as soon as I choose my final year project then that is likely the field I will be going into as a junior developer (whatever u call it) since this would be the biggest project I ever produced (when I complete it).

obtuse acorn May 21, 2025, 10:15 AM

#

coral sage I'm no expert, but I'd leave it in probably. What's the difference between day a...

day is what the date is

#

day of the week is like, its a monday

final jolt May 21, 2025, 2:45 PM

#

charred ferry I am asking this because as soon as I choose my final year project then that is ...

Is this like end of the year for a 4 year degree project or something else? Unless this is some like guided schooling where you go right from school>internship>employment the project is probably not going to have as massive an impact in the sense of forcing you into one side or the other in your career. What is your degree in and what kind of projects have you done so far? What level of interest do you have in either category?

jaunty helm May 21, 2025, 4:58 PM

#

jaunty helm I think I've tracked it down to ```py c22 = Catch22().set_config( ... ) ```but n...

update: seems to be some strangeness with pipelines in sktime, using it directly does seem to employ parallelism now (cpu high):

c22 = Catch22().set_config({
  'backend:parallel': 'loky', 
  'backend:parallel:params': {
    'n_jobs': -1  # technically not needed because -1 is the default
  }
})
c22.fit_transform(time_series)

though I don't really see a difference in run time
in contrast, Catch22Wrapper requires pycatch22 but is like a bjillion times faster

#

for reference: I've a multivariate (6) time series, about 2500 in length
Catch22 takes ~1min to fit_transform
Catch22Wrapper takes ~0.08sec to fit_transform

jaunty helm May 21, 2025, 5:36 PM

#

my first time experience with sktime definitely isn't the best

another example: I can't seem to get something as simple as chopping / padding all time series to a length of 2500 to work

```py
preprocess_pipeline = (
  TruncationTransformer(2500)
  * PaddingTransformer(2500)
)
preprocess_pipeline = (
  PaddingTransformer(2500)
  * TruncationTransformer(2500)
)
```
these 2 both don't work, throwing out some error I'm not sure how to fix

eventually I just did the truncation part manually through some `polars` `filter`ing on the index, leaving me with only `PaddingTransformer`
then there's another performance issue, as it takes several minutes *just* to do what should be a simple pad (granted I do have a lot of data)
eventually I ditched it as well and tried only `polars`, the resulting code again only takes a few seconds
```py
# something like this for padding
# (c is assumed to be an alias for pl.col)
(
  df
  .filter(c('time_series_id').is_first_distinct())
  .select(
    'time_series_id',
    pl.lit(list(range(pad_len))).alias('index')
  )
  .explode('index')
  .join( ... )
)
```

agile cobalt May 21, 2025, 5:44 PM

#

maybe pl.int_ranges(pad_len).alias('index') instead of pl.lit(list(range(...)))

jaunty helm May 21, 2025, 5:45 PM

#

agile cobalt maybe `pl.int_ranges(pad_len).alias('index')` instead of `pl.lit(list(range(...)...

I think I tried that but polars thinks what you're trying to do is create a column where each value is 1 int

agile cobalt May 21, 2025, 5:45 PM

#

int_range or int_ranges?

jaunty helm May 21, 2025, 5:45 PM

#

or maybe I haven't tried that idk, my brain is frying from debugging

jaunty helm May 21, 2025, 5:46 PM

#

agile cobalt `int_range` or `int_ranges`?

ah right, I think I did int_range
good catch and ty

agile cobalt May 21, 2025, 5:47 PM

#

but yeah common polars W Chad
I haven't messed much with its time series related features, but it has a lot of methods specifically for it too

jaunty helm May 21, 2025, 5:49 PM

#

unfortunate that the integration is still lagging behind 😔

#

if I use sktime again, some of the stuff supports polars while others don't, so it's probably easier to just stick to pandas (or at least, before you pass into the transforms)
also there's no tutorials explaining how you'd use a polars dataframe with the transforms, I figured it out by code digging: columns starting with __index__ will be recognized as the time series id / time index / etc
so actually I had to have column names like __index__time_series_id or __index__time, then down the line find that some don't work and .to_pandas() anyway

verbal oar May 21, 2025, 7:10 PM

#

I think shuffle=True

verbal oar May 21, 2025, 7:10 PM

#

obtuse acorn im using `skf = StratifiedKFold(n_splits=5, shuffle=False)` because ive got time...

.

#

ah keep it in order

frigid niche May 22, 2025, 1:27 AM

#

Would it be appropriate to post in here an academic website I made detailing my Neural Network that runs on a TI 84 Plus Silver Edition capable of autocorrecting words?

serene scaffold May 22, 2025, 1:35 AM

#

frigid niche Would it be appropriate to post in here an academic website I made detailing my ...

you may post it here once.

frigid niche May 22, 2025, 1:37 AM

#

Understood, thank you. I hope it will be of interest to anyone who happens across it.
https://hermesoptimus.vercel.app/

HERMES OPTIMUS - Neural Network for TI-84 Plus

A neural network implementation for the TI-84 Plus Silver Edition calculator capable of autocorrecting words.

rich moth May 22, 2025, 3:46 AM

#

I feel like this is the smoking gun. What do you guys think? It's domain distribution of 62 datasets across 4 domains. I found in my research all of them follow the same mathematical law when ranked. Information itself has a universal structure...

#

Information organizes itself different in complexity space. But even in chaos within the constraints of physical laws, there is structure.

#

Makes you really wonder about the universe itself..

plush kettle May 22, 2025, 4:44 AM

#

Guys how do you train an RCNN model that generates 2000 proposals on colab, I tried and it just crashed because the ram isn't enough

#

So, I modified the original RCNN’s selective search, to generate 1/4 of the original proposal size

#

Also is it normal for RCNN to start with insane loss like say 200 or 100

#

Also how do I remove tensor from GPU ram I tried del tensor and cuda cache remove but cant

verbal oar May 22, 2025, 8:55 AM

#

detach?
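
For the GPU memory question, a common cause is that something still holds a reference to the tensor (a stored loss with its autograd graph, a list of outputs, a notebook variable), in which case `del` on one name is not enough. Below is a minimal PyTorch sketch of the usual release sequence; the helper name `run_step_and_release` and the tensor sizes are made up for illustration, and it falls back to CPU when no CUDA device is present.

```python
import gc

import torch


def run_step_and_release(device: torch.device) -> tuple[float, int]:
    """Allocate a tensor, use it, then drop every reference to it (hypothetical helper)."""
    x = torch.ones(1024, 1024, device=device, requires_grad=True)
    loss = (x * 2).sum()

    # Keep a plain Python float for logging; storing `loss` itself would keep
    # the tensor and its autograd graph alive.
    logged_loss = loss.item()

    # Drop all Python references, then collect anything stuck in reference cycles.
    del x, loss
    gc.collect()

    if device.type == "cuda":
        # Hand cached, now-unused blocks back to the driver so that tools
        # such as nvidia-smi actually show the drop.
        torch.cuda.empty_cache()
        return logged_loss, torch.cuda.memory_allocated(device)
    return logged_loss, 0  # nothing lives on the GPU when running on CPU


device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
logged_loss, leftover_bytes = run_step_and_release(device)
```

`detach()` on its own does not free memory; it only gives a tensor that is cut off from the autograd graph, which helps when the graph is what keeps the memory alive.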

thick pier May 22, 2025, 11:39 AM

#

hi can anyone instruct how to start with data science while u have no knowledge whatsoever

radiant cipher May 22, 2025, 12:19 PM

#

anyone aware of an open source lib that facilitates agents, data retrieval, memory and memory usage

charred ferry May 22, 2025, 12:20 PM

#

final jolt Is this like end of the year for a 4 year degree project or something else? Unl...

It is for the final year of my BSc Computer Science degree. I'm gonna be entering final year this upcoming September. Basically, because last year's final year students performed badly on the final year project, the teachers decided that the project should be started in the summer (for those who are going to be in the final year of their degree in September). I have interest in data analytics and data warehouses. In particular I love machine learning with data analytics. In fact, I was going to do that project instead (data analytics with machine learning). I started learning the basics of machine learning. I have beginner knowledge of Pandas. I am good with Python. Right now I am trying to look for beginner friendly projects I can work on. I want to do this because I will need a teacher to act as my supervisor for my final year project. Some teachers may ask for my CV and experience with machine learning and data analytics. I hope to do 1 or 2 beginner friendly projects so I can convince a teacher that I am able to learn the required concepts in order to do the project I choose.

agile cobalt May 22, 2025, 12:29 PM

#

radiant cipher anyone aware of a open source lib that facilitates agents, data retrieval, memor...

memory is generally a bit awkward, imo there isn't a good one-fits-all solution

maybe take a look at Llama Index though

radiant cipher May 22, 2025, 12:30 PM

#

agile cobalt memory is generally a bit awkward, imo there isn't a good one-fits-all solution ...

will do - i specifically want to index source code and then have the llm search for "bad" things and try to replace them with "good" things - as part of some kind of tech debt sweeping tool

charred ferry May 22, 2025, 12:35 PM

#

Are websites like these a good start for someone wanting to do beginner friendly machine learning projects? Advice on what projects to do as I progress would be helpful (so I can get a feel for machine learning and get practical experience/improve my knowledge) https://www.freecodecamp.org/news/how-to-build-a-house-price-prediction-model/

freeCodeCamp.org

How to Build A House Price Prediction Model – Linear Regression E...

Ever wondered how algorithms predict future house prices, stock market trends, or even your next movie preference? The answer lies in a fundamental yet powerful tool called linear regression. Don't be fooled by its seemingly simple equation – this ar...

agile cobalt May 22, 2025, 12:37 PM

#

radiant cipher will do - i specificall want to index sourcecode and then have the llm search fo...

that would probably end up crazy expensive if you show the entire repository each time, so you might want to start by creating a tool that will allow for the agent to search more effectively

could be something simple like a ctrl+shift+F equivalent, or maybe something more complex like creating linter rules
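
A ctrl+shift+F style tool for an agent can be very small. The sketch below uses only the standard library; the function name `search_repo`, its parameters and the demo files are made up for illustration, and how the function gets registered as a tool depends on whichever agent framework is used.

```python
import re
import tempfile
from pathlib import Path


def search_repo(root, pattern, suffixes=(".py",), max_hits=20):
    """Return (relative path, line number, line) for lines matching a regex."""
    regex = re.compile(pattern)
    root = Path(root)
    hits = []
    for path in sorted(root.rglob("*")):
        if not path.is_file() or path.suffix not in suffixes:
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            continue  # skip binary or unreadable files
        for lineno, line in enumerate(text.splitlines(), start=1):
            if regex.search(line):
                hits.append((path.relative_to(root).as_posix(), lineno, line.strip()))
                if len(hits) >= max_hits:
                    return hits  # cap the output so it stays cheap to put in a prompt
    return hits


# Tiny throwaway "repository" for demonstration.
demo_root = Path(tempfile.mkdtemp())
(demo_root / "a.py").write_text("import os\nos.system('ls')  # shell call\n", encoding="utf-8")
(demo_root / "b.py").write_text("print('hello')\n", encoding="utf-8")

hits = search_repo(demo_root, r"os\.system\(")
```

The agent then only sees the handful of matching lines instead of the entire repository, which is where the cost saving comes from.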

jaunty helm May 22, 2025, 12:52 PM

#

charred ferry Are websites like these good start for someone wanting to do beginner friendly m...

kaggle?

radiant cipher May 22, 2025, 2:39 PM

#

agile cobalt that would probably end up crazy expensive if you show the entire repository eac...

my current mess of an idea is to try and create memories the llms use to compare and see how i can wire them up - its entirely ok if a run on 100 repos takes a night on controlled hardware as long as i get the single starting points working

what i'm trying to do is find cargo cult-ed instances of initial iterations of ideas that were adopted across many repos - and then reporting and/or trying to suggest a fix

agile cobalt May 22, 2025, 2:47 PM

#

just to check, do you understand the difference between RAG ""memories"" and in-context "memory"? especially how much the model knows about each

radiant cipher May 22, 2025, 2:56 PM

#

agile cobalt just to check, do you understand the difference between RAG ""memories"" and in-...

indeed - i believe i have to sort that out - i may run into a situation where i have to run hundreds of prompts + do memory/storage to ensure in context memory first - rag memory may end up just being something that keeps track so i can split the problem into more chunks

it would be so nice if there was a way to make context fragments and combinations of them instead of always streaming the tokens

radiant cipher May 22, 2025, 3:03 PM

#

radiant cipher indeed - i believe i have to sort that out - i may run into a situation where i ...

seems like someone came up with something for that unfortunately i only found a youtube vid discussing it - if permitted i'll post a link

agile cobalt May 22, 2025, 3:07 PM

#

pretty sure it is fine (posting relevant yt links)

serene grail May 22, 2025, 3:08 PM

#

agile cobalt just to check, do you understand the difference between RAG ""memories"" and in-...

I'd love an explanation of this if you have some link on hand

radiant cipher May 22, 2025, 3:09 PM

#

https://www.youtube.com/watch?v=YNQKq1YfBAI discusses a paper that has llms make and use memories from the tokens they take - it claims to be better than infinite context

unfortunately there doesnt seem to be a implementation linked

YouTube

Tunadorable

The END of RAG? Episodic memory for infinite context length

HUMAN-LIKE EPISODIC MEMORY FOR INFINITE CONTEXT LLMS
ArXiv: https://arxiv.org/abs/2407.09450
Bytez: https://bytez.com/docs/arxiv/2407.09450
AlphaXiv: https://alphaxiv.org/abs/2407.09450


agile cobalt May 22, 2025, 3:10 PM

#

serene grail I'd love an explanation of this if you have some link on hand

models can only really see what is in the context window (aka the prompt itself)

most abstract "memory" techniques use RAG to control what is included in the prompt - but the model does not have full access to all memories this way, it only sees subsets of it determined by the retrieval strategy
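
The point above can be shown with a toy sketch: the "memory store" holds everything, but only the top-k retrieved entries are pasted into the prompt, so that is all the model ever sees. The scoring here is a naive word overlap chosen to keep the example dependency free (real setups typically use embeddings), and the names `retrieve` and `build_prompt` plus the sample memories are made up for illustration.

```python
def retrieve(memories, query, k=1):
    """Rank stored memories by how many words they share with the query."""
    query_words = set(query.lower().split())

    def overlap(memory):
        return len(query_words & set(memory.lower().split()))

    # sorted() is stable, so ties keep their original order.
    return sorted(memories, key=overlap, reverse=True)[:k]


def build_prompt(memories, query, k=1):
    """Only the retrieved subset ends up in the context window."""
    selected = retrieve(memories, query, k)
    context = "\n".join(f"- {memory}" for memory in selected)
    return f"Relevant notes:\n{context}\n\nQuestion: {query}"


memories = [
    "the user prefers polars over pandas",
    "the user's GPU has 8 GB of VRAM",
    "the user is following a diamonds price tutorial",
]
prompt = build_prompt(memories, "how much VRAM does the GPU have", k=1)
```

With `k=1` the prompt contains the VRAM note and nothing about polars or the tutorial; as far as the model is concerned, those other memories do not exist for this request.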

serene grail May 22, 2025, 3:11 PM

#

agile cobalt models can only really see what is in the context window (aka the prompt itself)...

And the context window is ultimately limited by your VRAM right?

agile cobalt May 22, 2025, 3:12 PM

#

serene grail And the context window is ultimately limited by your VRAM right?

yes but not only your VRAM, also how well the model can work with it and identify relevant information

serene grail May 22, 2025, 3:13 PM

#

Yeah that makes sense, thank you

agile cobalt May 22, 2025, 3:16 PM

#

radiant cipher https://www.youtube.com/watch?v=YNQKq1YfBAI discusses a paper that has llms make...

first time I hear about that paper, yeah idk

there is also prefix caching which might help a little

radiant cipher May 22, 2025, 3:18 PM

#

agile cobalt first time I hear about that paper, yeah idk there is also [prefix caching](<ht...

vllm looks like something i'd run instead of ollama

radiant cipher May 22, 2025, 3:47 PM

#

hmm - oh - i just learned about the model context protocol - that may be neat to put agents together

burnt geode May 22, 2025, 4:24 PM

#

Hi,
anyone with speech to speech realtime LLM experience in python?
ping me we need to develop the llm with function calling ability.

jaunty helm May 22, 2025, 4:28 PM

#

serene grail And the context window is ultimately limited by your VRAM right?

there's also a "hard limit" of sorts of what the base model was trained on which some techniques can get around while degrading quality

e.g. llama 3 was trained on text that was 8192 tokens at the longest, so that's the native limit you'll see being thrown around
during inference you can use Rotary Positional Embeddings (RoPE) scaling to extend that while degrading the quality of responses a bit
I believe that's the technique used in tuning llama 3.1 so it can "have 128k context" even though it's based on llama 3

and as mentioned by etrotta, there's a much-easier-to-hit soft limit of things being in context, but the llm being unable to utilize them
see RULER and the newer NoLiMa benchmarks

serene grail May 22, 2025, 4:52 PM

#

jaunty helm there's also a "hard limit" of sorts of what the base model was trained on which...

Thank you thank you!

agile cobalt May 22, 2025, 5:20 PM

#

burnt geode Hi, anyone with speech to speech realtime LLM experience in python? ping me we n...

last I checked open source models suck at it, your options are pretty much either gemini live or openai realtime, both of which are very expensive

woven prairie May 22, 2025, 7:37 PM

#

Hi

#

I want to build Gen AI project for Resume, Any suggestions

charred ferry May 22, 2025, 8:26 PM

#

```
# replacing the string values in some columns with numerical values
dict_1 = {'Ideal':5, 'Premium':4, 'Very Good':3, 'Good':2, 'Fair':1}
diamonds_df.cut = diamonds_df.cut.replace(dict_1)

dict_2 = {'D':7, 'E':6, 'F':5, 'G':4, 'H':3, 'I':2, 'J':1}
diamonds_df.color = diamonds_df.color.replace(dict_2)

dict_3 = {'IF':8, 'VVS1':7, 'VVS2':6, 'VS1':5, 'VS2':4, 'SI1':3, 'SI2':2, 'I1':1}
diamonds_df.clarity = diamonds_df.clarity.replace(dict_3)

# renaming the 'x','y','z' columns to more descriptive names
diamonds_df = diamonds_df.rename(columns={'x':'length_mm', 'y':'width_mm', 'z':'depth_mm'})

# removing dimensionless diamonds
diamonds_df = (diamonds_df.drop(diamonds_df[diamonds_df['length_mm']==0].index))
diamonds_df = (diamonds_df.drop(diamonds_df[diamonds_df['width_mm']==0].index))
diamonds_df = (diamonds_df.drop(diamonds_df[diamonds_df['depth_mm']==0].index))

# dropping duplicated rows in the DataFrame if there are any
diamonds_df = diamonds_df.drop_duplicates()
```
I am getting this message: "FutureWarning: Downcasting behavior in `replace` is deprecated and will be removed in a future version. To retain the old behavior, explicitly call `result". Is replace deprecated? Will it lose support in future? I'm new to Pandas. I just want to know if I should use .replace() or is it not good to use?

#

Im following this tutorial: https://medium.com/@idowuadamo2904/machine-learning-for-price-prediction-a-step-by-step-guide-ad5913b5cec7

Medium

Machine Learning for Price Prediction: A Step-by-Step Guide

Machine learning is an umbrella term for solving problems for which the development of algorithms by human programmers would be cost-prohibitive. Instead, the problems are solved by helping machines…

simple mist May 23, 2025, 1:45 AM

#

charred ferry ```# replacing the string values in some columns with numerical values dict_1 = ...

Yes, you can use pandas still with .replace

The warning is just about future changes to how pandas automatically changes data types after replacement. It's basically telling you that it wants you to follow up the diamonds_df.cut/color/clarity replace with a .infer_objects(copy=False); this will keep the old behavior, so your code won't be affected

#

You can also run pd.set_option('future.no_silent_downcasting', True) to opt in to the future behavior, which also stops the warning
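
Another way to avoid the question entirely is to use `Series.map` for this kind of label-to-number encoding, since mapping a string column through a dict of integers does not involve the downcasting step that the warning is about. This is a swapped-in technique, not what the tutorial uses; the tiny DataFrame and the names `cut_codes` and `color_codes` are made up for illustration.

```python
import pandas as pd

# Small stand-in for the diamonds dataset (hypothetical sample rows).
diamonds_df = pd.DataFrame(
    {
        "cut": ["Ideal", "Premium", "Good", "Fair"],
        "color": ["D", "G", "J", "E"],
    }
)

cut_codes = {"Ideal": 5, "Premium": 4, "Very Good": 3, "Good": 2, "Fair": 1}
color_codes = {"D": 7, "E": 6, "F": 5, "G": 4, "H": 3, "I": 2, "J": 1}

# map() looks every value up in the dict. Unlike replace(), a value that is
# missing from the dict becomes NaN instead of being left unchanged, so typos
# in the mapping show up as missing values.
diamonds_df["cut"] = diamonds_df["cut"].map(cut_codes)
diamonds_df["color"] = diamonds_df["color"].map(color_codes)

unmapped = int(diamonds_df[["cut", "color"]].isna().sum().sum())
```

Checking `unmapped == 0` afterwards is a cheap way to confirm that every category was covered by the dict.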

rich moth May 23, 2025, 5:27 AM

#

https://docs.google.com/document/d/1Bey9Qt6dcif0r4--rE3BP3GupnAw306EJGRgkZG7N3s/edit?usp=sharing

I updated my paper quite a bit, I think I'm toward the end of my journey.

Google Docs

AGPaper

The Unified Complexity Framework: A Novel Paradigm for Quantifying Data Complexity and Optimizing Curriculum Learning Andrew Scott Gracey Independent Researcher bigpunk2@gmail.com Abstract This paper introduces the Unified Complexity Framework (UCF), a novel and potentially transformative parad...

#

Please let me know if visuals come in

verbal oar May 23, 2025, 9:13 AM

#

I had about diamond price prediction but it was in excel

#

or sth not about coding in python

dire kiln May 23, 2025, 10:10 AM

#

Good Morning

#

I'm running LLama 3.2 1B with ONNX and DirectML because my AMD card is old. Loading it consumes 5.3GB of VRAM out of 8GB, which is okay, as long as it doesn't take it all.

#

1 initial prompt + 3 follow ups is enough to consume the rest of the 8GB total VRAM. From the 5th prompt onwards it gets really slow. Still better than CPU, but worrying.

#

Is this normal?
Are sessions stored in VRAM?
Is there a fix or a way to reduce VRAM usage?

#

I ran DeepSeek R1 using a converted model I pulled from HuggingFace and it was capable of prompting again and again just fine. Probably I didn't test it enough because the lack of an openai-compatible API convinced me to delete it. But I wonder if I'm doing something wrong or am ignorant about how this works.

#

This is my first time doing this but I'm a junior/mid level python dev

#

Tried Phi3.5-mini but there was a leak that doubled the VRAM usage on the first prompt and the model kept appending the answer over and over until it ran out of tokens and returned HTTP code 500.

#

Using Lemonade SDK as runtime+REST API

#

Maybe use Hybrid models that do integer calculus on the CPU to kinda split the data between RAMs? Idk, just brainstorming

stiff crown May 23, 2025, 12:19 PM

#

PACE methodology or CRISP-DM ?

dire kiln May 23, 2025, 12:38 PM

#

stiff crown PACE methodology or CRISP-DM ?

talking with me?

coral wyvern May 23, 2025, 12:54 PM

#

I have a dataset of 1.5 million users anime lists, and I want to build an anime recommendation website. But I have no idea how much a project of this scale would cost. Is there anyone who can give me rough estimate and maybe break down the expenses?

agile cobalt May 23, 2025, 1:00 PM

#

it depends, if you run everything locally and do not host anything on the cloud it's only going to cost electricity and time

you could also train some models in Google Colab and host in Hugging Face Spaces free of charge

it could get pretty expensive if you were to rent enough compute to handle thousands of users accessing it daily though

toxic pilot May 23, 2025, 1:19 PM

#

agile cobalt it depends, if you run everything locally and do not host anything on the cloud ...

i feel like this is something they could accomplish locally

#

it depends on what methodology they use tbh

toxic pilot May 23, 2025, 1:21 PM

#

serene grail And the context window is ultimately limited by your VRAM right?

depends on the model hyperparameter. in theory you could have as large a context window as you want

#

gpt2 has a context window of 1024 tokens

dire kiln May 23, 2025, 1:23 PM

#

agile cobalt models can only really see what is in the context window (aka the prompt itself)...

Very interesting! Could one store the context info locally in a way? Even at detriment of performance. Or maybe disable context windows all together?

dire kiln May 23, 2025, 1:25 PM

#

toxic pilot depends on the model hyperparameter. in theory you could have as large a context...

Thanks for tagging that message. Ended up being very relevant to my issue.

toxic pilot May 23, 2025, 2:34 PM

#

dire kiln Very interesting! Could one store the context info locally in a way? Even at det...

what do you mean?

dire kiln May 23, 2025, 2:37 PM

#

toxic pilot what do you mean?

The context's data. Idk how this work so pardon me. Could it be stored somewhere else at the cost of latency so that it doesn't keep consuming more and more VRAM?

#

Because a single very small context in amount of follow up prompts (4 prompts) is enough to take the remaining ~2.6GB of VRAM

#

I can still use it but it becomes very slow as it tries to free memory or use regular RAM, which is what I want at the end of the day: share the load. But I assume there's a more formal way of implementing this behavior?

toxic pilot May 23, 2025, 2:43 PM

#

dire kiln The context's data. Idk how this work so pardon me. Could it be stored somewhere...

context windows are usually relatively small compared to the model

#

in a model with millions of trainable parameters, a 1024 token or even a 100000 token context window takes up negligible space
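
A rough way to check how much room a context takes is to estimate the key/value cache that a transformer keeps per token during generation. The sketch below is back-of-the-envelope arithmetic, not a measurement: the layer count, KV head count and head size are assumed values for Llama 3.2 1B, 2 bytes per value assumes fp16, and whether a given runtime preallocates the cache for the maximum context length is also an assumption that would need checking.

```python
def kv_cache_bytes(seq_len, n_layers, n_kv_heads, head_dim, bytes_per_value=2):
    """Estimate KV cache size: one key and one value vector per layer, head and token."""
    keys_and_values = 2
    return keys_and_values * n_layers * n_kv_heads * head_dim * seq_len * bytes_per_value


# Assumed architecture values for Llama 3.2 1B (check the model config to confirm).
LLAMA_3_2_1B = {"n_layers": 16, "n_kv_heads": 8, "head_dim": 64}

sizes_mib = {
    tokens: kv_cache_bytes(tokens, **LLAMA_3_2_1B) / 2**20
    for tokens in (1024, 8192, 131072)
}
for tokens, mib in sizes_mib.items():
    print(f"{tokens:>7} tokens -> {mib:,.0f} MiB")
```

Under these assumptions a 1024 token context costs about 32 MiB, which is indeed small next to the weights, while a full 131072 token cache would be about 4 GiB. A few short prompts should therefore not eat 2.6GB on their own unless the runtime reserves memory for a much longer context up front or is leaking.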

toxic pilot May 23, 2025, 2:45 PM

#

dire kiln Because a single very small context in amount of follow up prompts (4 prompts) i...

what are you running?

#

what model?

dire kiln May 23, 2025, 2:50 PM

#

toxic pilot what are you running?

LLama 3.2 1B version with ONNX runtime under DirectML. Hosted using Lemonade SDK because of the OpenAI-compatible wrapper

GitHub

GitHub - lemonade-sdk/lemonade: Local LLM Server with NPU Acceleration

Local LLM Server with NPU Acceleration. Contribute to lemonade-sdk/lemonade development by creating an account on GitHub.

#

Either this, OpenCL or CPU as far as runtimes go

dire kiln May 23, 2025, 2:52 PM

#

dire kiln Good Morning

I explain it here

toxic pilot May 23, 2025, 2:56 PM

#

dire kiln I explain it here

maybe a memory leak

#

but i don’t think it’s an issue with the context window necessarily

dire kiln May 23, 2025, 3:00 PM

#

toxic pilot maybe a memory leak

Could it be something in how the model is converted to ONNX? I converted models before and the output suggests that there are losses in precision at least in my case. Those are official models, though. Converted, configured and fine tuned by AMD. They're hosted in the official organization at HuggingFace.

#

Maybe a leak in the runtime version that Lemonade depends on. Because of C extensions.

#

What if I download Lemonade's source and keep bumping versions of dependencies to see if it stops? Could it possibly work? xD

finite surge May 23, 2025, 7:19 PM

#

guys how do i get into making ai because im stuck, all i know is to learn python, rn im watching bro code idk if i should switch to freecodecamp

serene scaffold May 23, 2025, 7:36 PM

#

finite surge guys how do i get into making ai because im stuck all i know is to learn python ...

What does AI mean to you?

finite surge May 23, 2025, 7:48 PM

#

serene scaffold What does AI mean to you?

wdym

serene scaffold May 23, 2025, 7:48 PM

#

finite surge wdym

Please define AI for me without looking it up

finite surge May 23, 2025, 7:48 PM

#

aight

#

its artificial intelligence for me, i wanna make like a programme where someone gives it info and it can give info back and thats all i know, and i wanna make money by solving problems and wanna keep improving and not do a 9-5

finite surge May 23, 2025, 7:54 PM

#

serene scaffold Please define AI for me without looking it up

.

serene scaffold May 23, 2025, 8:06 PM

#

finite surge its artificial intelligence for me i wanna make like and programme where someone...

Learning about AI won't help you "escape the 9-5"

finite surge May 23, 2025, 8:07 PM

#

serene scaffold Learning about AI won't help you "escape the 9-5"

i still wanna just make money idc how little

serene scaffold May 23, 2025, 8:08 PM

#

finite surge i still wanna just make money idc how little

You can't do that with AI.

finite surge May 23, 2025, 8:08 PM

#

serene scaffold You can't do that with AI.

u can

serene scaffold May 23, 2025, 8:08 PM

#

finite surge u can

Okay, good luck.

finite surge May 23, 2025, 8:08 PM

#

serene scaffold Okay, good luck.

?

#

was that a test or smth im confused

serene scaffold May 23, 2025, 8:09 PM

#

finite surge ?

I'm not going to help you with something that I think is misguided and a waste of your time. If you're interested in actually learning about and understanding AI and preparing for a career in that space, I'm happy to help.

finite surge May 23, 2025, 8:10 PM

#

should i try cause im 14 so i wanted to make ai

serene scaffold May 23, 2025, 8:11 PM

#

finite surge should i try cause im 14 so i wanted to make ai

If you want to do the thing I said, there are worthwhile things you can start doing at 14

finite surge May 23, 2025, 8:12 PM

#

dont i gotta do college for cs or smth i dont really know

small igloo May 23, 2025, 8:24 PM

#

Hi I was thinking of making a CNN model to track real-time deforestation using satellite imagery, what dataset should I be using?

serene scaffold May 23, 2025, 8:57 PM

#

finite surge dont i gotta do college for cs or smth i dont really know

Yes

finite surge May 23, 2025, 9:11 PM

#

O

rich moth May 24, 2025, 2:33 AM

#

finite surge O

Don't listen to people telling you that you can't do something, if you wanna do it, go Nike on it and just do it. The worst you will do is fail and possibly learn something. This isn't rock climbing. But you need a better plan or idea, and to start researching how you want to work on it. You have all the tools at your fingertips, I recommend you start getting better with those first.

gritty vessel May 24, 2025, 5:12 AM

#

Hey guys I wanted to know How to train model on huge data

#

My Features are of shape for training 584,1536,1392,7 and targets 584,1536,1392

#

I left a model training overnight and it has not even completed 1 epoch yet

#

All data is about 100gb

#

so i stored both features and targets in separate npy files and then I am training on them in batches so all the data is not loaded in ram

#

any other way I can train little faster?

#

Or does it seem unusual for 1 epoch to take this much training time?
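
One way to batch from large `.npy` files without loading them whole is NumPy's memory mapping. The sketch below assumes the arrays were written with `np.save`; the function name `iter_batches`, the file names and the tiny demo shapes (standing in for 584 x 1536 x 1392 x 7) are made up for illustration. It does not address model speed, only how the data is read.

```python
import tempfile
from pathlib import Path

import numpy as np


def iter_batches(features_path, targets_path, batch_size):
    """Yield (features, targets) batches read lazily from disk."""
    # mmap_mode="r" maps the file instead of reading it; data is pulled in
    # from disk only for the slices that are actually touched.
    features = np.load(features_path, mmap_mode="r")
    targets = np.load(targets_path, mmap_mode="r")
    for start in range(0, features.shape[0], batch_size):
        stop = start + batch_size
        # np.array(...) copies just this slice into ordinary memory.
        yield np.array(features[start:stop]), np.array(targets[start:stop])


# Small stand-in files so the example runs anywhere.
demo_dir = Path(tempfile.mkdtemp())
np.save(demo_dir / "features.npy", np.zeros((10, 4, 4, 7), dtype=np.float32))
np.save(demo_dir / "targets.npy", np.zeros((10, 4, 4), dtype=np.float32))

batch_shapes = [
    (x.shape, y.shape)
    for x, y in iter_batches(demo_dir / "features.npy", demo_dir / "targets.npy", batch_size=4)
]
```

Note that pages read through a memory map can stay in the operating system's file cache, so reported RAM usage may keep growing during an epoch even though the process is not holding on to the batches itself.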

iron basalt May 24, 2025, 5:31 AM

#

gritty vessel My Features are of shape for training 584,1536,1392,7 and targets 584,1536,1392

Are these images? What are they?

gritty vessel May 24, 2025, 5:31 AM

#

yes images we can say

iron basalt May 24, 2025, 5:32 AM

#

Downscale if applicable.

gritty vessel May 24, 2025, 5:32 AM

#

size of arrays is 1536,1392

#

584 are timestamps

#

and 7 are channels

gritty vessel May 24, 2025, 5:35 AM

#

iron basalt Downscale if applicable.

Okie

gritty vessel May 24, 2025, 6:53 AM

#

I have one more doubt so as my model is training my ram usage is increasing

#

all data is about 100gb

#

so at max it should take 100gb and I'm training it in batches

#

and still its taking 133gb ram

#

on idle its around 10-15gb

lapis sequoia May 24, 2025, 11:10 AM

#

Dude I am creating most powerful research tool x model

finite surge May 24, 2025, 12:02 PM

#

rich moth Don't listen to people tell you can't do something, if you wanna do it, go Nike...

ty imma keep striving

sly isle May 24, 2025, 1:49 PM

#

What are the best courses/certificates for Data Science in 2025? 🤔

serene scaffold May 24, 2025, 1:56 PM

#

sly isle What are the best courses/certificates for Data Science in 2025? 🤔

There are no certificates for data science that have any value.

sly isle May 24, 2025, 1:56 PM

#

serene scaffold There are no certificates for data science that have any value.

Really? So what is the best way otherwise?

serene scaffold May 24, 2025, 1:57 PM

#

sly isle Really? So what is the best way otherwise?

Getting a degree in computer science with data science related coursework

sly isle May 24, 2025, 1:57 PM

#

serene scaffold Getting a degree in computer science with data science related coursework

Yes, I see that a course can't replace an entire degree. However, wouldn't an online course demonstrate practical knowledge?

serene scaffold May 24, 2025, 1:58 PM

#

sly isle Yes, I see that a course can't replace an entire degree. However, wouldn't a onl...

Every position you apply to will have many applicants with relevant degrees, so if you don't have one, your resume won't even be considered

sly isle May 24, 2025, 1:59 PM

#

serene scaffold Every position you apply to will have many applicants with relevant degrees, so ...

I'm currently enrolled in a degree, but I also want to do something outside of university, you know? 😅

serene scaffold May 24, 2025, 1:59 PM

#

sly isle I'm currently enrolled in a degree, but I also want to do something outside of u...

Talk to the professors for the data science courses and ask if you can participate in their research.

#

That's what I did, and it's the main reason I got a job.

sly isle May 24, 2025, 2:00 PM

#

serene scaffold Talk to the professors for the data science courses and ask if you can participa...

Where did you study at?

serene scaffold May 24, 2025, 2:00 PM

#

Virginia Commonwealth University

sly isle May 24, 2025, 2:01 PM

#

serene scaffold Virginia Commonwealth University

Interesting... I think that would make more sense. Thanks for the recommendation! 👍

rain kelp May 24, 2025, 2:06 PM

#

is this a good neural network model?

toxic pilot May 24, 2025, 2:10 PM

#

gritty vessel My Features are of shape for training 584,1536,1392,7 and targets 584,1536,1392

network size?

toxic pilot May 24, 2025, 2:10 PM

#

rain kelp is this a good neural network model?

depends;for the most part it looks decent

#

if that's the loss & train for something like MNIST, then you might be able to do better

rain kelp May 24, 2025, 2:11 PM

#

can i ask you if this one is better? i am new to this stuff so i cant compare to other graphs

toxic pilot May 24, 2025, 2:11 PM

#

if it is loss & train for something like NLP, then ur doing really well

rain kelp May 24, 2025, 2:11 PM

#

chat gpt told me the first one is better but i am worried about those spikes

toxic pilot May 24, 2025, 2:12 PM

#

rain kelp can i ask you if this one is better? i am new to this stuff so i cant compare to...

the first one is probably better

rain kelp May 24, 2025, 2:12 PM

#

alr thanks!

toxic pilot May 24, 2025, 2:12 PM

#

rain kelp chat gpt told me the first one is better but i am worried about those spikes

fluctuation is normal, as long as its not huge fluctuation all the time

toxic pilot May 24, 2025, 2:12 PM

#

rain kelp can i ask you if this one is better? i am new to this stuff so i cant compare to...

this one is a bit concerning because it starts off at 80 something % accuracy

rain kelp May 24, 2025, 2:12 PM

#

model = models.Sequential()
model.add(layers.Conv2D(32, (3,3), activation='relu', input_shape=(32, 32, 3)))
model.add(layers.BatchNormalization())
model.add(layers.MaxPooling2D(2,2))

model.add(layers.Conv2D(64, (3,3), activation='relu'))
model.add(layers.BatchNormalization())
model.add(layers.MaxPooling2D(2,2))

model.add(layers.Conv2D(64, (3,3), activation='relu'))
model.add(layers.BatchNormalization())

model.add(layers.GlobalAveragePooling2D())
model.add(layers.Dropout(0.5))
model.add(layers.Dense(64, activation='relu'))
model.add(layers.BatchNormalization())
model.add(layers.Dense(10, activation='softmax'))

#

this is my model

toxic pilot May 24, 2025, 2:13 PM

#

what are you trying to do?

rain kelp May 24, 2025, 2:13 PM

#

i am learning image classification

toxic pilot May 24, 2025, 2:13 PM

#

also you could experiment with filter sizes as well

#

maybe 3x3, 5x5, 7x7

#

oh for Mnist?

rain kelp May 24, 2025, 2:13 PM

#

what mnist?

toxic pilot May 24, 2025, 2:13 PM

#

like handwritten number detection?

rain kelp May 24, 2025, 2:14 PM

#

no i am using a dataset with 10 image classes and trying to classify the test data

toxic pilot May 24, 2025, 2:14 PM

#

ah okay

rain kelp May 24, 2025, 2:15 PM

#

this one:
(training_images, training_labels), (testing_images, testing_labels) = datasets.cifar10.load_data()

toxic pilot May 24, 2025, 2:16 PM

#

ill just say that my network for a similar kind of thing was:

Conv2d --> BatchNorm --> ReLU --> Conv2d --> BatchNorm --> ReLU --> MaxPool 2d --> Conv2d --> BatchNorm --> ReLU --> Max Pool 2d --> Dropout(0.5) --> Dense --> dropout(0.5) --> Dense --> Dropout(0.5) --> Linear Output layer

rain kelp May 24, 2025, 2:16 PM

#

ill try that one and compare! thanks

toxic pilot May 24, 2025, 2:16 PM

#

i probably used too many batch norms 💀

#

idk what dimensions ur images are, but my conv filters were 3x3

rain kelp May 24, 2025, 2:17 PM

#

also do u run stuff on your cpu or gpu? because i saw that using the nvidia gpu the training is much faster

toxic pilot May 24, 2025, 2:17 PM

#

rain kelp also do u run stuff on your cpu or gpu? because i saw that using the nvidia gpu ...

shouldnt really matter for such a small network

#

if ur on mac, use mps

rain kelp May 24, 2025, 2:18 PM

#

toxic pilot idk what dimensions ur images are, but my conv filters were 3x3

if you mean pixel wise i have scaled them to 1

toxic pilot May 24, 2025, 2:18 PM

#

if ur on a cuda supporting machine, use Cuda obviously

#

otherwise CPU is probably fine

rain kelp May 24, 2025, 2:18 PM

#

training_images, testing_images = training_images / 255, testing_images / 255

toxic pilot May 24, 2025, 2:18 PM

#

layers.Conv2D(32, (3,3), activation='relu', input_shape=(32, 32, 3)) the filter (kernel) size is the (3,3) part; the 32 is the number of filters

toxic pilot May 24, 2025, 2:18 PM

#

rain kelp if you mean pixel wise i have scaled them to 1

you mean gray scale?

#

should be fine

#

just means you get to use less feature maps in your conv layers

rain kelp May 24, 2025, 2:20 PM

#

ok thanks for the help. so i guess when doing the model its always best to test many different things and see whats better

toxic pilot May 24, 2025, 2:20 PM

#

rain kelp ok thanks for the help. so i guess when doing the model its always best to test ...

yeah.

#

for context, i ran on 24 epochs with a batchsize of 32

rain kelp May 24, 2025, 2:20 PM

#

is there like a logic behind or just brute force it ahah

toxic pilot May 24, 2025, 2:21 PM

#

with 60000 (i think?) images

rain kelp May 24, 2025, 2:21 PM

#

this is how ive done it:
history = model.fit(
training_images,
training_labels,
epochs=30,
validation_data=(testing_images, testing_labels),
callbacks=[early_stop, reduce_lr]
)

#

ill try your model now

toxic pilot May 24, 2025, 2:22 PM

#

dynamic lr is probably overkill but yeah, looks good

toxic pilot May 24, 2025, 2:23 PM

#

rain kelp is there like a logic behind or just brute force it ahah

it is kind of bruteforce

#

im sure there is a deterministic way to do things 💀

rain kelp May 24, 2025, 2:24 PM

#

yeah little by little ill learn this dark magic ahaha

toxic pilot May 24, 2025, 2:25 PM

#

there is a formula to prevent over fitting; if you keep the number of hidden neurons below N_h = N_s/(alpha * (N_i + N_o)) where N_i = # input, N_o = # output, N_s = # samples in training set and alpha = some scaling factor between 5-->10
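
As a worked example of that rule of thumb, here is a minimal sketch. The function name and the sample numbers are made up for illustration, and as far as I know the heuristic comes from the plain fully connected setting, so for a CNN it is at best a rough guide.

```python
def max_hidden_neurons(n_samples, n_inputs, n_outputs, alpha=5.0):
    """Upper bound N_h = N_s / (alpha * (N_i + N_o)) from the message above."""
    if alpha <= 0:
        raise ValueError("alpha must be positive")
    return n_samples / (alpha * (n_inputs + n_outputs))


# Illustrative numbers only: 20,000 training samples, 64 input features, 10 classes.
for alpha in (5, 10):
    bound = max_hidden_neurons(20_000, 64, 10, alpha)
    print(f"alpha={alpha}: keep hidden neurons below ~{bound:.0f}")
```

A larger alpha gives a smaller, more conservative bound.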

toxic pilot May 24, 2025, 2:26 PM

#

rain kelp model = models.Sequential() model.add(layers.Conv2D(32, (3,3), activation='relu'...

oh wait is this keras

#

or Tensorflow or whatever

rain kelp May 24, 2025, 2:26 PM

#

yeah

toxic pilot May 24, 2025, 2:26 PM

#

sweet

rain kelp May 24, 2025, 2:27 PM

#

you used torch? i have never done that yet

toxic pilot May 24, 2025, 2:27 PM

#

i use torch

#

and also burn.dev, which is a rust framework

#

it really doesnt matter imo

rain kelp May 24, 2025, 2:27 PM

#

whats the difference?

toxic pilot May 24, 2025, 2:27 PM

#

toxic pilot it really doesnt matter imo

^^ no difference

rain kelp May 24, 2025, 2:28 PM

#

ah ok so ill just learn one of the 2

toxic pilot May 24, 2025, 2:28 PM

#

i think TF might be more performant (????) but it really doesnt matter

rain kelp May 24, 2025, 2:29 PM

#

next project ill try that gpu stuff to speed thing up. when i read that i got very interested

toxic pilot May 24, 2025, 2:29 PM

#

🔥

rain kelp May 24, 2025, 2:30 PM

#

#

this is your model

toxic pilot May 24, 2025, 2:30 PM

#

rain kelp

hmm that doesnt look good

#

val > train accuracy is usually not good

rain kelp May 24, 2025, 2:30 PM

#

ill probably have to increase the epochs?

toxic pilot May 24, 2025, 2:30 PM

#

likely not

rain kelp May 24, 2025, 2:31 PM

#

its overfitting?

toxic pilot May 24, 2025, 2:31 PM

#

idk what channels you used

toxic pilot May 24, 2025, 2:31 PM

#

rain kelp its overfitting?

definitely not

rain kelp May 24, 2025, 2:31 PM

#

toxic pilot May 24, 2025, 2:32 PM

#

oh i used 32, 64, 128

#

also early stopping of 2 epochs

#

Adam with weightdecay of 1e-5

#

lr 1e-4

toxic pilot May 24, 2025, 2:40 PM

#

rain kelp

wait if ur gray scaling, why is your input 32x32x3

#

oh i also used a 1x1 padding, (idk if thats the same as padding="same" in keras)
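
On the padding question: for a 3x3 kernel with stride 1, padding one pixel on every side keeps the spatial size unchanged, which is also what Keras `padding="same"` gives in that case. A small sketch of the size arithmetic (the helper name is made up for illustration):

```python
def conv_output_size(n, kernel, padding=0, stride=1):
    """Spatial output size of a convolution: floor((n + 2p - k) / s) + 1."""
    return (n + 2 * padding - kernel) // stride + 1


# 32x32 input, 3x3 kernel, stride 1
no_padding = conv_output_size(32, kernel=3, padding=0)  # like Keras padding="valid"
pad_one = conv_output_size(32, kernel=3, padding=1)     # 1 pixel on every side
print(no_padding, pad_one)  # 30 32
```

Without padding, every 3x3 conv layer shaves two pixels off each spatial dimension, which adds up quickly on 32x32 images.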

rain kelp May 24, 2025, 2:43 PM

#

toxic pilot wait if ur gray scaling, why is your input 32x32x3

no i am not grayscaling. i am just normalizing the images

#

the data set has images of 32x32x3
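
To make the distinction concrete, here is a tiny sketch of the two operations on a single pixel. The function names are hypothetical; the grayscale weights are the common BT.601 luminance coefficients, which is one of several conventions.

```python
def normalize(pixel):
    """Scale an (r, g, b) pixel from 0..255 to 0..1; still three channels."""
    return tuple(channel / 255 for channel in pixel)


def to_grayscale(pixel):
    """Collapse (r, g, b) to one luminance value using BT.601 weights."""
    r, g, b = pixel
    return 0.299 * r + 0.587 * g + 0.114 * b


pixel = (255, 128, 0)
print(len(normalize(pixel)))  # still 3 channels, values now in [0, 1]
print(to_grayscale(pixel))    # a single number per pixel
```

Dividing by 255 leaves the input shape at 32x32x3; only grayscaling would turn it into 32x32x1.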

toxic pilot May 24, 2025, 2:44 PM

#

rain kelp the data set has images of 32x32x3

right, likely maps for r, g and b

rain kelp May 24, 2025, 2:45 PM

#

its basically image of planes, trucks, deer and other stuff. i have limited the training to 20k images and testing to 4k

toxic pilot May 24, 2025, 2:46 PM

#

rain kelp its basically image of planes, trucks, deer and other stuff. i have limited the ...

sounds about right

#

yeah no way ur overfitting

rain kelp May 24, 2025, 2:46 PM

#

so over fitting is when the lines go far apart because it memorises the training images?

#

this is the 3rd model

toxic pilot May 24, 2025, 2:47 PM

#

rain kelp this is the 3rd model

hmm, val > train for accuracy is usually not good

#

nor is train > val for loss

#

maybe simplify your model?

jaunty helm May 24, 2025, 2:48 PM

#

rain kelp so over fitting is when the lines go far apart because it memorises the training...

overfitting is the point when the training objective keeps improving but validation objective starts getting worse
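
That definition can be turned into a small check on the recorded loss curves. This is only a sketch: the function name is made up, the curves are invented numbers, and a single noisy epoch can trigger it, so in practice one would look at the trend over several epochs.

```python
def overfitting_start(train_loss, val_loss):
    """Return the first epoch index where val loss rises while train loss still falls.

    Returns None if that never happens in the recorded history.
    """
    for epoch in range(1, min(len(train_loss), len(val_loss))):
        train_improved = train_loss[epoch] < train_loss[epoch - 1]
        val_worsened = val_loss[epoch] > val_loss[epoch - 1]
        if train_improved and val_worsened:
            return epoch
    return None


# Made-up loss curves, not taken from the plots in this chat.
train = [1.9, 1.5, 1.2, 1.0, 0.8, 0.6]
val = [1.8, 1.5, 1.3, 1.25, 1.3, 1.4]
print(overfitting_start(train, val))  # 4
```

With Keras, the two lists would come from `history.history["loss"]` and `history.history["val_loss"]`.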

toxic pilot May 24, 2025, 2:49 PM

#

toxic pilot maybe simplify your model?

and also maybe weight decay

jaunty helm May 24, 2025, 2:49 PM

#

rain kelp this is the 3rd model

in this image the model is not overfitting cause the val loss is still decreasing

toxic pilot May 24, 2025, 2:52 PM

#

toxic pilot nor is train > val for loss

oh wait this might just be because of the dropout layers

jaunty helm May 24, 2025, 2:52 PM

#

training objective being worse than validation objective is also not a total disaster as it could occur naturally
like for example, having dropout layers (which you do) actively hurts model performance in training to seek a better generalized model
or say an image augmentation step like affine is a part of your training, then your training dataset keeps changing so the model can't really overfit it ever (unless it's highly overparameterized), so you might see that training loss stops decreasing at one point yet the validation loss keeps improving

rain kelp May 24, 2025, 2:55 PM

#

ok thanks for all the help. i have learned a lot. I will put this project to rest now and come back once i learn new things

rain kelp May 24, 2025, 2:56 PM

#

jaunty helm training objective being worse than validation objective is also not a total dis...

so in my other models i shouldve let it train longer as the values didnt plateau?

toxic pilot May 24, 2025, 2:57 PM

#

rain kelp so in my other models i shouldve let it train longer as the values didnt plateau?

well your early stopping condition is 5epochs with no performance increase, so its probably not a problem
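
For reference, the patience logic being described can be sketched in a few lines. This is a simplified stand-in, not the Keras callback itself (which has more options, such as a minimum improvement threshold); the function name and loss values are invented.

```python
def early_stopping_epoch(val_losses, patience=5):
    """Return the epoch index at which training would stop, or None if it never triggers.

    Stops once `patience` consecutive epochs pass without a new best val loss.
    """
    best = float("inf")
    epochs_without_improvement = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best:
            best = loss
            epochs_without_improvement = 0
        else:
            epochs_without_improvement += 1
            if epochs_without_improvement >= patience:
                return epoch
    return None


# Hypothetical validation losses: the best value so far is at epoch 2.
losses = [1.0, 0.9, 0.8, 0.85, 0.81, 0.82, 0.83, 0.84, 0.7]
print(early_stopping_epoch(losses, patience=5))  # 7
```

In this toy history a patience of 5 stops just before the late improvement at epoch 8, which is the trade-off to keep in mind when picking the value.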

jaunty helm May 24, 2025, 2:59 PM

#

rain kelp so in my other models i shouldve let it train longer as the values didnt plateau?

I mean if you did your data separation right, and validation loss is still decreasing, that just means that your model is still getting better at classifying unseen data (which is good)
In the first image you shared it looks like val loss kinda plateaued tho, so maybe not much to gain from further training that one
the one I replied to is still improving

#

and it's also not that difficult to compare your different models; just compare the val loss of them
e.g. in your post that said "is this a good neural network model? (img)" you can see that the val accuracy ended up at about 0.72
in the post "can i ask you if this one is better? i am new to this stuff so i cant compare to other graphs" the val accuracy is also about 0.72
in the post "this is your model" the val accuracy is only 0.5

so comparing the 3 in terms of performance, model 1 = model 2 > model 3 (roughly)
but model 3 can still be trained cause val loss is still improving

#

obviously you have to be a bit careful when you only compare on the same validation data cause in a way you're now just fitting to seen data
that's when cross validation comes in if you want to look that up
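
For anyone looking it up, the core of k-fold cross validation is just a way of splitting the sample indices so that every sample is used for validation exactly once. A minimal stdlib sketch (the function name and defaults are assumptions for illustration; libraries such as scikit-learn ship a ready-made version):

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train_indices, val_indices) pairs for k-fold cross validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]  # k nearly equal, disjoint folds
    for i in range(k):
        val = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, val


for train_idx, val_idx in k_fold_indices(10, k=5):
    print(len(train_idx), len(val_idx))  # 8 2 on every fold
```

The model is trained once per fold and the validation scores are averaged, so the comparison between models depends less on one particular validation split.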

toxic pilot May 24, 2025, 3:10 PM

#

jaunty helm and it's also not that difficult to compare your different models; just compare ...

although 20000 is admittedly a limited amount of data; would increasing epochs help? I'd tend to think not

#

i feel like a better solution for model 3 might be to downsize the model

#

also batch size could make a big difference

rain kelp May 24, 2025, 3:11 PM

#

but so a 73% accuracy means that there is still a lot of room for improvement?

toxic pilot May 24, 2025, 3:12 PM

#

rain kelp but so a 73% accuracy means that there is still a lot of room for improvement?

not necessarily

#

see how testing accuracy is plateauing?

rain kelp May 24, 2025, 3:13 PM

#

aah ok. ok ill do my last test where i train my first model to the whole dataset instead of just 20k images

jaunty helm May 24, 2025, 3:13 PM

#

toxic pilot although 20000 is admittedly a limited amount of data; would increasing epochs h...

well if the val loss starts plateauing then at that point more epochs probably doesn't help
(* though there was research a while ago that suggests training beyond that point for a long while will reach "model grokking" which is when after a long period of no improvement suddenly it improves again)

toxic pilot May 24, 2025, 3:15 PM

#

rain kelp aah ok. ok ill do my last test where i train my first model to the whole dataset...

you could also downsize your model and decrease (?) batch size

toxic pilot May 24, 2025, 3:16 PM

#

jaunty helm well if the val loss starts plateauing then at that point more epochs probably d...

the * part would require your optimizer to be using weight decay no?

jaunty helm May 24, 2025, 3:16 PM

#

rain kelp but so a 73% accuracy means that there is still a lot of room for improvement?

potentially
obviously, the perfect model would get 100% accuracy so in a sense yeah there's always room for improvement until you reach that
however you have to consider other problems
example: maybe your current training data inherently can't make such a model - heck, maybe your current training data can only ever make a model of 72% accuracy

rain kelp May 24, 2025, 3:18 PM

#

jaunty helm May 24, 2025, 3:19 PM

#

toxic pilot the * part would require your optimizer to be using weight decay no?

maybe not require, but it probably helps a lot
this was what I was talking about btw; then later some geniuses figured out to use fourier transforms to speed the process up here

toxic pilot May 24, 2025, 3:20 PM

#

jaunty helm maybe not require, but it probably helps a lot [this](https://arxiv.org/pdf/2201...

interesting; ill give it a read this summer

jaunty helm May 24, 2025, 3:20 PM

#

rain kelp

looks better
whatever you changed made it so the accuracy plateaus at about 0.75
(tho again be careful about overfitting yourself on the validation set)

rain kelp May 24, 2025, 3:20 PM

#

i just increased the training data from 20k to around 60k

toxic pilot May 24, 2025, 3:20 PM

#

nice!

gritty vessel May 24, 2025, 3:30 PM

#

toxic pilot network size?

unet 64-128-256-512-1024-512-256-128-64

#

its weather data so there are two conditions lightning and no lightning

#

i calculated manually so lightning events are only 3% of the dataset

#

should i apply weighted loss calculation?

#

my current run I believe will be completed by morning

#

currently loss is reducing but I am pretty sure Its because of no lightning cases

toxic pilot May 24, 2025, 3:47 PM

#

gritty vessel unet 64-128-256-512-1024-512-256-128-64

either use a huge model or randomly sample from your dataset

gritty vessel May 24, 2025, 3:47 PM

#

more dense model?

toxic pilot May 24, 2025, 3:48 PM

#

im not sure what your goal is, but 100gb of data will certainly overfit

gritty vessel May 24, 2025, 3:48 PM

#

my goal is to predict lightning

#

given 7 channels

#

but naturally no-lightning events are much more frequent than

#

lightning events

toxic pilot May 24, 2025, 3:49 PM

#

still think you have way too much data

gritty vessel May 24, 2025, 3:50 PM

#

i did some calculation

limpid zenith May 24, 2025, 3:50 PM

#

gritty vessel should i apply weighted loss calculation?

You can do Focal Loss

#

It will automatically handle class imbalance

gritty vessel May 24, 2025, 3:51 PM

#

total points in data = 8,052,129,792

toxic pilot May 24, 2025, 3:51 PM

#

gritty vessel total points in data = 8,052,129,792

trainable parameters?

limpid zenith May 24, 2025, 3:51 PM

#

Yeah if the model is too large it will also take forever

gritty vessel May 24, 2025, 3:52 PM

#

toxic pilot trainable parameters?

I didn't print it

#

when I am using pytorch I always forget to do that
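
In PyTorch the usual idiom for this is `sum(p.numel() for p in model.parameters() if p.requires_grad)`. The count can also be estimated by hand per layer; the sketch below does that for convolution layers. The helper name is made up, and the 3x3 kernel size is an assumption, since the chat only gives the channel widths.

```python
def conv2d_params(in_channels, out_channels, kernel_size, bias=True):
    """Trainable parameters of one 2D conv layer with a square kernel."""
    weights = kernel_size * kernel_size * in_channels * out_channels
    return weights + (out_channels if bias else 0)


# First two conv layers of a U-Net-like encoder on 7 input channels.
# The 3x3 kernel size is an assumption; it was not stated in the chat.
first = conv2d_params(7, 64, 3)
second = conv2d_params(64, 128, 3)
print(first, second)  # 4096 73856
```

The count grows with the product of input and output channels, so the wide middle layers of a U-Net hold most of the parameters.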

toxic pilot May 24, 2025, 3:52 PM

#

limpid zenith You can do Focal Loss

poly loss would also work

limpid zenith May 24, 2025, 3:53 PM

#

Poly loss? Like MSE?

gritty vessel May 24, 2025, 3:53 PM

#

gritty vessel total points in data = 8,052,129,792

lightning events = 42,560,000

toxic pilot May 24, 2025, 3:53 PM

#

limpid zenith Poly loss? Like MSE?

https://arxiv.org/pdf/2204.12511

gritty vessel May 24, 2025, 3:54 PM

#

so when we calculate percentage

#

damn its only 0.5%
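
Checking that arithmetic with the two counts quoted above gives roughly half a percent. The second number printed is the negative-to-positive ratio, which is the kind of value a weighted loss would typically use for the positive class (for example the `pos_weight` argument of `torch.nn.BCEWithLogitsLoss`); whether that is the right weight for this dataset is a judgment call.

```python
total_points = 8_052_129_792   # from the messages above
lightning_points = 42_560_000  # from the messages above

positive_fraction = lightning_points / total_points
negatives_per_positive = (total_points - lightning_points) / lightning_points

print(f"lightning share: {positive_fraction:.2%}")
print(f"non-lightning points per lightning point: {negatives_per_positive:.0f}")
```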

limpid zenith May 24, 2025, 3:54 PM

#

Oh didn't know about polyloss...awesome...learn something every day

toxic pilot May 24, 2025, 3:54 PM

#

gritty vessel damn its only 0.5%

yeah so maybe artificially drop out some of the non-lightning cases

gritty vessel May 24, 2025, 3:54 PM

#

3% I MIGHT HAVE MISCALCULATED IT

#

SORRY

toxic pilot May 24, 2025, 3:55 PM

#

just because you have a full dataset doesnt mean you should use the full dataset

gritty vessel May 24, 2025, 3:55 PM

#

toxic pilot yeah so maybe artificially drop out some of the non-lightning cases

then will it create bias in model? predicting lightning cases more

toxic pilot May 24, 2025, 3:55 PM

#

gritty vessel then will it create bias in model? predicting lightning cases more

not necessarily

#

your goal is to predict lightning; your model will learn the behavior/features it should expect before lightning vs not before lightning, and it shouldnt necessarily matter that no lightning occurs more frequently

#

so maybe randomly sample 42 million non-lightning events

#

and use that as your dataset

gritty vessel May 24, 2025, 3:56 PM

#

ok

#

just one more thing, Im passing 2d arrays so how will random sampling work?

#

It will create patches in data then right?

toxic pilot May 24, 2025, 3:57 PM

#

in fact your model will probably perform worse if ur doing a binary classification and one of your cases only makes up 3% of the data

toxic pilot May 24, 2025, 3:57 PM

#

gritty vessel just one more thing, Im passing 2d arrays so how will random sampling work?

im not sure how your data is structured so i cant say

gritty vessel May 24, 2025, 3:57 PM

#

consider it as an image

toxic pilot May 24, 2025, 3:57 PM

#

try resample from sklearn

#

https://scikit-learn.org/stable/modules/generated/sklearn.utils.resample.html

gritty vessel May 24, 2025, 3:58 PM

#

Number of samples: 834
Shape of one sample: (1536, 1392, 7)

#

this is for features

toxic pilot May 24, 2025, 3:58 PM

#

wait what

#

okay how many distinct images do you have

gritty vessel May 24, 2025, 3:58 PM

#

and target is No of samples 834
Shape of one sample 1536,1392

gritty vessel May 24, 2025, 3:59 PM

#

toxic pilot okay how many distinct images do you have

834

#

time stamps

toxic pilot May 24, 2025, 3:59 PM

#

?

#

oh

gritty vessel May 24, 2025, 3:59 PM

#

and each time stamp has 8 images: 7 features and 1 target

toxic pilot May 24, 2025, 3:59 PM

#

an lstm might actually be a good tool for this 💀

gritty vessel May 24, 2025, 4:00 PM

#

convlstm?

#

I am planning to use it but I am first trying to predict normally

#

after this, what I will do is add a lag to the data

toxic pilot May 24, 2025, 4:00 PM

#

well how do you plan to encode time series?

#

oh by concatting the images into one matrix?

gritty vessel May 24, 2025, 4:01 PM

#

that will have information loss?

#

matrix is 2d

toxic pilot May 24, 2025, 4:02 PM

#

well yes

gritty vessel May 24, 2025, 4:02 PM

#

we can say ndim array ?

south finch May 24, 2025, 4:02 PM

#

What's ndim

toxic pilot May 24, 2025, 4:03 PM

#

south finch What's ndim

n-dimensional tensor i assume?

gritty vessel May 24, 2025, 4:03 PM

#

n dimensional

#

yes

south finch May 24, 2025, 4:03 PM

#

gritty vessel n dimensional

Oh, I see

limpid zenith May 24, 2025, 4:03 PM

#

Looking at polyloss it seems like it needs class weights to prevent class imbalance.

#

Wouldn't Focal Loss be more appropriate if you don't want to compute class weights?

gritty vessel May 24, 2025, 4:04 PM

#

yes I was reading about it

toxic pilot May 24, 2025, 4:04 PM

#

limpid zenith Wouldn't Focal Loss be more appropriate if you don't want to compute class weigh...

maybe, but looking at Kaboom's goals and model holistically, i dont actually think itll be a problem

#

just use cross entropy or something

#

and randomly sample an equal number of non-lightning cases as lightning cases

limpid zenith May 24, 2025, 4:05 PM

#

Well 100gb class weights computation seems a bit much

gritty vessel May 24, 2025, 4:05 PM

#

it says it modifies cross entropy loss that down-weights the loss for easily classified examples
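
That description matches the usual binary focal loss, which multiplies cross entropy by (1 - p_t)^gamma so that confidently correct predictions contribute very little. A per-example sketch in plain Python follows; the function name and the example probabilities are made up, and a real training loop would use a vectorized version from the framework in use.

```python
import math


def binary_focal_loss(p, y, gamma=2.0, alpha=None):
    """Focal loss for one prediction: -alpha_t * (1 - p_t)**gamma * log(p_t).

    p is the predicted probability of the positive class, y is 0 or 1.
    With gamma=0 and alpha=None this reduces to ordinary binary cross entropy.
    """
    eps = 1e-12
    p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
    p_t = p if y == 1 else 1.0 - p   # probability given to the true class
    alpha_t = 1.0 if alpha is None else (alpha if y == 1 else 1.0 - alpha)
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)


easy = binary_focal_loss(0.05, 0)  # confident and correct "no lightning"
hard = binary_focal_loss(0.05, 1)  # missed lightning pixel
print(easy, hard)                  # the easy example contributes far less
```

With a dataset dominated by easy no-lightning pixels, this keeps the many easy examples from swamping the few hard ones.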

toxic pilot May 24, 2025, 4:05 PM

#

limpid zenith Well 100gb class weights computation seems a bit much

i mean class imbalance shouldnt be an issue at all

#

just dont use all the non-lightning data

gritty vessel May 24, 2025, 4:05 PM

#

but how we can remove it from 2d grid?

#

one thing I think is to clip 128 x 128 or 256 x256 snaps

#

over lightning events
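
The patch idea can be sketched as an index calculation: pick a lightning pixel and cut a fixed-size window around it, shifting the window inward at the borders. The function name and the event coordinates are hypothetical; the grid shape is the one quoted earlier in the chat.

```python
def patch_bounds(row, col, height, width, size=128):
    """Return (r0, r1, c0, c1) of a size x size window centred near (row, col).

    The window is shifted inward at the borders so it always stays inside the grid.
    """
    if size > height or size > width:
        raise ValueError("patch is larger than the grid")
    r0 = min(max(row - size // 2, 0), height - size)
    c0 = min(max(col - size // 2, 0), width - size)
    return r0, r0 + size, c0, c0 + size


# Grid shape from the chat: 1536 x 1392. The event coordinates are made up.
print(patch_bounds(700, 600, 1536, 1392))  # (636, 764, 536, 664)
print(patch_bounds(5, 1390, 1536, 1392))   # (0, 128, 1264, 1392)
```

With NumPy-style arrays the patch would then be `features[r0:r1, c0:c1, :]` and `target[r0:r1, c0:c1]`. Always centring on lightning would make every patch positive, so some randomly placed patches are probably needed as well.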

limpid zenith May 24, 2025, 4:08 PM

#

Is this a binary classification problem of lightning or no lightning?

toxic pilot May 24, 2025, 4:08 PM

#

limpid zenith Is this a binary classification problem of lightning or no lightning?

essentially yes

gritty vessel May 24, 2025, 4:08 PM

#

limpid zenith Is this a binary classification problem of lightning or no lightning?

yes

toxic pilot May 24, 2025, 4:09 PM

#

feeding in a time series, we want to find out if the next step is lightning or no lightning, is what im interpreting this as

gritty vessel May 24, 2025, 4:09 PM

#

exactly

#

thats a later step: I will give features at time t, and the targets will be at t+2

toxic pilot May 24, 2025, 4:09 PM

#

toxic pilot feeding in a time series, we want to find out if the next step is lightning or n...

so really, just cross entropy loss or BCE or something and just sample for a smaller subset of nonlightning data

limpid zenith May 24, 2025, 4:10 PM

#

Ahhh the events leading up to lightning will be used to predict lightning...yeah an LSTM is a good way for this with BCE

toxic pilot May 24, 2025, 4:10 PM

#

limpid zenith Ahhh the events leading up to lightning will be used to predict lightning...yeah...

padding a bunch of series would be pretty funny though

#

jank as hell

#

i mean itd work probably, but its just conceptually hilarious

gritty vessel May 24, 2025, 4:11 PM

#

toxic pilot so really, just cross entropy loss or BCE or something and just sample for a sam...

how will resampling work? Any small example?
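
A small example of random undersampling, under one assumption: the grids have already been cut into patches and each patch carries a 0/1 label. This uses only the standard library rather than `sklearn.utils.resample`, and the function name and toy labels are invented.

```python
import random


def undersample_indices(labels, seed=0):
    """Keep every positive index plus an equally sized random subset of negatives."""
    positives = [i for i, label in enumerate(labels) if label == 1]
    negatives = [i for i, label in enumerate(labels) if label == 0]
    keep = min(len(positives), len(negatives))
    sampled_negatives = random.Random(seed).sample(negatives, keep)
    return sorted(positives + sampled_negatives)


# Toy labels: 1 = patch contains lightning, 0 = it does not (hypothetical data).
labels = [0, 0, 1, 0, 0, 0, 1, 0, 0, 0]
kept = undersample_indices(labels)
print(kept, [labels[i] for i in kept])
```

The returned indices select a balanced training subset; the validation set should keep the real class ratio so the reported metrics stay honest.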

gritty vessel May 24, 2025, 4:12 PM

#

limpid zenith Ahhh the events leading up to lightning will be used to predict lightning...yeah...

yes conv lstm conv for spatial features and lstm for temporal features

limpid zenith May 24, 2025, 4:13 PM

#

toxic pilot i mean itd work probably, but its just conceptually hilarious

It would be yeah...maybe a consistent window of time before lightning? That way no need of padding in that dim
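
A fixed-length window like that can be built purely from time indices, with no padding needed. In this sketch the function name, the window length, and the horizon are all assumptions; a horizon of 2 would correspond to the "features at t, target at t+2" plan mentioned earlier.

```python
def fixed_windows(num_steps, window=6, horizon=1):
    """Return (input_indices, target_index) pairs using a fixed-length history."""
    samples = []
    for end in range(window, num_steps - horizon + 1):
        inputs = list(range(end - window, end))  # `window` consecutive time steps
        target = end + horizon - 1               # step to predict
        samples.append((inputs, target))
    return samples


samples = fixed_windows(10, window=4, horizon=1)
print(samples[0])    # ([0, 1, 2, 3], 4)
print(len(samples))  # 6
```

Each sample stacks `window` consecutive time stamps as the input sequence for something like a ConvLSTM, and every sequence has the same length.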

gritty vessel May 24, 2025, 4:15 PM

#

okie I will try this focal loss, resampling, and conv lstm. I will update you guys

#

is it ok? I mean can I update?

limpid zenith May 24, 2025, 4:16 PM

#

Yeah if somethings off or some error just message here yeah

gritty vessel May 24, 2025, 4:17 PM

#

Okie Thank you

rich moth May 25, 2025, 12:25 AM

#

Testing a Universal Complexity Framework (UCF) across different data types
I've been working on a mathematical framework that measures how information organizes itself, and got some interesting cross-domain results I wanted to share.
What I tested: UCF assigns a "phase angle" (θ) to different types of data based on their complexity patterns. The theory predicts certain ranges for different domains:

Financial markets: ~90° ("controlled uncertainty")
Mathematical sequences: ~0° ("pure order")
Physical systems: ~180° ("conservation")

Financial validation results:
Tested 4 major cryptocurrencies, all landed in the predicted 70-110° range:

BTC: 86.2°
ETH: 102.6°
ADA: 91.4°
XRP: 91.5°

Unexpected discoveries:

Prime numbers → 116.7° (closer to biological optimization than pure order)
Natural language → 180.5° (shows conservation-like patterns)
Chaos systems → 98.1° (confirmed controlled uncertainty)

What's interesting: UCF seems to detect consistent mathematical signatures across completely different types of information - financial data, language, mathematics, physics all show distinct but predictable patterns.
The financial predictions working so consistently was unexpected.

#

multiple runs show consistent results

jaunty helm May 25, 2025, 3:19 AM

#

gritty vessel consider it as an image

do you actually have images or not? you can "consider it as an image" doesn't mean that'll be the best way to solve it

little dawn May 25, 2025, 3:44 AM

#

can a person with low IQ or low problem solving skills become a good data scientist by doing practice/hardwork??

south finch May 25, 2025, 4:12 AM

#

little dawn can a person with low IQ or low problem solving skills become a good data scient...

Yea

bleak rampart May 25, 2025, 5:14 AM

#

little dawn can a person with low IQ or low problem solving skills become a good data scient...

Richard Feynman:
I was an ordinary person who studied hard. There are no miracle people. It happens they get interested in this thing and they learn all this stuff, but they’re just people.

bleak rampart May 25, 2025, 5:47 AM

#

rich moth Testing a Universal Complexity Framework (UCF) across different data types I've ...

Please elaborate what the phase angle is and what it indicates

rich moth May 25, 2025, 6:37 AM

#

bleak rampart Please elaborate what the phase angle is and what it indicates

hey @bleak rampart thanks for the question. In the UCF the 'structural phase angle θ' is designed to capture the nature or character of the internal organization and structure within a data sample. Think of it like this, while the Magnitude ∣Φ∣ tells us how much complexity or energy there is, the Phase θ tries to tell us what kind of structure is present

bleak rampart May 25, 2025, 9:39 AM

#

rich moth hey @bleak rampart thanks for the question. In the UCF the 'structural ...

Ohh! Thanks

For a given domain the angle wouldn't be a constant value, it would differ...So the angles you provided will be the average value during recent time ?

rich moth May 25, 2025, 10:48 AM

#

bleak rampart Ohh! Thanks For a given domain the angle wouldn't be a constant value, it would...

Yeah, for any data I throw at the UCF, the structural phase isn't going to be some static, one-size-fits-all number. Every individual chunk of data – whether it's a window of an RNA sequence or a snapshot of market indicators – gets its own θ based on its unique internal structure at that moment. That's why those polar plots above show a scatter of points; each one is a distinct UCF signature.

thick heron May 25, 2025, 12:57 PM

#

hey i have a hard time doing my project can some one look at my git maybe give me some suggestions

#

no? ( = . = ) its oky

odd tulip May 25, 2025, 2:06 PM

#

thick heron hey i have a hard time doing my project can some one look at my git maybe give m...

I dont mind, I cant promise ill be able to help you but I'll try

thick heron May 25, 2025, 2:13 PM

#

odd tulip I dont mind, I cant promise ill be able to help you but I'll try

Anything is better than nothing

#

Just feedback will do

thick heron May 25, 2025, 3:07 PM

#

TTS is not going great, that's why. Idk how to fix it; AI is feeding me nonsense and YT doesn't help

fair solar May 25, 2025, 3:24 PM

#

damn i should visit this channel more often, TIL abt polyloss

woven prairie May 25, 2025, 6:34 PM

#

Hello does anyone know about guard rails

serene scaffold May 25, 2025, 6:48 PM

#

woven prairie Hello does anyone know about guard rails

In what context?

woven prairie May 25, 2025, 6:52 PM

#

LLM

#

Like which prevents hallucinations

odd tulip May 25, 2025, 7:57 PM

#

thick heron Just feedback will do

it looks good and organised but its definitely beyond my skill level. I would have liked some images but it seems you are in the process of adding them.

runic parcel May 25, 2025, 8:35 PM

#

can anyone help me for the computer vision + ocr problem

#

I am trying to use yolo and tesseract for this project

serene scaffold May 25, 2025, 10:18 PM

#

runic parcel can anyone help me for the computer vision + ocr problem

Always ask your whole question and give the information people would need to start answering it. Never ask to ask

serene scaffold May 25, 2025, 10:18 PM

#

woven prairie Like which prevents hallucinations

You can insert extra information after the user's prompt

glacial root May 25, 2025, 10:33 PM

#

what's the hiring process like for computer vision internships?

serene scaffold May 25, 2025, 10:46 PM

#

glacial root what's the hiring process like for computer vision internships?

They'll ask you to tell them more about items on your resume that stood out to them, and they'll ask you "trivia questions" about computer vision to figure out if you're fake.

spring reef May 25, 2025, 10:51 PM

#

What are some misconceptions about A.I.? I understand that it is more useful in analyzing data than it is at writing novels or creating art, but is there anything else about A.I. that I have missed?

#

Also, what is it like being a data analyst or data scientist? Is it not that bad of a career path to go into? Is it a growing career path due to the development of A.I. or is there something else to being a data scientist outside of A.I. development?

serene scaffold May 25, 2025, 10:58 PM

#

spring reef What are some misconceptions about A.I.? I understand that it is more useful in ...

People in 2025 think that AI is only generative language models.

glacial root May 25, 2025, 10:59 PM

#

serene scaffold They'll ask you to tell them more about items on your resume that stood out to t...

i see

#

so no interview problems

#

like no on the spot coding

spring reef May 25, 2025, 10:59 PM

#

serene scaffold People in 2025 think that AI is only generative language models.

It is not? What is it then?

glacial root May 25, 2025, 10:59 PM

#

just computer vision theory questions and then questions about my past experience/projects that i have on my resume

serene scaffold May 25, 2025, 11:00 PM

#

spring reef It is not? What is it then?

Just think about what was considered AI before 2022. Those things still exist.

glacial root May 25, 2025, 11:00 PM

#

i don't get how people are able to get computer vision internships the summer after freshman year of college if recruiting starts in the fall

serene scaffold May 25, 2025, 11:00 PM

#

Like, self driving cars are not generative language models.

glacial root May 25, 2025, 11:00 PM

#

i guess they just start learning really early

serene scaffold May 25, 2025, 11:00 PM

#

glacial root i don't get how people are able to get computer vision internships the summer af...

You're right, that usually doesn't happen.

glacial root May 25, 2025, 11:00 PM

#

not much time left before recruiting season so i should probably get to work lol

glacial root May 25, 2025, 11:01 PM

#

serene scaffold You're right, that usually doesn't happen.

oh

#

i saw some guy from the uni i'll be going to who is a camera perceptions intern at aptiv

#

i don't know the guy but i saw his linkedin profile, all he had was an mnist classifier project

#

and some research, which i'm not too sure if it's related to computer vision or not as it's not too clear (and i don't know anything lol)

#

but the interesting thing was, none of it was before september 2024

#

the project date was december 2024

#

so either aptiv has a really late recruiting cycle or they just don't expect much/very little competition

spring reef May 25, 2025, 11:05 PM

#

serene scaffold Just think about what was considered AI before 2022. Those things still exist.

I only know what A.I. was thought up as in popular fiction. Such as Data from Star Trek, or SkyNet from the Terminator Franchise. The only other terminology I know the term A.I. was used for was for bots that simulated human players in computer games. Was any of that what you were referring to?

serene scaffold May 25, 2025, 11:06 PM

#

spring reef I only know what A.I. was thought up as in popular fiction. Such as Data from St...

Generative language models like ChatGPT feel like the AI entities from science fiction, but they aren't actually very similar.

glacial root May 25, 2025, 11:07 PM

#

typically what's the expectation for computer vision interns

#

or better question, what would be considered a competitive profile

glacial root May 25, 2025, 11:08 PM

#

serene scaffold Generative language models like ChatGPT feel like the AI entities from science f...

that would probably be closer to the concept of agi right

serene scaffold May 25, 2025, 11:09 PM

#

glacial root that would probably be closer to the concept of agi right

It seems like AGI because language generation feels more intrinsically human than driving a car. But LLMs aren't self aware

spring reef May 25, 2025, 11:12 PM

#

serene scaffold Generative language models like ChatGPT feel like the AI entities from science f...

In what way? What is the difference in generative language models compared to any other form of A.I.? What are the other versions of A.I.? I do not know what makes something considered A.I. outside of what I had listed, and even then I understand that the A.I. mentioned in Popular culture is not possible with modern technology right now.

serene scaffold May 25, 2025, 11:24 PM

#

spring reef In what way? What is the difference in generative language models compared to an...

Decision making systems are often AI.

spring reef May 25, 2025, 11:27 PM

#

serene scaffold Decision making systems are often AI.

What are some known decision making systems then? What makes them different from Operating systems?

serene scaffold May 25, 2025, 11:29 PM

#

spring reef What are some known decision making systems then? What makes them different from...

They're not related. An operating system is how applications interact with the hardware of a computer. A decision making system, in this context, is an application.

#

If you have something that decides/predicts how much a house should cost based on its properties, that's the kind of thing that I'm talking about

spring reef May 25, 2025, 11:30 PM

#

Apologies if I am asking basic questions, I am mostly unfamiliar with how software and coding works as I am a beginner at this moment of time. I am genuinely trying to understand you, but it is mostly going over my head as I have no experience hearing these terms before.

serene scaffold May 25, 2025, 11:30 PM

#

That's okay

#

I'm at my parents' house, so I can't give in-depth answers atm

spring reef May 25, 2025, 11:31 PM

#

serene scaffold If you have something that decides/predicts how much a house should cost based o...

Oh, so like an Excel sheet?

spring reef May 25, 2025, 11:31 PM

#

serene scaffold I'm at my parents house, so I can't give in depth answers atm

Oh ok. Thanks for trying to explain.

serene scaffold May 25, 2025, 11:31 PM

#

spring reef Oh, so like an Excel sheet?

Models are often trained on tabular data, which might be an Excel spreadsheet.

spring reef May 25, 2025, 11:32 PM

#

Oh ok.

serene scaffold May 25, 2025, 11:32 PM

#

But if you had an excel spreadsheet that calculates what you or someone else thinks the cost should be, you have to write a formula/function that calculates it in terms of the columns, right?

#

With machine learning, you have all those columns, and the actual price of the home, and the model figures out what function of the columns consistently arrives at the expected price
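To make that concrete, here is a minimal sketch with made-up numbers (the areas and prices are invented for illustration, and real models use many columns rather than one). Instead of someone typing the formula into a spreadsheet cell, an ordinary least-squares fit recovers the slope and intercept from the example rows:

```python
# Toy data, invented for illustration: floor area (m^2) and sale price.
areas = [50.0, 80.0, 100.0, 120.0]
prices = [150_000.0, 240_000.0, 300_000.0, 360_000.0]


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


# "Training": the function of the column is learned from the data.
slope, intercept = fit_line(areas, prices)


def predict_price(area):
    """Apply the learned function to a house that was not in the data."""
    return slope * area + intercept


print(predict_price(90.0))
```

In the spreadsheet version a person picks the slope and intercept by hand; here `fit_line` picks them so the predictions match the known prices as closely as possible.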

spring reef May 25, 2025, 11:36 PM

#

So automation comes with machine learning?

thick heron May 26, 2025, 2:39 AM

#

runic parcel can anyone help me for the computer vision + ocr problem

You can check my repo silver vi it's already 90% done

thick heron May 26, 2025, 2:40 AM

#

odd tulip it looks good and organised but its definitely beyond my skill level. I would ha...

That feedback helps

serene scaffold May 26, 2025, 3:08 AM

#

spring reef So automation comes with machine learning?

well, any computer program is designed to automate something.

spring reef May 26, 2025, 3:15 AM

#

Oh ok

deep anchor May 26, 2025, 6:48 AM

#

Hí

runic parcel May 26, 2025, 8:26 AM

#

thick heron You can check my repo silver vi it's already 90% done

idk if it will work, because my usecase is a bit different

tawdry dove May 26, 2025, 9:03 AM

#

I want to learn how to detect changes in image data. For example, if a bill has a name on it and someone changed it, the bot should recognise that it was altered and flag it as fraud. Are there any pre-built models that do this? Also, what should I know to achieve this? I know the basics, but any good research paper would help.

thick heron May 26, 2025, 9:04 AM

#

runic parcel idk if it will work, because my usecase is a bit different

If it's just YOLO (v8) for identification and then Tesseract for text extraction, then whatever your use case is, keep in mind that the dataset you use has to be relevant to your thing. If you are planning to deploy, try the nano and small models first, then move to the bigger ones. If it still feels too heavy for your deployment, consider going back a few versions, e.g. YOLOv5.

woven prairie May 26, 2025, 9:14 AM

#

thick heron If it's just yolo(v8) used for identification and then tesseract to do some text...

Tesseract the python one

#

Amazon also provides one that is amazing

#

Currently I am working on a project where I have to extract all the content from PPT slides and pass it to an LLM for further processing

thick heron May 26, 2025, 9:17 AM

#

woven prairie Amazon also provides one that is amazing

You also need internet for that; if it has to work offline then he's done for. It's also not great: I have tested a lot of things and it's only 70ish, and that's on a clear image, not blurred ones

woven prairie May 26, 2025, 9:17 AM

#

woven prairie Currently I am working in a project, where I have to extract all the content fro...

Now I am using pdfplumber to extract tables from the PPT and EasyOCR to extract text from images

thick heron May 26, 2025, 9:19 AM

#

woven prairie Now I am using pdf plumber to extract tables from ppt and easyocr to extract tex...

Great thought

woven prairie May 26, 2025, 9:19 AM

#

But along with the text from the images, I also want to extract the meaning of the images

thick heron May 26, 2025, 9:20 AM

#

woven prairie But with text from images I want to extract the meaning of images

Assuming that you have internet access, I recommend trying out a Google AI Studio API key for summaries

#

It's free and really good, but it has rate limits, so be careful with that part

woven prairie May 26, 2025, 9:21 AM

#

If I want to extract the context of an image, how can this be done?

#

How this can be done

runic parcel May 26, 2025, 12:22 PM

#

thick heron If it's just yolo(v8) used for identification and then tesseract to do some text...

i trained my images on yolov11 medium

thick heron May 26, 2025, 12:23 PM

#

runic parcel i trained my images on yolov11 medium

Good did you get what you desired?

runic parcel May 26, 2025, 12:24 PM

#

thick heron Good did you get what you desired?

Yes, that is perfect, got 99% accuracy

#

but now i am trying to use tesseract to extract the text

#

i managed to get most of it by trying different preprocessing, but it still can't extract a few things, like the digit 5

thick heron May 26, 2025, 12:25 PM

#

woven prairie If i want to extract the context of image , how this can be done

If the images are text-based, you can send them directly to Gemini, or extract the text first, which may not be efficient

thick heron May 26, 2025, 12:25 PM

#

runic parcel i managed to get most while trying different preprocessing, but still cant it c...

Well is that 5 in a weird font

#

Or some colour

runic parcel May 26, 2025, 12:26 PM

#

No its proper

#

see

#

properly detects 4 and 9

#

but for 5 it gives § this shit

thick heron May 26, 2025, 12:27 PM

#

Did it give you a s instead of 5? Or completely ignored it?

thick heron May 26, 2025, 12:27 PM

#

runic parcel properly detects 4 and 9

Thought so

thick heron May 26, 2025, 12:27 PM

#

runic parcel but for 5 it gives § this shit

This usually happens

runic parcel May 26, 2025, 12:27 PM

#

need to use image preprocessing

#

u have any good ideas?

#

for thresholding and preprocessing

#

like before it detected nothing, i tried different preprocessing and got pretty good results

thick heron May 26, 2025, 12:28 PM

#

Did you classify each image like this folder has images of 5

runic parcel May 26, 2025, 12:28 PM

#

wdym

thick heron May 26, 2025, 12:28 PM

#

runic parcel for thresholding and preprocessing

Keeping the threshold around 7 is a good try 5 or 4.5 too if

thick heron May 26, 2025, 12:29 PM

#

runic parcel wdym

Is your data set a mix of all the characters?

runic parcel May 26, 2025, 12:29 PM

#

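```
import cv2

# cropped_img: BGR crop of the region to read
gray = cv2.cvtColor(cropped_img, cv2.COLOR_BGR2GRAY)
# Upscale 2x so small characters have more pixels
gray = cv2.resize(gray, None, fx=2, fy=2, interpolation=cv2.INTER_LINEAR)

# Light blur to suppress noise before thresholding
blur = cv2.GaussianBlur(gray, (5, 5), 0)

# Otsu picks the threshold automatically, so the 0 passed here is ignored
_, thresh = cv2.threshold(blur, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
```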

#

this is my preprocessing

runic parcel May 26, 2025, 12:30 PM

#

thick heron Is your data set a mix of all the characters?

yes then how did it find 4 and 9

#

there is problem in ocr

#

not my dataset

thick heron May 26, 2025, 12:31 PM

#

You turned them into greyscale, which is a good choice to avoid the colours, and you blurred it a bit and resized it

thick heron May 26, 2025, 12:31 PM

#

runic parcel there is problem in ocr

OCR is not 100 percent accurate; no model I tried had that capability. It's best to use stuff by Google or Oracle, basically the big companies

runic parcel May 26, 2025, 12:32 PM

#

thick heron Ocr is not 100 percent accurate no model i tried had that capabilities it's best...

is that paid?

thick heron May 26, 2025, 12:32 PM

#

runic parcel is that paid?

They have free tiers too. Do your own research first to find the best model as of now

#

My system was hybrid online + offline

#

2 different softwares used

runic parcel May 26, 2025, 12:33 PM

#

but isn't there a way to do it with open source

#

like tesseract

#

PaddleOCR stuff

thick heron May 26, 2025, 12:33 PM

#

Yea

#

Tesseract is good ocr

runic parcel May 26, 2025, 12:34 PM

#

but having issue

#

ik its good but results are not the way i want

#

is there a way to tell it that the label will always be a number and do OCR that way

thick heron May 26, 2025, 12:35 PM

#

I see. For the data processing part, try labeling and then training on that

#

Try improving your data set specifically images of 5 and §

thick heron May 26, 2025, 12:36 PM

#

runic parcel is there a way to tell it that label will always be a number and do ocr from th...

Yes you can
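A minimal sketch of one way to do it, assuming Tesseract is called through pytesseract and that the KDA cell is rendered like `3/1/2` (the separator is a guess, adjust it to the scoreboard). `--psm 7` and `tessedit_char_whitelist` are standard Tesseract options; as far as I know the whitelist is honoured by the legacy engine and by the LSTM engine from 4.1 onward. The `CONFUSABLE` table and both function names are hypothetical helpers for illustration:

```python
import re

# --psm 7 treats the crop as a single text line; the whitelist restricts the
# output alphabet to digits plus the assumed "/" separator.
KDA_CONFIG = "--psm 7 -c tessedit_char_whitelist=0123456789/"

# Hypothetical lookup of characters OCR tends to return in place of digits.
CONFUSABLE = str.maketrans(
    {"§": "5", "S": "5", "s": "5", "O": "0", "o": "0", "l": "1", "I": "1"}
)


def read_kda(image):
    """Run Tesseract on a preprocessed crop (needs pytesseract installed)."""
    import pytesseract  # imported lazily so parse_kda works without it

    return pytesseract.image_to_string(image, config=KDA_CONFIG)


def parse_kda(raw_text):
    """Map confusable characters to digits, then pull out the integers."""
    cleaned = raw_text.translate(CONFUSABLE)
    return [int(number) for number in re.findall(r"\d+", cleaned)]
```

Usage would be `parse_kda(read_kda(thresh))` on the thresholded crop. The cleanup step is only safe on fields that are known to be numeric, since it would also rewrite a real `S` in a player name.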

#

So you are not doing text only numbers?

runic parcel May 26, 2025, 12:37 PM

#

thick heron So you are not doing text only numbers?

no

#

both text and numbers, but that label only consists of numbers

#

won't be anything else except numbers

thick heron May 26, 2025, 12:39 PM

#

Oh wait the labels are in numbers?

runic parcel May 26, 2025, 12:39 PM

#

no bro

#

the label is kda

#

kda is the label name, and inside it there will be numbers

#

#

like this, so there will be numbers this way

thick heron May 26, 2025, 12:44 PM

#

I have a question: you are running OCR on completely digital images, extracting the text, and then using what was extracted, with no physical ones? And the issue is that it sometimes misreads 5 as §, and right now your preprocessing turns the images black and white and then blurs them?

#

Am i clear on this part?

#

Kda is a label that consists of pure numbers and nothing else?

runic parcel May 26, 2025, 1:04 PM

#

thick heron I have a question you are doing these completely digital images to ocr and then ...

the images are digital, no handwritten images will be there

runic parcel May 26, 2025, 1:04 PM

#

thick heron Kda is a label that consists of pure numbers and nothing else?

exactly, pure numbers nth else

#

see i am trying to get the data from a scoreboard like this into json format like given below:

{
  "radiant_score": 15,
  "dire_score": 10,
  "teams": {
    "radiant": [
      { "player_name": "Ani", "hero": "Night Stalker", "level": 9, "gold": 489, "kda": [3, 1, 2], "ultimate": true },
      { "player_name": "Alvyy", "hero": "Templar Assassin", "level": 10, "gold": 1136, "kda": [5, 1, 5], "ultimate": true },
      { "player_name": "REDRUM", "hero": "Crystal Maiden", "level": 7, "gold": 234, "kda": [1, 4, 3], "ultimate": false },
      { "player_name": "Big Doodle", "hero": "Earthshaker", "level": 7, "gold": 2138, "kda": [1, 1, 3], "ultimate": true },
      { "player_name": "pick weak=punishment", "hero": "Ember Spirit", "level": 8, "gold": 601, "kda": [0, 4, 4], "ultimate": true }
    ],
    "dire": [
      { "player_name": "Stleip", "hero": "Tusk", "level": 6, "gold": 514, "kda": [0, 3, 6], "ultimate": false },
      { "player_name": "红双喜", "hero": "Morphling", "level": 8, "gold": 1301, "kda": [2, 1, 1], "ultimate": true },
      { "player_name": "hy not listening", "hero": "Mars", "level": 7, "gold": 788, "kda": [2, 2, 4], "ultimate": false },
      { "player_name": "xin", "hero": "Lina", "level": 6, "gold": 249, "kda": [3, 2, 2], "ultimate": false },
      { "player_name": "Love is patient, lo...", "hero": "Lion", "level": 7, "gold": 1301, "kda": [1, 6, 5], "ultimate": true }
    ]
  }
}

thick heron May 26, 2025, 1:07 PM

#

and the kda is the issue since the numbers are misread

agile pewter May 26, 2025, 6:50 PM

#

hello, I'm learning ML with scikit-learn, but the tutorials I saw use scikit-learn's built-in datasets. how can I make my own and save it to use later?

#

in fact I managed to train one, I just don't know what to do afterwards to store it for later use

agile cobalt May 26, 2025, 6:54 PM

#

agile pewter hello im learning ml with the sckitlearn, but the tutorials i saw use the sckitl...

If it does not exist in a digital format yet: create your own dataset by hand in Excel, then export it as a CSV file and load it using pandas or polars

If it exists in a digital format, it may vary a lot but generally speaking find it online and/or write a script to format it in a way the models can understand
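For the "store it for later use" part, a minimal sketch using only the standard library. The helper names are hypothetical. `pickle` works on any picklable Python object, and scikit-learn's model persistence docs list pickle and joblib as options for fitted estimators; the demo at the bottom stores a plain dict so that it runs without scikit-learn installed:

```python
import csv
import pickle


def write_dataset(path, header, rows):
    """Save a hand-made dataset as CSV (the same shape an Excel export has)."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)


def read_dataset(path):
    """Load the CSV back as a list of dicts keyed by column name."""
    with open(path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


def save_model(model, path):
    """Serialize any picklable object, e.g. a fitted scikit-learn estimator."""
    with open(path, "wb") as f:
        pickle.dump(model, f)


def load_model(path):
    """Only unpickle files you created yourself: pickle can run arbitrary code."""
    with open(path, "rb") as f:
        return pickle.load(f)


# With scikit-learn the calls would look like this (not executed here):
#   model = LinearRegression().fit(X, y)
#   save_model(model, "model.pkl")
#   model = load_model("model.pkl")   # later, in another script
```

Note that `csv.DictReader` returns every value as a string, so numeric columns need converting before training; pandas `read_csv` does that conversion for you.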

woven prairie May 26, 2025, 7:01 PM

#

Does anyone know of an open-source vision model that takes an image as input and tells you basically what the image is about?

agile cobalt May 26, 2025, 7:05 PM

#

woven prairie Does anyone know any open source vision model that takes image input and tell wh...

that is extremely generic

there are thousands of classifier models that can do that for different topics depending on what you consider "being about something" to mean, or you can just throw it at any multimodal LLM

#

random example: https://docs.ultralytics.com/datasets/classify/imagenet/

ImageNet

Explore the extensive ImageNet dataset and discover its role in advancing deep learning in computer vision. Access pretrained models and training examples.

abstract wasp May 26, 2025, 7:39 PM

#

Hi am I allowed to send a survey for some data collection? It’s for a project

serene scaffold May 26, 2025, 8:10 PM

#

abstract wasp Hi am I allowed to send a survey for some data collection? It’s for a project

No, we don't allow that

verbal oar May 26, 2025, 8:50 PM

#

what project should I build? I want to practice ML

#

I don't know, maybe price prediction of electricity?

#

sth where there is data

#

I watched ml from scratch type of videos from vizuara

thick nest May 26, 2025, 9:25 PM

#

hi, I'm thinking about making a tic-tac-toe player with a neural network, without probabilities, just linear algebra. this is possible, right?

#

i'll use ReLU and Softmax function

agile cobalt May 26, 2025, 10:38 PM

#

neural network
without probabilities
with Softmax
what exactly do you mean by "(without) probabilities"?..

thick nest May 26, 2025, 10:47 PM

#

I mean that I'm using softmax just to highlight the best move, the one with the highest score

#

I'll create two hidden layers with 9 neurons each. I'll use ReLU as the activation function in the hidden layers, and then a softmax layer at the end to generate a vector of scores, one for each possible move. Then I'll use argmax to pick the position with the highest score and place the X there. So the softmax helps highlight the best move, but I'm not using it for real probabilities
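A rough sketch of that forward pass in plain Python, assuming a 9-number board encoding (1 = X, -1 = O, 0 = empty) and untrained random weights, both chosen here only for illustration. Masking occupied cells before the softmax is an addition that was not in the description above:

```python
import math
import random


def relu(values):
    return [max(0.0, v) for v in values]


def softmax(values):
    """Numerically stable softmax: subtract the max before exponentiating."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def dense(inputs, weights, biases):
    """One fully connected layer; weights holds one row per output neuron."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def random_layer(n_in, n_out, rng):
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def choose_move(board, layers):
    """Return (index of the chosen cell, softmax scores for all 9 cells)."""
    hidden = board
    for weights, biases in layers[:-1]:
        hidden = relu(dense(hidden, weights, biases))
    logits = dense(hidden, *layers[-1])
    # Occupied cells get -inf, so softmax gives them a score of exactly 0.
    masked = [z if cell == 0 else -math.inf for z, cell in zip(logits, board)]
    scores = softmax(masked)
    best = max(range(9), key=lambda i: scores[i])  # argmax
    return best, scores


rng = random.Random(0)  # untrained weights: this checks shapes, not play quality
layers = [random_layer(9, 9, rng) for _ in range(3)]  # two hidden layers + output
board = [1, -1, 0, 0, 1, 0, -1, 0, 0]
move, scores = choose_move(board, layers)
```

Since softmax preserves the ordering of its inputs, taking argmax of the raw output scores picks the same cell, so the softmax is optional if only the best move is needed. Its outputs are non-negative and sum to 1, which is why they are usually read as probabilities.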

rich moth May 27, 2025, 12:05 AM

#

thick heron hey i have a hard time doing my project can some one look at my git maybe give m...

Lets see your link.

rich moth May 27, 2025, 6:02 AM

#

fickle shale May 27, 2025, 6:39 AM

#

rich moth

this design is too sexy!!

upper niche May 27, 2025, 6:58 AM

#

I need some information in regards to data normalization

#

is it better to normalize the data before or after splitting

#

I don't get the logic of: by exposing the data before training, you may cause data leakage,

#

```
from sklearn.linear_model import LinearRegression

# Check if normalization is requested
# if yes:
#     normalize X_train and X_test, y_train and y_test

print(X_train.head())
model = LinearRegression()
model.fit(X_train, y_train)
```

#

Let's assume that pseudocode doesn't exist. Now the data is split into train and test, and it is not normalized. The variable model has never been exposed to the normalization, right?

main citrus May 27, 2025, 9:11 AM

#

Hi guys, I have been learning ML, EDA, data engineering, NLP and a bit of deep learning for 2 years
I am 16 yo
Do you think that I can get summer work in a company with that?

odd meteor May 27, 2025, 9:49 AM

#

main citrus Hi guys, I have been learning ML, eda and data engineer, nlp and a bit of deep l...

Nothing is impossible. It'll be easier if you have built a couple of solid projects as well

odd meteor May 27, 2025, 9:50 AM

#

upper niche is it better to normalize the data before or after splitting

It's advised to normalize after splitting, so that information from the test data doesn't leak into training
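A small sketch of why the order matters, with a toy column invented for illustration. The scaling statistics are learned from the training rows only and then reused on the test rows, which is the same pattern as calling `StandardScaler.fit` on `X_train` and `transform` on both splits:

```python
import statistics


def fit_scaler(values):
    """Learn the mean and (population) standard deviation from the given rows."""
    return statistics.fmean(values), statistics.pstdev(values)


def transform(values, mean, std):
    return [(v - mean) / std for v in values]


# Toy feature column; the last two rows play the role of the test split.
feature = [10.0, 12.0, 14.0, 16.0, 18.0, 40.0, 50.0]
train, test = feature[:5], feature[5:]

mean, std = fit_scaler(train)              # statistics come from train only
train_scaled = transform(train, mean, std)
test_scaled = transform(test, mean, std)   # test reuses the train statistics

# Leaky variant: fitting on the full column lets the test rows shift the
# mean and std that the training data is scaled with.
leaky_mean, leaky_std = fit_scaler(feature)
```

In the leaky variant the model is trained on values that already encode something about the test rows, so the test score no longer measures performance on truly unseen data.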

verbal oar May 27, 2025, 10:24 AM

#

what solid projects? it's about a few lines, for example with sklearn

#

model
fit
predict

past meteor May 27, 2025, 10:30 AM

#

verbal oar what solid projects, its about few lines for example with sklearn

Sure, but the hard part isn't training the model, it's showing you can find a suitable use case (that isn't some kaggle stuff), understand your data, do the right preprocessing, ...

verbal oar May 27, 2025, 10:35 AM

#

ah ok so other parts related to data science

past meteor May 27, 2025, 10:53 AM

#

Sure, but that makes sense right? 😄

#

It's as you say, the code is so easy (.fit / .predict) they'd hire no one to do just that

verbal oar May 27, 2025, 2:19 PM

#

makes sense, thats why high salary

#

so you must know crisp-dm or similar project cycle

#

but honestly still not too much of coding compared to some web app development where there are modules, components, rather big systems or game engine development

#

this is what attracts me to do some data science

#

not high salary but much less code

lapis sequoia May 27, 2025, 2:54 PM

#

Made something with python

#

https://github.com/Abhiiishek-rana/FAIL-UP

GitHub

GitHub - Abhiiishek-rana/FAIL-UP: This project converts video lectu...

This project converts video lectures into clear, organized notes that are detailed enough to stand on their own. Even if you skip watching the video, the notes provide all the essential information...

void stone May 27, 2025, 3:35 PM

#

Hi guys, I want to break into data science and have an internship at my first year at university in summer
I am currently learning python through cs50p and I was wondering if anybody has resources I can use to be able to make relevant projects during university
I have looked up at kaggle but I am not sure
I also watched this video : https://www.youtube.com/watch?v=9R3X0JoCLyU
It helps me in direction but not necessarily in the process of learning

YouTube

Programming with Mosh

The Complete Data Science Roadmap

Go from zero to a data scientist in 12 months. This step-by-step roadmap covers the essential skills you must learn to become a data scientist in 2024.

verbal oar May 27, 2025, 5:49 PM

#

what about simplilearn data science course?

void stone May 27, 2025, 5:51 PM

#

verbal oar what about simplilearn data science course?

I will look into that thanks

verbal oar May 27, 2025, 5:52 PM