#data-science-and-ml

1 messages · Page 166 of 1

charred estuary
#

dude

#

read the documentation

#

you need to edit the code

#

as isit will just make a generic dataset

#

if you are interested just look into the project

limpid dew
#

Sorry, but I had to ask because I didn't think the documentation was clear.

charred estuary
# limpid dew Sorry, but I had to ask because I didn't think the documentation was clear.

I don't really see how its unclear it says: ```

"The Synthetic Conversations dataset is a set made up of inputs and outputs that was completely automated and generated by AI language models. I used AI models such as DeepSeek R1 Llama 70B Distil, Google's Gemini 2.0 Flash, Microsoft's Phi 3, and Qwen3-0.6B."```
and:


DeepSeek R1 Llama 70B Distil
Gemini 2.0 Flash
Phi 4 Reasoning
Qwen3 0.6B

Only the best responses are selected and added to the dataset. This is done by having all of the AI models voting on which output they think is the best without being able to vote for their own output."```
limpid dew
#

I asked a specific question about generating math data, and you told me to read the docs but the docs don’t explain how to do that. If your project depends on users editing the code to guide the output, that should be clearly explained. Saying “it’s in the README” doesn’t work if it isn’t.

charred estuary
#

^

#

"You can modify the script to ask the cluster to only generate data that will help train an AI on python debugging or math or whatever you want."

limpid dew
#

Right — but saying “you can modify the script” isn’t the same as explaining how to do it. That sentence is a claim, not documentation. If customizing the prompts is essential, the README should walk through it clearly. Otherwise, pointing to it doesn’t help.

charred estuary
rich moth
#

I joined that kaggle comp for Stanford RNA Folding. I followed their outline but I added the UCF to extract features and it actually makes the predictions way more accurate by finding hidden patterns in the RNA that regular models miss. The UCF lets us see how "mathematically complex" different parts of the RNA are, which helps guide the 3D folding. Here are some images, notice how the data points sit in the Complex/Chaotic region, its telling us RNA have intricate folding patterns, we know that, but the AI does too. And the 3D visuals show the actual predicted structures with each nucleotide color-coded (A=green, U=red, C=blue, G=yellow). Heres a few samples

lapis sequoia
#

Guys I need help with Python Pandas

#
from subprocess import call
import pandas as pd
import time

#func for opening files on command
def openfile(x:str):
    call(["python", x])


#setting up the dataframe and email variables
df = pd.read_csv("CSV_Files/logindata.csv")
df.set_index('Email',inplace=True)
emp_email_end = "@can.emp"
adm_email_end = "@can.adm"


#signup and login
print("***** Welcome To  Login Page *****")
choice_signin = input("Would you like to login or signup?: ")

if choice_signin == "login":
    mail = input("Enter your email:- ")
    pwd = input("Enter your password: ")

    act_pwd = str(df.loc[mail][0])

    if pwd == act_pwd:
        if str(mail).endswith(emp_email_end):
            print("Welcome Employee!")
            time.sleep(3)
            openfile("Python_Code/employee.py")
        elif str(mail).endswith(adm_email_end):
            print("Welcome, Admin")
            time.sleep(3)
            openfile("Python_Code/admin.py")
        else:
            print("Welcome To The Canopy!!")
            time.sleep(3)
            openfile("Python_Code/customer.py")

    elif pwd != act_pwd:
        print("Password is incorrect")   

elif choice_signin == "signup":
    new_mail = input("Enter Your Email ID:- ")
    new_pwd =  input("Enter Your Password:- ")

    df.loc[new_mail] = [new_pwd]
    
    if str(new_mail).endswith(emp_email_end):
        print("You cannot register with the company email!")
    elif str(new_mail).endswith(adm_email_end):
        print("You cannot register with the company email!")
    else:
        print("Welcome To The Canopy!!")
        time.sleep(3)
        openfile("Python_Code/customer.py")



#updating the csv file
df.to_csv("CSV_Files/logindata.csv")```
#

I'm getting this exception when I use the login choice as ''login''

#

FutureWarning: Series.__getitem__ treating keys as positions is deprecated. In a future version, integer keys will always be treated as labels (consistent with DataFrame behavior). To access a value by position, use `ser.iloc[pos]` act_pwd = str(df.loc[mail][0])

#

For reference, I set the index of the csv file to the email names

#

not an integer index

#

I want to get rid of the exception, since need to show this as a school project, and I'm trying to use the try, except method, but it's not working

#

Please help

final jolt
lapis sequoia
# final jolt not entirely sure on your question since you say you are trying to use the try/e...
    mail = input("Enter your email:- ")
    pwd = input("Enter your password:- ")

    act_pwd = str(df.loc[str(mail)][0])

    try:
        act_pwd = str(df.loc[str(mail)][0])
    except FutureWarning:
        print("test")

    if pwd == act_pwd:
        if str(mail).endswith(emp_email_end):
            print("Welcome Employee!")
            time.sleep(3)
            openfile("Python_Code/employee.py")
        elif str(mail).endswith(adm_email_end):
            print("Welcome, Admin")
            time.sleep(3)
            openfile("Python_Code/admin.py")
        else:
            print("Welcome To The Canopy!!")
            time.sleep(3)
            openfile("Python_Code/customer.py")

    elif pwd != act_pwd:
        print("Password is incorrect")```
#

here

#

This isn't working either. I'm still getting the same exception, but twice now

#

FutureWarning: Series.__getitem__ treating keys as positions is deprecated. In a future version, integer keys will always be treated as labels (consistent with DataFrame behavior). To access a value by position, use `ser.iloc[pos]` act_pwd = str(df.loc[str(mail)][0]) /home/jon/Desktop/IP_Project_Hotel-Management/Python_Code/logins.py:30: FutureWarning: Series.__getitem__ treating keys as positions is deprecated. In a future version, integer keys will always be treated as labels (consistent with DataFrame behavior). To access a value by position, use `ser.iloc[pos]` act_pwd = str(df.loc[str(mail)][0]) Welcome To The Canopy!!

lapis sequoia
#

How do I do that, using try and except, or sys module?

#

Or any other fix that you know of

final jolt
# lapis sequoia and yes, the code is working as it's supposed to, it's only the warning I want t...

So my assumption is still the same in that the argument you are passing to df.loc() is in fact not seen as a string but an integer or otherwise. you could try setting the value outside of the function like

mail_id = str(mail)[0]
act_pwd = str(df.loc[mail_id])```
or try using thje df.iloc like the message suggests.
Also the reason youre try/except technically did not work is because you are calling the same function outside of the try statement (5th line) which is what actually throws the exception.
lapis sequoia
final jolt
#

try changing this:

    act_pwd = str(df.loc[str(mail)][0])

    try:
        act_pwd = str(df.loc[str(mail)][0])
    except FutureWarning:
        print("test")```
to this:
```py
    mail_id = str(mail)[0]

    try:
        act_pwd = str(df.loc[mail_id])
    except FutureWarning:
        print("test")```
final jolt
#

same error or doesnt work at all?

lapis sequoia
#

same error

tight dune
#

newbie here, anyone know how to make a deliniate (red line) a echogram data? x= depth , y=time

fallow coyote
#

Need to ask, should I use statsmodel to help refine my model whilst using sklearn as my main library for linear regression? Also is there any difference in how sklearn and statsmodel handles linear regression? If i can get some clarification, that should help remove the frustration on what library to use for my model

verbal oar
#

in statsmodel you have OLS (ordinary least squares)

#

difference in writing

#

also you have statistical summary (p-values, confidence intervals etc)

obtuse acorn
#

would it make sense to set the index to be the target variable in a pandas dataframe or should i just leave it as a column?

#

like that

#

im guessing just leave it as a column

rich moth
#

Still working on incorporating all the visuals though

#

dont be scared just say it. Im putting myself out there. dont be shy

deep anchor
# rich moth Its a work in progress but Ive updated my paper on the UCF if anyone is interest...

lol brooo just checked it out, and honestly?? not bad at all for ur first time 😮‍💨👏 like fr I wasn’t expectin it to be that detailed lmao. Some parts were a lil dense hhhmmmm maybe simplify a bit? but overall it lowkey makes sense 😭 visuals gonna help a lot once u throw em in tho frfr. Keep grindin, this got potential 🤙 let me kno when u update it again lol I’ll def take another look!

rich moth
#

there ya go

#

thank you

deep anchor
#

ok

#

thats good

tawdry sundial
fallow coyote
#

whats the difference between sklearn and statsmodel? Do I use either one or do I use a mixture of both modules?

sudden wyvern
#

Hi all I am Aakash and want help in terms of creating a sql agent which uses local llm with service like ollama local models and using lang chain currently I am not able to create that efficient agent by these things can anyone please suggest me how can i create fine sql agent which can talk with database and answer user's query accordingly ❓

I have tried llama3 deepseekr1 llama3.2 models but I am getting some OutputParse exception.

rich moth
#

The last image is the RNA structure of Covid 19. Even predicted the pseudoknot. The first one is 363 nt long. All these test ran in under a minute. The speed at which this thing preforms is bonkers

#

Its a straight web of connections hahaha

#

Look how all the RNA structure line up in the 135 degree region. It's detecting something for sure

#

In all the intial UCF test, the RNA tests, the Tade bot tests, all integrated with parts of the UCF.. it seems they all point to the same conclusion. There's something underlying in data we've been overlooking for a while. The consistent ~135 degree phase angle appearing across completely different types of data suggests a fundamental mathematical principle that seems universal.

#

There are different complexity spaces from crypto coins too.

past meteor
slim tundra
#

Hi,
I am having a hard time getting bipedalwalker v3 with PPO agent to walk. The reward seems to be stuck around 10 to 20. I am trying to get at least 200.
I have tried changing the parameters and the architectures nothing worked

I want to know if the issue lies in the architecture or in the training parameters

the script uses

ActorNet (actor policy)
Criticnet to compute state value and
ActorCriticNet combining both networks and adds helper methods to act and evaluate samples

Does anyone have experience or know something about deep reinforcement learning and can help?

fallow coyote
fallow coyote
#

In what way? I was thinking at first, use sklearn as my main library for the actual ML part with statsmodel to refine it. Im getting confused with this shit

slim tundra
#

teaching a kid how to walk

#

basically

fallow coyote
#

Is there any significant difference in how statsmodel uses linear regression techniques compared to sklearn?

past meteor
#

Are you trying to just do predictions or are you doing data analysis with linear regression?

#

If you're "just" trying to predict a value --> sklearn
If you're doing data analysis, statistics, are interested in interpreting coefficients etc. --> statsmodels

toxic pilot
#

also it depends on what you're trying to accomplish; if sklearn has what you need, then use sklearn. if it doesnt, then see if statsmodel does

#

in my experience, sklearn's strength is more machine learning while statsmodel is used primarily for classical & descriptive statistics

fallow coyote
#

That puts it into perspective. I guess what Im trying to do is first do some data analysis on how the price of gold is affected by certain factors and then make a simple price prediction program based on said factors

abstract wasp
#

Hi can someone explain to me this diagram? It’s in regards to L1 vs L2, I don’t understand the circle/diamond and the ellipses

calm thicket
#

using those norms

rich moth
#

It predicts RNA binding sites with pretty good accuracy, i built it with a validation dataset using PDB structures with experimentally verified binding sites, measuring distances between RNA residues and bound ligands to identify ground truth

75% precision / 60% recall on the biotin aptamer (1F27_A) and 41% precision / 54% recall on the FMN riboswitch (1FMN_A)

For SARS-CoV-2 RNA frameshifting element, my algorithm identified a key binding pocket at positions 10-16 (sequence GGGUUU) with a phase angle of 133.7° - precisely matching the universal pattern.

RNA structures consistently align near and around 135°, while cryptocurrency price data appears to align near 90°. Both show strong phase alignment, just at different characteristic angles.

verbal oar
#

yes as I said statsmodel has OLS

#

and statsmodel has summary statistics related to hypothesis testing etc

plush kettle
#

Guys, I want to ask suppose I want to train my object detection model with resnet fpn backbone on 640 x 640 images but no augmentations whatsoever, I use 80:10:10 split so I use 40 images for training and 5 for validation, which resnet backbone is the best

#

I know the dataset size is not enough but I can only work with the available data for now because my project manager told me not to make any augmentation/s first

obtuse acorn
#

if im using scikit learn do i need to always use TimeSeriesSplit if my data has a date column?

#

or is TimeSeriesSplit only for if your trying to predict what will happen in the future?

#

versus like categorising something that has a date column?

thick heron
#

is it possible to make some ai that is well optimized for deployemnts and be very good at it?

#

i am finding it hard to optimize

serene scaffold
thick heron
#

yes

serene scaffold
#

What hardware are you using?

thick heron
#

raspberry pi

serene scaffold
thick heron
#

great then

serene scaffold
# thick heron great then

Raspberry Pis aren't intended to be very powerful. If you're trying to run a neural network, you'd need to run the neural network on a different machine that can communicate with the pi

thick heron
#

🙂‍↕️ tysm

verbal oar
#

is there sth like resource of popular papers from arxiv and other sources?

#

similar to arxiv sanity vanity

obtuse acorn
#

any idea when to use minmax scaler vs standard scaler in scikit learn?

verbal oar
#

sth like
stable diffusion model, attention is all you need, variational bayes (actually dont remember title some related to vae), ...

#

or I must compile it?

#

I need just most popular or popular dont need all of them

fallow coyote
# thick heron great then

theres the AI hat you can buy that allows you to run ai applications on it but Ill be honest, either buy an expensive ass pc or, use google colab notebooksa nd run everything off the cloud

thick heron
#

Pi is useless even with that usb coral tpu thing

fringe jay
#

first time learning and practicing neural networks and ai, if any of yall could help that'd be great #1373696202398240909

green pilot
#

Any suggestions on the fastest way to convert csv to txt files ? I was thinking of just using pandas but i think it might be slower than the base csv to txt converter. Any suggestions?

serene scaffold
crimson raft
#

Hey guys.
Could someone please help me and look at a python code I'm working on? I'm not programmer nor have degree in IT, so, I'm not a pro. I posted the code two weeks ago and nobody answered, so I figured I ask here. Any help is much appreciated.

serene scaffold
crimson raft
serene scaffold
#

It might be that people won't look at your code even if you do post it, and that would be unfortunate, but they certainly won't if you don't post it.

#

And if people look at it after telling you that they will look at it, they would have done that even if you hadn't made them ask you to post it.

cloud zenith
#

Hello! Can someone please help me with ONNX exporting? I'm trying to export an ELM custom model into ONNX format, but keep running into this mysterious error:

Cell In[1], line 4
      1 import numpy as np
      3 from onnx import helper
----> 4 from skl2onnx import convert_sklearn
      5 from skl2onnx.common.data_types import FloatTensorType
      6 from skl2onnx.common.utils import check_input_and_output_numbers

File ~\AppData\Local\Programs\Python\Python313\Lib\site-packages\skl2onnx\__init__.py:16
     12 __model_version__ = 0
     13 __max_supported_opset__ = 21  # Converters are tested up to this version.
---> 16 from .convert import convert_sklearn, to_onnx, wrap_as_onnx_mixin
     17 from ._supported_operators import update_registered_converter, get_model_alias
     18 from ._parse import update_registered_parser

File ~\AppData\Local\Programs\Python\Python313\Lib\site-packages\skl2onnx\convert.py:8
      6 import numpy as np
      7 import sklearn.base
----> 8 from .proto import get_latest_tested_opset_version
      9 from .common._topology import convert_topology
     10 from .common.utils_sklearn import _process_options

File ~\AppData\Local\Programs\Python\Python313\Lib\site-packages\skl2onnx\proto\__init__.py:22
     18 except ImportError:
     19     # onnx is too old.
     20     pass
---> 22 from onnx.helper import split_complex_to_pairs
     25 def make_tensor_fixed(name, data_type, dims, vals, raw=False):
     26     """
     27     Make a TensorProto with specified arguments.  If raw is False, this
     28     function will choose the corresponding proto field to store the
   (...)     31     this case.
     32     """

ImportError: cannot import name 'split_complex_to_pairs' from 'onnx.helper' (C:\Users\Admin\AppData\Local\Programs\Python\Python313\Lib\site-packages\onnx\helper.py)```
#

I'm using Python 3.13.2.

verbal oar
#

# onnx is too old

#

check version of onnx

rich moth
#

I plugged the UCF into a three body problem simulation.

cloud zenith
#

Also that's not what the error says...

rich moth
#

Aligns in the chatoic region just like my other research

lapis sequoia
rich moth
lapis sequoia
#

nice

#

I assume a lot of coding to get such shapes

rich moth
lapis sequoia
#

generally speaking

rich moth
#

300 lines for the three body problem stuff

lapis sequoia
#

oh dear PepeSuit

rich moth
#

just for visual

lapis sequoia
#

didn't use chatgpt to get it faster?

rich moth
#

i dont use chatgpt, but i do utilize AI

#

way I see it, my times limited on this planet. I got things todo

lapis sequoia
#

fair enough

rich moth
#

This ones ploty and matplotlib

#

R1136 makes my computer lag lol

#

you dont even wanna see the 700 nt one

lapis sequoia
#

bloody lots of code for graphs

#

are you working on a research?

arctic wedgeBOT
#

:incoming_envelope: :ok_hand: applied timeout to @rich moth until <t:1747637582:f> (10 minutes) (reason: attachments spam - sent 7 attachments).

The <@&831776746206265384> have been alerted for review.

sudden canyon
#

!unmute @rich moth

arctic wedgeBOT
#

:x: There's no active timeout infraction for user @rich moth.

sudden canyon
#

Huh

#

Oh

obtuse acorn
#

so if im using scikit learns gridsearch and ive got unbalanced categories, which score function should i use?

#

im currently using f1_macro

#

but idk if i should be using roc_auc_ovo or one of the other ones

verbal oar
#

ok so not issue with onnx

obtuse acorn
#

any idea what it means if a model has like 99% accuracy on both test and train data?

#

like is that overfitting or is it just really accurate?

#

i dont think theres any data leakage or anything

waxen kindle
#

It is really accurate OR yout testing dataset is including into your train dataset

obtuse acorn
#

im using a pipeline in scikit learn

verbal oar
#

show graph please

#

of these curves

past meteor
#

f1_macro and so on all make the assumption that the cost of misclassification is the same

obtuse acorn
#

oh yeah

#

thats a good point

past meteor
#

All classification problems I've worked on in the past month all had assymmetric costs. I really needed to optimize for precision or recall

#

People have probably gotten tired of me asking "Do we care more about false positives or false negatives"

#

But that's the reflex you need 🙂

obtuse acorn
#

hmmm

past meteor
#

(even if your dataset is balanced)

obtuse acorn
#

its involving attack types

#

so like theres categories like ddos and normal etc

past meteor
#

So it's multiclass?

#

Or even multilabel?

obtuse acorn
#

ignore number 5, its not in the version im using

#

tho i could recreate number 5 from number 6

#

its for a uni assignment and they removed a column

past meteor
#

So you're predicting #6

obtuse acorn
#

yeah

past meteor
#

Each record belongs to just 1 attack

obtuse acorn
#

yeah

past meteor
#

Exactly 1, not 0 not 1+?

obtuse acorn
#

yeah

past meteor
#

If it's a school assignment and not a "real life" problem then f1_macro or similar is probably fine

agile cobalt
obtuse acorn
#

tho im guessing it would probably be better if it miscategorised something thats normal as an attack than an attack as normal

past meteor
#

Yeah in the wild a false negative is worse

#

You'd want to flag more things and have that as a starting point to investigate

obtuse acorn
past meteor
#

And I'd sell it to "business people" as possible attacks

obtuse acorn
#

do i need to use TimeSeriesSplit?

past meteor
#

Hence recall > precision here

agile cobalt
#

let me rephrase: Did you shuffle it before splitting or take the tail as the test data?

obtuse acorn
#

shuffle im pretty sure

#

yeah it shuffles by default

agile cobalt
#

interpolating is a lot easier (and arguably less useful) than extrapolating

if you included the records for 18:33:31 and 18:33:41 for a given day, then it should be easy for the model to guess that everything in between those two timestamps has the same label

past meteor
#

not sure if you need a time series split

#

But maybe yes

#

Look at the data and see if you have correlations along the time axis yeah

agile cobalt
#

in contrast, if you ask for the model to predict a label for a day that was not present in your data the chances for it to get it wrong are much, much higher

obtuse acorn
past meteor
#

It's a very strange case the more that I look at it

obtuse acorn
#

yeah i have no idea how the dataset actually works

past meteor
#

If you random split your accuracy will be near 100 %

obtuse acorn
#

its like gps data or something

past meteor
#

Due to what Etrotta is talking about

obtuse acorn
#

i could switch to one of the other datasets

#

it wouldnt be very difficult

past meteor
#

You have N data points from each attack

agile cobalt
past meteor
#

If you drop all features except time and do a random split you have near 100 % accuracy

#

"Oh it's around 18:30, what attack happened there? I see, that's when we had the ddos"

agile cobalt
obtuse acorn
#

i dont get how the date and latitude and longitude works to tell what the attack type is

past meteor
#

Could be that they're using a specific data centre for ddos

obtuse acorn
cloud zenith
#

Hey! I've narrowed down my error to being unable to install onnxconverter_common for some weird reason. I have CMake installed, I have Visual Studio installed, the PATH variables are updated, long file names in Windows are enabled. Whenever I try to install that module, latest version for Python 3.13.2, it tries to build something called "wheels", waits for like 5 minutes, and then gives me this monstrosity of an error many thousands of lines long that ends with this:

#

Does anyone know what could possibly be causing this?

agile cobalt
# obtuse acorn

lol what, what are those points in the ocean? islands or it's normalized/scaled in some way

past meteor
#

or mock data

obtuse acorn
agile cobalt
#

uhhh usually long goes -180 +180 and lat goes -90 +90

#

there are some different scales and other special ways of measuring, but still seems very weird

agile cobalt
cloud zenith
#

I've just never used conda so I don't understand how it'd be different as to what I'm doing here

obtuse acorn
final jolt
obtuse acorn
#

longitude = (longitude % 360 + 540) % 360 - 180 turns it into -180 to 180

agile cobalt
# obtuse acorn the code from this worked https://gis.stackexchange.com/questions/303300/calcula...

unless you find some documentation explicitly saying that this is indeed how they constructed the dataset, there is no guarantee it is correct

that question is specifically adjusting it given the way vue-leaflet works, a dataset created using different tool may have a different logic

unless you plot it and see the coordinates make perfect sense (e.g. all points are in cities with datacenters) I wouldn't rely on it

obtuse acorn
#

well i guess that would make sense

agile cobalt
#

there is a chance the dataset is just completely senseless I guess
(random mocked data)

obtuse acorn
#

i emailed my lecturer to ask about it

agile cobalt
#

in first place, did you include any rows with no ongoing attack or you always predict some kind of attack?

obtuse acorn
#

theres a category called normal if thats what you mean?

agile cobalt
#

oh

obtuse acorn
#

right i checked the original source, everything that isnt normal is an attack

#

becuase i wasnt sure if password was a type of attack

#

but it is apparently

final jolt
#

which type of data in this were you using? Also, possible theory. is your glove view backwards(mirrored) by chance? since so many end up in the ocean I wonder if its inverted.

#

my guess is in the processed_network_dataset based on the contents

final jolt
# obtuse acorn

which dataset in that site are you using for this? Trying to look at its formatting but lots of files here. certainly one of the processed ones it seems

obtuse acorn
#

except its not actually the one from that site

#

its like a small section of it

#

basically my uni said heres 6 datasets we modified and links to where the originals and information about them is

#

so its this one but no column 5

#

also yeah i think its the preprocessed folder

final jolt
#

Yea I dont think the coloumn labels for this sheet are correct

#

Well I should state that I dont know why the lat/lon numbers are so high but simply doing the calculation you posted should result in correct data. Though no idea why the plotting doesnt land on actual, well, land

obtuse acorn
#

could be weather balloons or something i guess

final jolt
#

Well yea I do know that it works regardless generally speaking.

#

I mean weatherballons or similar would certainly align with the inherent issue in IoT device security in general

obtuse acorn
#

that looks neat

cobalt rover
#

Hey there, I ran into hardware constraints while trying to finetune 3B and 8B variants of qwen2.5 with fp16 and bf16 precision (Bzzt, OOM errors). I have access to a total of 48(24+24) GB of VRAM but this is clearly not enough to train them in full precision so I have reverted to using 8-bit quantized models for the same. For some reason on the internet, everyone seems to be training their quantized models with LoRA and I wished to know if it will be possible to train these quants with SFT/RL without relying on LoRA as I do want to change the base model's weights.

final jolt
thick heron
#

Is custom pcb worth money?

#

Just to run a mid size ml

final jolt
thick heron
#

Running a multi model yolo cc and then ocr together with a base ml models

final jolt
#

nevermind, I think I was thinking of a different definition of 'custom pcb'

thick heron
#

Oh

#

No not that one

river cape
#

Have you used QAT?

#

It's kinda complex so my suggestion would be LoRA unless you have massive power

cobalt rover
# river cape Have you used QAT?

came across it while researching more training methods but QAT seems to be useful primarily for training models that have to be quantized by the end of the training process.

cobalt rover
river cape
cobalt rover
#

yeah it's a coding task for a particular language

#

hope it's going to be enough- my earlier misunderstanding of ignoring that loading into the memory =/= VRAM consumed during training will cost me some days of progress welp

obtuse acorn
#

am i not supposed to do dataFiltered.latitude = (dataFiltered.latitude % 360 + 540) % 360 - 180 to overwrite the latitude in the dataframe?

#

i got a chained asssignment warning

lapis sequoia
#

what should the sequence length for a LSTM be over a very long period of time?

river cape
lapis sequoia
lapis sequoia
river cape
#

Helps understanding context better

lapis sequoia
river cape
#

and nowadays most of the tasks like in nlp are done by transformers , so very less use cases of the previous networks

#

but its good to know

lapis sequoia
river cape
cobalt rover
# lapis sequoia from the 40's until now

i actually meant to ask what kind of data you are working with here. Anyways, a good suggestion would be to begin with 7 as that can cover a lot of time(week) and should serve as a good starting point. Besides this, you can experiment with various values and pick the one which fits your loss expectations the best! You might have some problems if its the first time working with LSTMs directly, but that's also how i started and i'm sure gpt/gemini/claude can help a lot here!

obtuse acorn
verbal oar
#

is it possible to train llm on laptop instead of on cloud and it just will take much time?
so money saved but time not?

#

how it goes?

#

1-2 weeks of training on some A100 or the like

#

so it would take few months, estimated, not on A100, but some pc gpu

#

I have iris xe

serene scaffold
verbal oar
#

yes but with some weights checkpointing

serene scaffold
#

And if you spent a few months trying it anyway, you'd fry the laptop

verbal oar
#

ah finetuning not training read wrongly sorry

serene scaffold
#

You can't train an LLM from scratch on any consumer hardware

#

And if you mean "training but not from scratch", that's what fine tuning is

verbal oar
#

so tldr gen ai must be done only on cloud?

#

sorry I thought this way train llm on some compute powerful, cost few million of $
but training llm on laptop would be free (not considering power consumption)
but would just takes longer

#

but this not working like this

#

because then companies would train for months for free

#

so in short I thought can just split compute

#

but training some language model is possible on laptop? (not llm)

#

I remember I used colab for resnet50 and vgg16 so they too are not possible to train on laptop?

#

so question is from what number of parameters its not possible to train from scratch on laptop?

#

ok also I remember resnet and vgg were pretrained and it was about transfer learning

agile cobalt
#

fine tuning is possible on high end consumer hardware, but not laptop level hardware

You can use Google Colab or Kaggle to borrow GPUs from google for free

verbal oar
#

yes but for noncommercial use, for learning

#

if it would be for commercial, then meta would use just colab 😂

agile cobalt
#

a single training run (*from scratch) costs millions of dolars worth of computing power

verbal oar
#

some petabytes (scale of data)?

#

ah I have just 1TB

final jolt
#

its more like the point is, commercial LLM cost money to train. There are not free versions at that scale even for "smaller" sets of data.

#

Because of the sheer amount of GPU power required to actually do it in an amount of time that is not absurd

fringe jay
#

could someone help me in #1374233576706408458 ? I've been tweacking it for hours and I cant seem to fix it

unkempt apex
hexed yew
#

Any advice for imputing missing categorical data ? None of my variables appear to cluster well or have relationships with the categorical variable

polar hornet
#

Hi guys, so i have an assignment at school that requires an expert in the field of artificial intelligence to be interviewed for my scientific article assignment. I really hope someone could help me here

serene scaffold
polar hornet
heady pivot
#

Is this chat a good place to ask abou data engineering stuff?

heady pivot
#

I'm a data engineer with 2 years of experience. Currently, I'm looking to start an AWS certification, but after studying through AWS Skill Builder, it seems more like a marketing stunt than a real certification. Based on my experience, most AWS services feel like auto-managed versions of open-source tools. At my startup, cost is a huge concern, so aside from Redshift, Lambda, and RDS, we avoid other AWS services. Am I wrong for sticking with hosting everything on EC2 (e.g., Kafka, Airflow, dbt for ETL) and using Lambda for code execution? This is how I’m handling things now. Any advice would be much appreciated!

Basically, all my problems are solved with SQL on RedShift and relatively simple Python scripts in Lambda (serverless). This setup handles everything we need right now!

steel spindle
#

How do you create AI in python?

#

I am new to it

serene scaffold
steel spindle
#

Artifical intelligence, something that can talk to you, like a person

serene scaffold
steel spindle
#

No, I would consider it, but a diffent type of AI

serene scaffold
#

Okay, so if you say that you want to "create AI", you have to be specific about what kind you're talking about

#

What you're describing is probably an interactive language model. You can't create those from scratch.

#

They cost millions of dollars to create

#

There are other things you can do with AI that are attainable

verbal oar
#

how can I get know even a little about llamaindex,langchain,crewai?

#

what is best option official docs?

viscid urchin
#

I'm only really familiar with the LangChain part of it, and dang there is a lot of surface area to cover. I used the official docs myself, which seem pretty nice.. it's just a lot to take in before you might understand the "idiomatic" way to do something with it.

lapis sequoia
viscid urchin
#

I'll have to give it a shot, but not every project's actual test suite works on Windows sadly.. lemme see if it's obvious whether that's the case here

#

Not a great sign, they do not appear to have automated Windows builds in their GitHub action setup.

regal bane
#

wsl is a option

lapis sequoia
lapis sequoia
viscid urchin
#

That's after uv sync etc like their docs suggest.

#

I guess all I can say is that this is an obvious place where a new contributor could make a positive impact on the project.

#

It just needs some stuff set up, like cross-platform in their CI config instead of just Linux

#

I'm sure these are just tests that aren't perfect yet etc rather than the lib being massively broken on Windows.

#

I actually do not love the style of this test suite implementation

regal bane
#

all your work can be done on windows and you pop up a wsl terminal to use lang chain

lapis sequoia
#

@viscid urchin Thanks for trying and it's okay don't worry about the time of respone. yeah, I got this amount of errors as well before, is it okay to ignore or what then? like is it safe to ignore them and do the work

regal bane
#

definately not optimal but it works

viscid urchin
lapis sequoia
#

anyway thanks to all of you @viscid urchin @regal bane

viscid urchin
#

Honestly you might consider filing an issue for "please add Windows to your CI build"

#

Somebody might come along and do it

#

(I might even do it)

#

If I used LC "in anger" I 100% would.

lapis sequoia
#

okay mate, I will see thanks

viscid urchin
#

(I did just look pretty hard on their Issues list and there are a lot of things mentioning Windows, but nothing that seems to be asking to enhance the automated tests that get run.)

lapis sequoia
#

who contribute a lot, idk. this is just my guess

viscid urchin
#

Yeah, I'm one of those weirdos who runs a "Windows native zsh" env

lapis sequoia
#

did you find an issue about this case or it's better to open one?

viscid urchin
#

I'd open one; didn't find one that looked good to jump on.

lapis sequoia
#

I am a windows lover tbh, I used linux for quite good time but didn't like it. although I studied it and so on

viscid urchin
#

Just be super clear/polite/etc and describe the problem + proposed next step etc.

#

Yeah I've never come to love Linux.. (I do love FreeBSD though)

lapis sequoia
viscid urchin
#

Go ahead if you've got the inclination; I'm feeling lazy, just catching up on MotoGP 🙂

#

I'll gladly star/react/etc it if you do though 🍹

lapis sequoia
#

for now, I am feeling lazy too. lol maybe another time

#

Do you contribute in Langchain?

viscid urchin
#

No, but I've been toying with the idea to learn it better

#

and honestly you've found some low-hanging fruit that I might work on

lapis sequoia
#

nice, good luck

obtuse acorn
#

any idea why my MLPClassifier from scikit learn performs better when i do a gridsearch cv but worse when i just fit it with the pipe?

#

i think it might be something to do with the cross validation?

#

im using skf = StratifiedKFold(n_splits=5, shuffle=False) because ive got time series data and i figured it would be best ot keep it in order

torpid mirage
#

You watch MotoGP?

#

My homie

#

🫂

obtuse acorn
# obtuse acorn im using `skf = StratifiedKFold(n_splits=5, shuffle=False)` because ive got time...

so i split it once at the beginning to to get a test and train split

skf = StratifiedKFold(n_splits=5, shuffle=False)
skf.get_n_splits(X, y)
groups = dataFiltered[target].values

for train_index, val_index in skf.split(X, y):
    train_set = dataFiltered.iloc[train_index]
    test_set = dataFiltered.iloc[val_index]
    X_train, y_train = train_set.drop(columns=[target]), train_set[target]
    X_test, y_test = test_set.drop(columns=[target]), test_set[target]
#

then i ran

gridSearch = GridSearchCV(pipe, param_grid=param_grid, scoring='f1_macro', cv=skf, n_jobs=-1)

gridSearch.fit(X_train, y_train)
#

is it because i set cv=skf?

#

and somehow its matching the gridsearch results now

#

no idea whats going on

#

it was like 100% train accuracy and 85% test accuracy after the grid search

#

and then it was like 20% for both when i just fitted the pipe

viscid urchin
jaunty helm
#

people familiar with sktime: how do I use parallel processing with transformations like Catch22?

#

I think I've tracked it down to

c22 = Catch22().set_config( ... )
```but nothing I put in `set_config` seems to do anything,
```py
cfg = { "backend:parallel": "loky" }
cfg = { "backend": "joblib" }
```etc, cpu usage is about the same
polar hornet
coral sage
#

Hi, I'm trying to train a yolov11n model (to run on mobile devices) and I'm trying to train it using the entire COCO dataset (for real-time object detection). Problem is I vastly underestimated how long it was going to take to train and I wanted to know if there's anything I'm doing wrong or anything I can do to speed up the process.

Here's my code below (I haven't even changed much, it's mostly just straight from the ultralytics documentation except the dropout, patience and device (because I'm using an M1 Pro Macbook))

from ultralytics import YOLO

# Load a model
model = YOLO("yolo11n.pt")  # load a pretrained model (recommended for training)

# Train the model
results = model.train(
    data="coco.yaml", 
    epochs=100, 
    imgsz=640, 
    patience=10, 
    device="mps", 
    dropout=0.01
)
#

The dataset is already installed and I had left it to train overnight but it didn't even complete one epoch

#

I estimated that it would complete at least two but I think the time per iteration increased significantly overnight

#

and it didn't even save a last.pt or best.pt model when I interrupted the block

#

I played around with the batch size, and it started taking upwards of 40 GB of Memory at one point (I only have 16 GB of RAM so the rest was SWAP), so I just left it back to the default.

obtuse acorn
#

any idea if i should drop day of the week or just leave it?

coral sage
obtuse acorn
#

type

coral sage
#

I'm no expert, but I'd leave it in probably. What's the difference between day and day of the week?

coral sage
#

and it's not like the other features have a high correlation either with respect to day of the week which makes it less signifcant

charred ferry
#

can i ask about data analytics, big data, data lakes and data warehouse here?

#

I assume this is the correct channel but just wanna be sure.

#

Basically, I am deciding between a building a data warehouse project or a project that involves big data concepts, data lake, machine and basically data analytics for real-time recommendations. I'm unsure which to go for. Is there anyone who worked on either and can share their opinion on how their experience was like while working on on their work/project?

#

I am asking this because as soon as I choose my final year project then that is likely the field I will be going into as a junior developer (whatever u call it) since this would be the biggest project I ever produced (when I complete it).

obtuse acorn
#

day of the week is like, its a monday

final jolt
# charred ferry I am asking this because as soon as I choose my final year project then that is ...

Is this like end of the year for a 4 year degree project or something else? Unless this is some like guided schooling where you go right from school>internship>employment the project is probably not going to have as a massive as impact in the sense of forcing you into one side or the other in your career. What is your degree in and what kind of projects have you done so far? What level of interest do you have in either category?

jaunty helm
# jaunty helm I think I've tracked it down to ```py c22 = Catch22().set_config( ... ) ```but n...

update: seems to be some strangeness with pipelines in sktime, using it directly does seem to employ parallelism now (cpu high):

c22 = Catch22().set_config({
  'backend:parallel': 'loky', 
  'backend:parallel:params': {
    'n_jobs': -1  # technically not needed because -1 is the default
  }
})
c22.fit_transform(time_series)

though I don't really see a difference in run time
in contrast, Catch22Wrapper requires pycatch22 but is like a bjillion times faster

#

for reference: I've a multivariate (6) time series, about 2500 in length
Catch22 takes ~1min to fit_transform
Catch22Wrapper takes ~0.08sec to fit_transform

jaunty helm
#

my first time experience with sktime definitely isn't the best

another example: I can't seem to get something as simple as chopping / padding all time series to a length of 2500 to work

preprocess_pipeline = (
  TruncationTransformer(2500) 
  * PaddingTransformer(2500)
)
preprocess_pipeline = (
  PaddingTransformer(2500)
  * TruncationTransformer(2500) 
)
```these 2 both don't work, throwing out some error I'm not sure how to fix

eventually I just did the truncation part manually through some `polars` `filter`ing on the index, leaving me with only `PaddingTransformer`
then there's another performance issue, as it takes several minutes *just* to do what should be a simple pad (granted I do have a lot of data)
eventually I ditched it as well and tried only `polars`, the resulting code again only takes a few seconds
```py
# something like this for padding
(
  df
  .filter(c('time_series_id').is_first_distinct())
  .select(
    'time_series_id', 
    pl.lit(list(range(pad_len))).alias('index')
  )
  .explode('index')
  .join( ... )
agile cobalt
#

maybe pl.int_ranges(pad_len).alias('index') instead of pl.lit(list(range(...)))

jaunty helm
agile cobalt
#

int_range or int_ranges?

jaunty helm
#

or maybe I haven't tried that idk, my brain is frying from debugging

jaunty helm
agile cobalt
#

but yeah common polars W Chad
I haven't messed much with its time series related features, but it has a lot of methods specifically for it too

jaunty helm
#

unfortunate that the integration is still lagging behind 😔

#

if I use sktime again, some of the stuff supports polars while others don't, so it's probably easier to just stick to pandas (or at least, before you pass into the transforms)
also there's no tutorials explaining how you'd use a polars dataframe with the transforms, I figured it out by code digging: columns starting with __index__ will be recognized as the time series id / time index / etc
so actually I had to have column names like __index__time_series_id or __index__time, then down the line find that some don't work and .to_pandas() anyway

verbal oar
#

I think shuffle=True

frigid niche
#

Would it be appropriate to post in here an academic website I made detailing my Neural Network that runs on a TI 84 Plus Silver Edition capable of autocorrecting words?

serene scaffold
frigid niche
rich moth
#

I feel like this is the smoking gun. What do you guys think? It's domain distribution of 62 datasets across 4 domains. I found in my research all of them follow the same mathematical law when ranked. Information itself has a universal structure...

#

Information organizes itself different in complexity space. But even in chaos within the constraints of physical laws, there is structure.

#

Makes you really wonder about the universe itself..

plush kettle
#

Guys how do you train an RCNN model that generates 2000 proposals on colab, I tried and it just cradhed because the ram isnt enough

#

So, I modified the original RCNN’s selective search, to generate 1/4 of the original proposal size

#

Also is it normal for RCNN to start with insane loss like say 200 or 100

#

Also how do I remove tensor from GPU ram I tried del tensor and cuda cache remove but cant

verbal oar
#

detach?

thick pier
#

hi can anyone instruct how to start with data science while u have no knowledge whatsoever

radiant cipher
#

anyone aware of a open source lib that facilitates agents, data retrieval, memory and memory usage

charred ferry
# final jolt Is this like end of the year for a 4 year degree project or something else? Unl...

It is for the final year of my BSc Computer Science degree. I'm gonna be entering final year in the upcoming September. Basically due to the last final year student's performance with the final year project being bad, the teachers decided for the final year project to be started in the summer (for those who are going to be in the final year of their degree in September). I have interest in data analytics and data warehouses. In particular I love machine learning with data analytics. In fact, I was going to that project instead (data analytics with machine learning). I started learning the basics of machine learning. I have beginner knowledge of Pandas. I am good with Python. Right now I am trying to look for beginner friendly projects I can work. I want to do this because for my final year project I will need a teacher to act as my supervisor for my final year project. Some teachers may ask for my CV and experience with machine learning and data analystics. I hope to do 1 or 2 beginner friendly projects so I can make convince a teacher that I am able to learn the required concepts in order to do the project I choose.

agile cobalt
radiant cipher
charred ferry
#

Are websites like these good start for someone wanting to do beginner friendly machine learning projects? Advise on what projects to do as i progress would be helpful (so I can get a feel of machin learning and get practical experience/improve my knowledge)https://www.freecodecamp.org/news/how-to-build-a-house-price-prediction-model/

freeCodeCamp.org

Ever wondered how algorithms predict future house prices, stock market trends, or even your next movie preference? The answer lies in a fundamental yet powerful tool called linear regression. Don't be fooled by its seemingly simple equation – this ar...

agile cobalt
radiant cipher
# agile cobalt that would probably end up crazy expensive if you show the entire repository eac...

my current mess of an idea is to try and create memories the llms use to compare and see how i can wire them up - its entirely ok if a run on 100 repos takes a night on controlled hardware as long as i get the single starting poitns working

what i'm trying to do is find cargo cult-ed instances of initial iterations of ideas that where adopted across many ropes - and then reporting and/or trying to suggest a fix

agile cobalt
#

just to check, do you understand the difference between RAG ""memories"" and in-context "memory"? specially how much the model knows about each

radiant cipher
# agile cobalt just to check, do you understand the difference between RAG ""memories"" and in-...

indeed - i believe i have to sort that out - i may run into a situation where i have to run hundreds of prompts + do memory/storage to ensure in context memory first - rag memory may end up just being something that keeps track so i can split the problem into more chunks

it would be so nice if there was a way to make context fragments and combinations of them instead of always streaming the tokens

radiant cipher
agile cobalt
#

pretty sure it is fine (posting relevant yt links)

serene grail
radiant cipher
#

https://www.youtube.com/watch?v=YNQKq1YfBAI discusses a paper that has llms make and use memories from the tokesn they take - it claims to be better than infinite context

unfortunately there doesnt seem to be a implementation linked

HUMAN-LIKE EPISODIC MEMORY FOR INFINITE CONTEXT LLMS
ArXiv: https://arxiv.org/abs/2407.09450
Bytez: https://bytez.com/docs/arxiv/2407.09450
AlphaXiv: https://alphaxiv.org/abs/2407.09450

Support my learning journey either by clicking the Join button above, becoming a Patreon member, or a one-time Venmo!
https://patreon.com/Tunadorable
https://ac...

▶ Play video
agile cobalt
serene grail
agile cobalt
serene grail
#

Yeah that makes sense, thank you

agile cobalt
radiant cipher
radiant cipher
#

hmm - oh - i jsut learned about the model context protocol - that may be neat to put agents together

burnt geode
#

Hi,
anyone with speech to speech realtime LLM experience in python?
ping me we need to develop the llm with function calling ability.

jaunty helm
# serene grail And the context window is ultimately limited by your VRAM right?

there's also a "hard limit" of sorts of what the base model was trained on which some techniques can get around while degrading quality

  • e.g. llama 3 was trained on text that was 8192 tokens at the longest, so that's the native limit you'll see being thrown around
  • during inference you can use Rotary Positional Embeddings (RoPE) to extend that while degrading the quality of responses a bit
  • I believe that's the technique used in tuning llama 3.1 so it can "have 128k context" even though it's based on llama 3

and as mentioned by etrotta, there's a much-easier-to-hit soft limit of things being in context, but the llm being unable to utilize them
see RULER and the newer NoLiMa benchmarks

agile cobalt
woven prairie
#

Hi

#

I want to build Gen AI project for Resume, Any suggestions

charred ferry
#
dict_1 = {'Ideal':5, 'Premium':4, 'Very Good':3, 'Good':2, 'Fair':1}
diamonds_df.cut = diamonds_df.cut.replace(dict_1)

dict_2 = {'D':7, 'E':6, 'F':5, 'G':4, 'H':3, 'I':2, 'J':1}
diamonds_df.color = diamonds_df.color.replace(dict_2)

dict_3 = {'IF':8, 'VVS1':7, 'VVS2':6, 'VS1':5, 'VS2':4, 'SI1':3, 'SI2':2, 'I1':1}
diamonds_df.clarity = diamonds_df.clarity.replace(dict_3)

# renaming the 'x','y','z' columns to more descriptive names
diamonds_df = diamonds_df.rename(columns={'x':'length_mm', 'y':'width_mm', 'z':'depth_mm'})

# removing dimensionless diamonds
diamonds_df = (diamonds_df.drop(diamonds_df[diamonds_df['length_mm']==0].index))
diamonds_df = (diamonds_df.drop(diamonds_df[diamonds_df['width_mm']==0].index))
diamonds_df = (diamonds_df.drop(diamonds_df[diamonds_df['depth_mm']==0].index))

# dropping duplicated rows in the DataFrame if there are any
diamonds_df = diamonds_df.drop_duplicates() ```i am getting this message: "FutureWarning: Downcasting behavior in ⁠ replace ⁠ is deprecated and will be removed in a future version. To retain the old behavior, explicitly call `result". Is replace deprecated? Will it lose support in future? Im new to Pandas. I just want to know if I should use .replace() or is it not good to use?
simple mist
#

You can also run pd.set_option('future.no_silent_downcasting', True) to tell it to stop warning you

rich moth
#

Please let me know if visuals come in

verbal oar
#

I had about diamond price prediction but it was in excel

#

or sth not about coding in python

dire kiln
#

Good Morning

#

I'm running LLama 3.2 1B with ONNX and DirectML because my AMD card is old. Loading it consumes 5.3GB of VRAM out of 8GB, which is okay, as long as it doesn't take it all.

#

1 initial prompt + 3 follow ups is enough to consumo the rest of the 8GB total VRAM. From the 5th prompt onwards it gets really slow. Still better than CPU, but worrying.

#
  • Is this normal?
  • Are sessions stored in VRAM?
  • Is there a fix or a way to reduce VRAM usage?
#

I ran DeepSeek R1 using a converted model I pulled from HuggingFace and it was capable of prompting again and again just fine. Probably I didn't test it enough because the lack of an openai-compatible API convinced me to delete it. But I wonder if I'm doing something wrong or am ignorant about how this works.

#

This is my first time doing this but I'm a junior/mid level python dev

#

Tried Phi3.5-mini but there was a leak that doubled the VRAM usage on the first prompt and the model kept appending the answer over and over until it ran out of tokens and returned HTTP code 500.

#

Using Lemonade SDK as runtime+REST API

#

Maybe use Hybrid models that do integer calculus on the CPU to kinda split the data between RAMs? Idk, just brainstorming

stiff crown
#

PACE methodology or CRISP-DM ?

dire kiln
coral wyvern
#

I have a dataset of 1.5 million users anime lists, and I want to build an anime recommendation website. But I have no idea how much a project of this scale would cost. Is there anyone who can give me rough estimate and maybe break down the expenses?

agile cobalt
#

it depends, if you run everything locally and do not host anything on the cloud it's only going to cost electricity and time

you could also train some models in Google Colab and host in Hugging Face Spaces free of charge

it could get pretty expensive if you were to rent enough compute to handle thousands of users accessing it daily though

toxic pilot
#

it depends on what methodology they use tbh

toxic pilot
#

gpt2 has a context window of 1024 tokens

dire kiln
dire kiln
dire kiln
# toxic pilot what do you mean?

The context's data. Idk how this work so pardon me. Could it be stored somewhere else at the cost of latency so that it doesn't keep consuming more and more VRAM?

#

Because a single very small context in amount of follow up prompts (4 prompts) is enough to take the remaining ~2.6GB of VRAM

#

I can still use it but it becomes very slow as it tries to free memory or use regular RAM, which is what I want at the end of the day: share the load. But in assume there's a more formal way of implementing this behavior?

toxic pilot
#

in a model with millions of trainable parameters, a 1024 token or even a 100000 token context window takes up negligible space

toxic pilot
#

what model?

dire kiln
#

Either this, OpenCL or CPU as far as runtimes go

dire kiln
toxic pilot
#

but i don’t think it’s an issue with the context window necessarily

dire kiln
# toxic pilot maybe a memory leak

Could it be something in how the model is converted to ONNX? I converted models before and the output suggests that there are losses in precision at least in my case. Those are official models, though. Converted, configured and fine tuned by AMD. They're hosted in the official organization at HuggingFace.

#

Maybe a leak in the runtime version that Lemonade depends on. Because of C extensions.

#

What about I download Lemonade's source and keep bumping versions of dependencies to see if it stops. Could it possible work? xD

finite surge
#

guys how do i get into making ai because im stuck all i know is to learn python rn im watching bro code idk if i shoulf switch to freecodecamp

finite surge
serene scaffold
finite surge
#

aight

#

its artificial intelligence for me i wanna make like and programme where someone gives me info it can give info back and thats all, i know and i wanna make money by solving problem and wanna keep improving and not do a 9-5

serene scaffold
finite surge
serene scaffold
finite surge
serene scaffold
finite surge
#

was that a test or smth im confused

serene scaffold
# finite surge ?

I'm not going to help you with something that I think is misguided and a waste of your time. If you're interested in actually learning about and understanding AI and preparing for a career in that space, I'm happy to help.

finite surge
#

should i try cause im 14 so i wanted to make ai

serene scaffold
finite surge
#

dont i gotta do college for cs or smth i dont really know

small igloo
#

Hi I was thinking of making a CNN model to track real-time deforestation using satellite imagery, what dataset should I be using?

finite surge
#

O

rich moth
# finite surge O

Don't listen to people tell you can't do something, if you wanna do it, go Nike on it and just do it. The worst you will do is fail and possible learn something. This isn't rock climbing. But you need a better plan or idea, and start researching how you want to work on it. You have all the tools at your fingertips, I recommended start getting better with those first.

gritty vessel
#

Hey guys I wanted to know How to train model on huge data

#

My Features are of shape for training 584,1536,1392,7 and targets 584,1536,1392

#

I kept to train a model at night and It has not even completed 1 epoch yet

#

All data is about 100gb

#

so i stored both features and targets in seprate npy file and then I am training them in batch so all data is not loaded in ram

#

any other way I can train little faster?

#

Or it does seems unusual actually to training this much time for 1 epoch

iron basalt
gritty vessel
#

yes images we can say

iron basalt
#

Downscale if applicable.

gritty vessel
#

size of arrays is 1536,1392

#

584 are timestamps

#

and 7 are channels

gritty vessel
gritty vessel
#

I have one more doubt so as my model is training my ram usage is increasing

#

all data is about 100gb

#

so at max it sahould take 100gb and Im training it in batches

#

and still its taking 133gb ram

#

on idle its around10-15gb

lapis sequoia
#

Dude I am creating most powerful research tool x model

sly isle
#

What are the best courses/certificates for Data Science in 2025? 🤔

serene scaffold
sly isle
serene scaffold
sly isle
serene scaffold
sly isle
serene scaffold
#

That's what I did, and it's the main reason I got a job.

serene scaffold
#

Virginia Commonwealth University

sly isle
rain kelp
#

is this a good neural network model?

toxic pilot
#

if t hats the loss & train for something like MNIST, then you might be able to do better

rain kelp
#

can i ask you if this one is better? i am new to this stuff so i cant compare to other graphs

toxic pilot
#

if it is loss & train for something like NLP, then ur doing really well

rain kelp
#

chat gpt told me the first one is better but i am worried about those spikes

toxic pilot
rain kelp
#

alr thanks!

toxic pilot
toxic pilot
rain kelp
#

model = models.Sequential()
model.add(layers.Conv2D(32, (3,3), activation='relu', input_shape=(32, 32, 3)))
model.add(layers.BatchNormalization())
model.add(layers.MaxPooling2D(2,2))

model.add(layers.Conv2D(64, (3,3), activation='relu'))
model.add(layers.BatchNormalization())
model.add(layers.MaxPooling2D(2,2))

model.add(layers.Conv2D(64, (3,3), activation='relu'))
model.add(layers.BatchNormalization())

model.add(layers.GlobalAveragePooling2D())
model.add(layers.Dropout(0.5))
model.add(layers.Dense(64, activation='relu'))
model.add(layers.BatchNormalization())
model.add(layers.Dense(10, activation='softmax'))

#

this is my model

toxic pilot
#

what are you trying to do>

rain kelp
#

i am learning image classification

toxic pilot
#

also you could experiment with filter sizes as well

#

maybe 3x3, 5x5, 7x7

#

oh for Mnist?

rain kelp
#

what mnist?

toxic pilot
#

like handwritten number detection?

rain kelp
#

no i am using a dataset with 10 image classes and trying to classify the test data

toxic pilot
#

ah okay

rain kelp
#

this one:
(training_images, training_labels), (testing_images, testing_labels) = datasets.cifar10.load_data()

toxic pilot
#

ill just say that my newtork for a similar kind of thing was:

Conv2d --> BatchNorm --> ReLU --> Conv2d --> BatchNorm --> ReLU --> MaxPool 2d --> Conv2d --> BatchNorm --> ReLU --> Max Pool 2d --> Dropout(0.5) --> Dense --> dropout(0.5) --> Dense --> Dropout(0.5) --> Linear Output layer

rain kelp
#

ill try that one and compare! thanks

toxic pilot
#

i probably used too many batch norms 💀

#

idk what dimensions ur images are, but my conv filters were 3x3

rain kelp
#

also do u run stuff on your cpu or gpu? because i saw that using the nvidia gpu the training is much faster

toxic pilot
#

if ur on mac, use mps

rain kelp
toxic pilot
#

if ur on a cuda supporting machine, use Cuda obviously

#

otherwise CPU is probably fine

rain kelp
#

training_images, testing_images = training_images / 255, testing_images / 255

toxic pilot
#

layers.Conv2D(32, (3,3), activation='relu', input_shape=(32, 32, 3)) The filters (kernels) are the (3x3) thing

toxic pilot
#

should be fine

#

just means you get to use less feature maps in your conv layers

rain kelp
#

ok thanks for the help. so i guess when doing the model its always best to test many different things and see whats better

toxic pilot
#

for context, i ran on 24 epochs with a batchsize of 32

rain kelp
#

is there like a logic behind or just brute force it ahah

toxic pilot
#

with 60000 (i think?) images

rain kelp
#

this is how ive done it history = model.fit(
training_images,
training_labels,
epochs=30,
validation_data=(testing_images, testing_labels),
callbacks=[early_stop, reduce_lr]
)

#

ill try your model now

toxic pilot
#

dynamic lr is probably overkill but yeah, looks good

toxic pilot
#

im sure there is a deterministic way to do things 💀

rain kelp
#

yeah little by little ill learn this dark magic ahaha

toxic pilot
#

there is a formula to prevent over fitting; if you keep the number of hidden neurons below N_h = N_s/(alpha * (N_i + N_o)) where N_i = # input, N_o = # output, N_s = # samples in training set and alpha = some scaling factor between 5-->10

toxic pilot
#

or Tensorflow or whatever

rain kelp
#

yeah

toxic pilot
#

sweet

rain kelp
#

you used torch? i have never done that yet

toxic pilot
#

i use torch

#

and also burn.dev, which is a rust framework

#

it really doesnt matter imo

rain kelp
#

whats the difference?

toxic pilot
rain kelp
#

ah ok so ill just learn one of the 2

toxic pilot
#

i think TF might be more performant (????) but it really doesnt matter

rain kelp
#

next project ill try that gpu stuff to speed thing up. when i read that i got very interested

toxic pilot
#

🔥

rain kelp
#

this is your model

toxic pilot
#

val > train accuracy is usually not good

rain kelp
#

ill probabluy have to increase the epochs?

toxic pilot
#

likely not

rain kelp
#

its overfitting?

toxic pilot
#

idk what channels you used

toxic pilot
rain kelp
toxic pilot
#

oh i used 32, 64, 128

#

also early stopping of 2 epochx

#

Adam with weightdecay of 1e-5

#

lr 1e-4

toxic pilot
# rain kelp

wait if ur gray scaling, why is your input 32x32x3

#

oh i also usex a 1x1 padding, (idk if thats the same as padding="same" in keras)

pure tundra
#

Hello am josephinewebexpert
Have a business idea but are having trouble launching it online? Come on, let's discuss. If you would like some free advice, please message me.

rain kelp
#

the data set has images of 32x32x3

toxic pilot
rain kelp
#

its basically image of planes, trucks, deer and other stuff. i have limited the training to 20k images and testing to 4k

toxic pilot
#

yeah no way ur overfitting

rain kelp
#

so over fitting is when the lines go far apart because it memorises the training images?

#

this is the 3rd model

toxic pilot
#

nor is train > val for loss

#

maybe simplify your model?

jaunty helm
toxic pilot
jaunty helm
toxic pilot
jaunty helm
#

training objective being worse than validation objective is also not a total disaster as it could occur naturally
like for example, having dropout layers (which you do) actively hurts model performance in training to seek a better generalized model
or say an image augmentation step like affine is a part of your training, then your training dataset keeps changing so the model can't really overfit it ever (unless it's highly overparameterized), so you might see that training loss stops decreasing at one point yet the validation loss keeps improving

rain kelp
#

ok thanks for all the help. i have learned a lot. I will put this project to rest now and come back once i learn new things

rain kelp
toxic pilot
jaunty helm
#

and it's also not that difficult to compare your different models; just compare the val loss of them
e.g. in your post that said:

is this a good neural network model? (img)
you can see that the val accuracy ended up at about 0.72
can i ask you if this one is better? i am new to this stuff so i cant compare to other graphs
In this post the val accuracy is also about 0.72
this is your model
in this post the val accuracy is only 0.5

so comparing the 3 in terms of performance, model 1 = model 2 > model 3 (roughly)
but model 3 can still be trained cause val loss is still improving

#

obviously you have to be a bit careful when you only compare on the same validation data cause in a way you're now just fitting to seen data
that's when cross validation comes in if you want to look that up

toxic pilot
#

i feel like a better solution for model 3 might be to downsize the model

#

also batch size could make a big difference

rain kelp
#

but so a 73% accuracy means that there is still a lot of room for improvement?

toxic pilot
#

see how testing accuracy is plateauing?

rain kelp
#

aah ok. ok ill do my last test where i train my first model to the whole dataset instead of just 20k images

jaunty helm
toxic pilot
toxic pilot
jaunty helm
rain kelp
jaunty helm
toxic pilot
jaunty helm
# rain kelp

looks better
whatever you changed made it so the accuracy plateaus at about 0.75
(tho again be careful about overfitting yourself on the validation set)

rain kelp
#

i just increased the training data from 20k to around 60k

toxic pilot
#

nice!

gritty vessel
#

its weather data so there are two conditions lightning and no lightning

#

i calculated manually so lightning events are only 3% of the dataset

#

should i apply weighted loss calculation?

#

my current run I believe will be completed till morning

#

currently loss is reducing but I am pretty sure Its because of no lightning cases

toxic pilot
gritty vessel
#

more dense model?

toxic pilot
#

im not sure what your goal is, but 100gb of data will certainly overfit

gritty vessel
#

my goal is to predict lightning

#

given 7 channels

#

but naturally no lightning events are much more higher than

#

lightning events

toxic pilot
#

still thihk you haev way too much data

gritty vessel
#

i did some calculation

limpid zenith
#

It will automatically handle class imbalance

gritty vessel
#

total points in data = 8,052,129,792

toxic pilot
limpid zenith
#

Yeah if the model is too large it will also take forever

gritty vessel
#

when I am using pytorch I always forget to do that

toxic pilot
limpid zenith
#

Poly loss? Like MSE?

gritty vessel
gritty vessel
#

so when we calculate percentage

#

damn its only 0.5%

limpid zenith
#

Oh didn't know about polyloss...awesome...learn something every day

toxic pilot
gritty vessel
#

3% I MIGHT HAVE MISS CALCULTED IT WRONG

#

SORRY

toxic pilot
#

just because you have a full dataset doesnt mean you should use the full dataset

gritty vessel
toxic pilot
#

your goal is to predict lightning; your model will learn the behavior/features it should expect before lightning vs not before lightning, and it shouldnt matter that not lighting occurs more frequently necessarily

#

so maybe randomly sample 42 million non lightinging events

#

and use that as your dataset

gritty vessel
#

ok

#

just one more thing Im passing 2d arrays so how will random sampling will work?

#

It will create patches in data then right?

toxic pilot
#

in fact your model will probably perform worse if ur doing a binary classification, and one of your cases is only consists of 3% of the data

toxic pilot
gritty vessel
#

consider it as an image

toxic pilot
#

try resample from sklearn

gritty vessel
#

Number of samples: 834
Shape of one sample: (1536, 1392, 7)

#

this is for features

toxic pilot
#

wait what

#

okay how many distinct images do you have

gritty vessel
#

and target is No of samples 834
Shape of one sample 1536,1392

gritty vessel
#

time stamps

toxic pilot
#

?

#

oh

gritty vessel
#

and each time stamp got 8 images 7 features and 1 target

toxic pilot
#

an lstm might actually be a good tool for this 💀

gritty vessel
#

convlstm?

#

I am planning to use it but I am first trying to predict normally

#

aftee this what I will do I will give lag in data

toxic pilot
#

well how do you plan to encode time series?

#

oh by concatting the images into one matrix?

gritty vessel
#

that will have information loss?

#

matrix is 2d

toxic pilot
#

well yes

gritty vessel
#

we can say ndim array ?

south finch
#

What's ndim

toxic pilot
gritty vessel
#

n dimensional

#

yes

south finch
limpid zenith
#

Looking at polyloss it seems like it needs class weights to prevent class imbalance.

#

Wouldn't Focal Loss be more appropriate if you don't want to compute class weights?

gritty vessel
#

yes I was reading about it

toxic pilot
#

just use cross entropy or something

#

and randomly sample an equal number of non-lightning cases as lightning cases

limpid zenith
#

Well 100gb class weights computation seems a bit much

gritty vessel
#

it says it modifies cross entropy loss that down-weights the loss for easily classified examples

toxic pilot
#

just dont use all the non-lightning data

gritty vessel
#

but how we can remove it from 2d grid?

#

one thing I think is to clip 128 x 128 or 256 x256 snaps

#

over lightning events

limpid zenith
#

Is this a binary classification probelm of lightning or no lightning?

toxic pilot
#

feeding in a time series, we want to find out if the next step is lightning or no lightning, is what im interpreting this as

gritty vessel
#

exactly

#

thats latter step I will give like time t features and targets will be t+2

toxic pilot
limpid zenith
#

Ahhh the events leading up to lightning will be used to predict lightning...yeah an LSTM is a good way for this with BCE

toxic pilot
#

jank as hell

#

i mean itd work probably, but its just conceptually hilarious

gritty vessel
gritty vessel
limpid zenith
gritty vessel
#

okie I Will try this focal loss,and resampling and conv lstm I will update you guys

#

is it ok?I mean can I update?

limpid zenith
#

Yeah if somethings off or some error just message here yeah

gritty vessel
#

Okie Thank you

rich moth
#

Testing a Universal Complexity Framework (UCF) across different data types
I've been working on a mathematical framework that measures how information organizes itself, and got some interesting cross-domain results I wanted to share.
What I tested: UCF assigns a "phase angle" (θ) to different types of data based on their complexity patterns. The theory predicts certain ranges for different domains:

Financial markets: ~90° ("controlled uncertainty")
Mathematical sequences: ~0° ("pure order")
Physical systems: ~180° ("conservation")

Financial validation results:
Tested 4 major cryptocurrencies, all landed in the predicted 70-110° range:

BTC: 86.2°
ETH: 102.6°
ADA: 91.4°
XRP: 91.5°

Unexpected discoveries:

Prime numbers → 116.7° (closer to biological optimization than pure order)
Natural language → 180.5° (shows conservation-like patterns)
Chaos systems → 98.1° (confirmed controlled uncertainty)

What's interesting: UCF seems to detect consistent mathematical signatures across completely different types of information - financial data, language, mathematics, physics all show distinct but predictable patterns.
The financial predictions working so consistently was unexpected.

#

multiple runs show consistent results

jaunty helm
little dawn
#

can a person with low IQ or low problem solving skills become a good data scientist by doing practice/hardwork??

bleak rampart
bleak rampart
rich moth
# bleak rampart Please elaborate what the phase angle is and what it indicates

hey @bleak rampart thanks for the question. In the UCF the 'structural phase angle θ' is designed to capture the nature or character of the internal organization and structure within a data sample. Think of it like this, while the Magnitude ∣Φ∣ tells us how much complexity or energy there is, the Phase θ tries to tell us what kind of structure is present

bleak rampart
rich moth
# bleak rampart Ohh! Thanks For a given domain the angle wouldn't be a constant value, it would...

Yeah, for any data I throw at the UCF, the structural phase isn't going to be some static, one-size-fits-all number. Every individual chunk of data – whether it's a window of an RNA sequence or a snapshot of market indicators – gets its own θ based on its unique internal structure at that moment. That's why those polar plots above show a scatter of points; each one is a distinct UCF signature.

thick heron
#

hey i have a hard time doing my project can some one look at my git maybe give me some suggestions

#

no? ( = . = ) its oky

odd tulip
thick heron
#

Just feedback will do

thick heron
#

Tts is. Not going great that's why idk how to fix idk ai is feeding me nonsense and yt doesn't help

fair solar
#

damn i should visit this channel more often, TIL abt polyloss

woven prairie
#

Hello does anyone know about guard rails

serene scaffold
woven prairie
#

LLM

#

Like which prevents hallucinations

odd tulip
# thick heron Just feedback will do

it looks good and organised but its definitely beyond my skill level. I would have liked some images but it seems you are in the process of adding them.

runic parcel
#

can anyone help me for the computer vision + ocr problem

#

I am trying to use yolo and tesseract for this project

serene scaffold
serene scaffold
glacial root
#

what's the hiring process like for computer vision internships?

serene scaffold
spring reef
#

What are some misconceptions about A.I.? I understand that it is more useful in analyzing data than it is at writing novels or creating art, but is there anything else about A.I. that I have missed?

#

Also, what is it like being a data analyst or data scientist? Is it not that bad of a career path to go into? Is it a growing career path due to the development of A.I. or is there something else to being a data scientist outside of A.I. development?

serene scaffold
glacial root
#

so no interview problems

#

like no on the spot coding

spring reef
glacial root
#

just computer vision theory questions and then questions about my past experience/projects that i have on my resume

serene scaffold
glacial root
#

i don't get how people are able to get computer vision internships the summer after freshman year of college if recruiting starts in the fall

serene scaffold
#

Like, self driving cars are not generative language models.

glacial root
#

i guess they just start learning really early

serene scaffold
glacial root
#

not much time left before recruiting season so i should probably get to work lol

glacial root
#

i saw some guy from the uni i'll be going to who is a camera perceptions intern at aptiv

#

i don't know the guy but i saw his linkedin profile, all he had was an mnist classifier project

#

and some research, which i'm not too sure if it's related to computer vision or not as it's not too clear (and i don't know anything lol)

#

but the interesting thing was, none of it was before september 2024

#

the project date was december 2024

#

so either aptiv has a really late recruiting cycle or they just don't expect much/very little competition

spring reef
serene scaffold
glacial root
#

typically what's the expectation for computer vision interns

#

or better question, what would be considered a competitive profile

glacial root
serene scaffold
spring reef
serene scaffold
spring reef
serene scaffold
#

If you have something that decides/predicts how much a house should cost based on its properties, that's the kind of thing that I'm talking about

spring reef
#

Apologies if I am asking basic questions, I am mostly unfamiliar with how software and coding works as I am a beginner at this moment of time. I am genuinely trying to understand you, but it is mostly going over my head as I have no experience hearing these terms before.

serene scaffold
#

That's okay

#

I'm at my parents house, so I can't give in depth answers atm

spring reef
serene scaffold
spring reef
#

Oh ok.

serene scaffold
#

But if you had an excel spreadsheet that calculates what you or someone else thinks the cost should be, you have to write a formula/function that calculates it in terms of the columns, right?

#

With machine learning, you have all those columns, and the actual price of the home, and the model figures out what function of the columns consistently arrives at the expected price

spring reef
#

So automation comes with machine learning?

thick heron
serene scaffold
spring reef
#

Oh ok

deep anchor
#

runic parcel
tawdry dove
#

I wanted to learn about how to detect changes in the image data , as in if any bill has a name and someone changed it. The bot should recognise that it is altered and flag it as fraud. Are there any pre built models to do this. Also what all should I know to achieve this ik the basics but any good research paper would help.

thick heron
# runic parcel idk if it will work, because my usecase is a bit different

If it's just yolo(v8) used for identification and then tesseract to do some text extraction what ever use case is keep in mind that you have to make sure that the data set you use is relevant to your thing and if you are planning to deploy then try nano and small first then move to other models if you feel this is too heavy for your deployment then consider downgrading few versions like yolo v5

woven prairie
#

Amazon also provides one that is amazing

#

Currently I am working in a project, where I have to extract all the content from ppt slides and passsed it to llm for further functionality

thick heron
woven prairie
woven prairie
#

But with text from images I want to extract the meaning of images

thick heron
#

It's free really good and has rate limits be careful with that part

woven prairie
#

If i want to extract the context of image , how this can be done

#

How this can be done

runic parcel
thick heron
runic parcel
#

but now i am trying to use tesseract to extract the text

#

i managed to get most while trying differernt preprocessing, but still cant it cant extract few things like digit 5

thick heron
thick heron
#

Or some colour

runic parcel
#

No its proper

#

properly detects 4 and 9

#

but for 5 it gives § this shit

thick heron
#

Did it give you a s instead of 5? Or completely ignored it?

thick heron
thick heron
runic parcel
#

need to use image preprocessing

#

u have any good ideas?

#

for thresholding and preropressing

#

like before it detected nothing, i tried different preprocessing and got pretty good results

thick heron
#

Did you classify each image like this folder has images of 5

runic parcel
#

wdym

thick heron
thick heron
runic parcel
#
gray = cv2.cvtColor(cropped_img, cv2.COLOR_BGR2GRAY)
gray = cv2.resize(gray, None, fx=2, fy=2, interpolation=cv2.INTER_LINEAR)

blur = cv2.GaussianBlur(gray, (5, 5), 0)

_, thresh = cv2.threshold(blur, 0, 255, cv2.THRESH_BINARY + 
cv2.THRESH_OTSU)```
#

this is my preprocessing

runic parcel
#

there is problem in ocr

#

not my dataset

thick heron
#

You turned them into a grey scale which is a good choice to avoid the colours u blurred it a bit and resized it

thick heron
thick heron
#

My system was hybrid online + offline

#

2 different softwares used

runic parcel
#

but isnt there a way to do with opesource

#

like tesseract

#

pladdleocr sutff

thick heron
#

Yea

#

Tesseract is good ocr

runic parcel
#

but having issue

#

ik its good but results are not the way i want

#

is there a way to tell it that lable will always be a number and do ocr from that way

thick heron
#

I see for the data processing part try labeling and then training on that

#

Try improving your data set specifically images of 5 and §

thick heron
#

So you are not doing text only numbers?

runic parcel
#

both text and number, but that lable only consists of numbers

#

wont be anything else except of numbers

thick heron
#

Oh wait the labels are in numbers?

runic parcel
#

no bro

#

the lable is kda

#

kda is the lable name, and inside it there will be numbers

#

like this, so there will be numbers this way

thick heron
#

I have a question you are doing these completely digital images to ocr and then extract text out of it and use the extract no physical ones? And issue is it's sometimes going to miss read 5 with § and now your pre processing method is turning the images black and white and then blurring then?

#

Am i clear on this part?

#

Kda is a label that consists of pure numbers and nothing else?

runic parcel
runic parcel
#

see i am trying to get the data from a scoreboard like this into json format like given below:

{
  "radiant_score": 15,
  "dire_score": 10,
  "teams": {
    "radiant": [
    { "player_name": "Ani", "hero": "Night Stalker", "level": 9, "gold": 489, "kda": [3, 1, 2], "ultimate": true },
    { "player_name": "Alvyy", "hero": "Templar Assassin", "level": 10, "gold": 1136, "kda": [5, 1, 5], "ultimate": true },
    { "player_name": "REDRUM", "hero": "Crystal Maiden", "level": 7, "gold": 234, "kda": [1, 4, 3], "ultimate": false },
    { "player_name": "Big Doodle", "hero": "Earthshaker", "level": 7, "gold": 2138, "kda": [1, 1, 3], "ultimate": true },
    { "player_name": "pick weak=punishment", "hero": "Ember Spirit", "level": 8, "gold": 601, "kda": [0, 4, 4], "ultimate": true }
    ],
    "dire": [
    { "player_name": "Stleip", "hero": "Tusk", "level": 6, "gold": 514, "kda": [0, 3, 6], "ultimate": false },
    { "player_name": "红双喜", "hero": "Morphling", "level": 8, "gold": 1301, "kda": [2, 1, 1], "ultimate": true },
    { "player_name": "hy not listening", "hero": "Mars", "level": 7, "gold": 788, "kda": [2, 2, 4], "ultimate": false },
    { "player_name": "xin", "hero": "Lina", "level": 6, "gold": 249, "kda": [3, 2, 2], "ultimate": false },
    { "player_name": "Love is patient, lo...", "hero": "Lion", "level": 7, "gold": 1301, "kda": [1, 6, 5], "ultimate": true }
  ]
  }
}
thick heron
#

and the kda is the issue since the numbers are missunderstood

agile pewter
#

hello im learning ml with the sckitlearn, but the tutorials i saw use the sckitlearn default databases, how can i make my own and save to use after?

#

in fact i managed to train one just don't know what to do after to store for later use

agile cobalt
woven prairie
#

Does anyone know any open source vision model that takes image input and tell what basically is image about.

agile cobalt
abstract wasp
#

Hi am I allowed to send a survey for some data collection? It’s for a project

verbal oar
#

what project to build? I want practice ml

#

dont know price prediction of electricity?

#

sth where there is data

#

I watched ml from scratch type of videos from vizuara

thick nest
#

hi, i'm think about to make a tictactoe with neural network without probabilities just linear algebra, this is possible, right?

#

i'll use ReLU and Softmax function

agile cobalt
#

neural network
without probabilities
with Softmax
what exactly do you mean by "(without) probabilities"?..

thick nest
#

I mean that I'm using softmax just to highlight the best move, the one with the highest score

#

I'll create two hidden layers with 9 neurons each. I'll use ReLU as the activation function in the hidden layers, and then a softmax layer at the end to generate a vector of scores, one for each possible move. Then I'll use argmax to pick the position with the highest score and place the X there. So the softmax helps highlight the best move, but I'm not using it for real probabilities

rich moth
fickle shale
upper niche
#

I need some information in regards to data normalization

#

is it better to normalize the data before or after splitting

#

I don't get the logic of: by exposing the data before training, you may cause data leakage,

#
    #Check if nromalizatio is requested
    #if yes 
    #   normalized X _ train and X_test, y_train and y_test
    # 
    
    print(X_train.head());
    model = LinearRegression();
    model.fit(X_train, y_train)```
#

Let's assume that pseudo code doesn't exist. Now, the test is split, the data is not normalized. the variable model is never been exposed to the normalization,right

main citrus
#

Hi guys, I have been learning ML, eda and data engineer, nlp and a bit of deep learning for 2 years
I am 16 yo
Do you think that I can get summer work in a company with that?

odd meteor
odd meteor
verbal oar
#

what solid projects, its about few lines for example with sklearn

#

model
fit
predict

past meteor
verbal oar
#

ah ok so other parts related to data science

past meteor
#

Sure, but that makes sense right? 😄

#

It's as you say, the code is so easy (.fit / .predict) they'd hire no one to do just that

verbal oar
#

makes sense, thats why high salary

#

so you must know crisp-dm or similar project cycle

#

but honestly still not too much of coding compared to some web app development where there are modules, components, rather big systems or game engine development

#

this is what attracts me to do some data science

#

not high salary but much less code

lapis sequoia
#

Made something with python

void stone
#

Hi guys, I want to break into data science and have an internship at my first year at university in summer
I am currently learning python through cs50p and I was wondering if anybody has resources I can use to be able to make relevant projects during university
I have looked up at kaggle but I am not sure
I also watched this video : https://www.youtube.com/watch?v=9R3X0JoCLyU
It helps me in direction but not necessarily in the process of learning

Go from zero to a data scientist in 12 months. This step-by-step roadmap covers the essential skills you must learn to become a data scientist in 2024.

❤️ Join this channel to get access to perks:
https://www.youtube.com/channel/UCWv7vMbMWH4-V0ZXdmDpPBA/join

Download the FREE roadmap PDF here: https://mosh.link/data-science-roadmap

✋ S...

▶ Play video
verbal oar
#

what about simplilearn data science course?

void stone
void stone
void stone