#data-science-and-ml | Python | Page 167

verbal oar May 27, 2025, 5:55 PM

#

I'm watching it currently and its about project lifecycle, descriptive stats, roadmap and so on

void stone May 27, 2025, 6:53 PM

#

verbal oar I'm watching it currently and its about project lifecycle, descriptive stats, ro...

Alright, nice

rich moth May 27, 2025, 6:54 PM

#

In the main image, its like 135-180 degrees is the goldilocks zone. What these datasets seem to have in common in most cases is they're "evolved" information systems that are subject to selection pressure. They represent stable, prsisten patterns that have survived over time and they're in the optimal learnability zone, complex enough to be interesting , simple enough to process.

Like financial markets evolved through economic selection pressures, biological sequences with 4 billion years of evolutionary optimization, language patterns evolution and communication efficiency, image structures, time series. you get it.

Then you got some outliers. Like Artifically constructed data, pathological cases or ery recent/unfiltered information but its all the wrong angle.

87.1% cross domain classification accuracy baby, the pattern is real, not random

tepid garnet May 27, 2025, 11:09 PM

#

rich moth In the main image, its like 135-180 degrees is the goldilocks zone. What these d...

it is amazing, where did you learn it? i learned about models and how it works, i am currently understanding agents, if it is possible for you to tell me how or what should i learn to be able to do that

#

i would be thankful

gloomy sun May 27, 2025, 11:10 PM

#

Hi

tepid garnet May 27, 2025, 11:11 PM

#

gloomy sun Hi

hey 👋

rich moth May 28, 2025, 12:25 AM

#

tepid garnet it is amazing, where did you learn it? i learned about models and how it works, ...

I just think of ideas I like after read about new methods or something I learn. I think about what I have in my arsenal of code and how I connect the best tools together for the right job. You get better the more stuff you make. Agents are a hot direction right now.

lapis flax May 28, 2025, 1:57 AM

#

Anyone knowledgeable on how to use the subset dataloader in pytorch to just get three classes from the fashionMNIST dataset? I’ve been running around in the documentation and trying things but nothing is clear, and the best way I’ve found is someone else’s method that directly uses the full dataset from its github repo and more ‘manually’ extracts the three classes i’m interested in.

#

Re: the subset dataloader, I would want to use a boolean mask (or whatever equivalent form of indexing) to just grab all instances of T-shirts, coats, and shirts, and then do my thing from there

stuck belfry May 28, 2025, 2:10 AM

#

Hey gang, How do i practice data science? I wanna invest my time into learning numpy and sci-py and matplotlib but i have nothing to apply myself towards or any data to mess with, any pointers?

serene scaffold May 28, 2025, 2:12 AM

#

stuck belfry Hey gang, How do i practice data science? I wanna invest my time into learning n...

the most important thing is that you don't try to learn in terms of libraries. if you go into it thinking "I'm going to learn numpy and then I'll learn pandas and then I'll learn scikit-learn" you won't get anywhere.

focus on learning concepts, and use whatever libraries you need to implement something that applies that concept.

stuck belfry May 28, 2025, 2:13 AM

#

serene scaffold the most important thing is that you don't try to learn in terms of libraries. i...

thank you for your response, but knowing this i'm not really sure of where else to go from here, do you suggest a data science book like the one openstax offers? https://openstax.org/details/books/principles-data-science

OpenStax | Free Textbooks Online with No Catch

OpenStax offers free college textbooks for all types of students, making education accessible & affordable for everyone. Browse our list of available subjects!

serene scaffold May 28, 2025, 2:14 AM

#

stuck belfry thank you for your response, but knowing this i'm not really sure of where else ...

I've never heard of openstax

#

!resources data science

arctic wedgeBOT May 28, 2025, 2:14 AM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

serene scaffold May 28, 2025, 2:14 AM

#

@stuck belfry you can look at this curated list ^

stuck belfry May 28, 2025, 2:15 AM

#

wow this looks sweet, thank you again

serene scaffold May 28, 2025, 2:15 AM

#

also, as a reality check, keep in mind that it's very unlikely that you'd be able to get a job in data science without a degree, so if you're trying to work in this space, you need to do that.

stuck belfry May 28, 2025, 2:15 AM

#

serene scaffold also, as a reality check, keep in mind that it's very unlikely that you'd be abl...

lol i'm aware, i'm learning it for my own self interest

serene scaffold May 28, 2025, 2:15 AM

#

stuck belfry lol i'm aware, i'm learning it for my own self interest

that's good. I think people should always learn about things that interest them, even if there isn't a profit motive.

stuck belfry May 28, 2025, 2:15 AM

#

i'm going to uni starting this august though, gives me time to refine myself before i go into the workforce with data science

#

or whatever else there might be, i think stuff like numpy is foundational to whatever other applications python has in mathematics as a whole

serene scaffold May 28, 2025, 2:17 AM

#

"data science python" is like its own dialect of python, and many of the distinguishing features of that dialect come from numpy.

silk hull May 28, 2025, 2:17 AM

#

frfr

#

python is such a diverse language

serene scaffold May 28, 2025, 2:18 AM

#

silk hull frfr

hello and welcome to our wonderful data science chat

silk hull May 28, 2025, 2:18 AM

#

hello

#

Im learning data science w python and just data science in general

#

it's soo cool

stuck belfry May 28, 2025, 2:33 AM

#

serene scaffold "data science python" is like its own dialect of python, and many of the disting...

i'll be reading the oreilly textbook mentioned on the list through the link you gave, i'll make sure to pass by here every once in a while to leave some hints on my progress, thank you again!

glacial root May 28, 2025, 3:26 AM

#

serene scaffold I've never heard of openstax

pretty much just a website with free online textbooks, i think it's mainly useful for high school students though

twin relic May 28, 2025, 7:15 AM

#

Hi, how would you advise someone who wants to be an AI Engineer and is considering switching from a full-stack developer? And how would you recommend building the resume to land full-time jobs?

twin relic May 28, 2025, 7:23 AM

#

twin relic Hi, how would you advise someone who wants to be an AI Engineer and is consideri...

I have an intermediate skill level with the MERN stack. I am trying to maximize my chance of landing a job, but I am worried I cannot specialize in any one field. I am anxious about the thought of not pursuing a field and then regretting later for not picking it earlier

frigid sonnet May 28, 2025, 7:26 AM

#

Hey guys

verbal oar May 28, 2025, 10:37 AM

#

how it looks is it used R with python or
R or python?

#

so should I focus more on python or R?

jaunty helm May 28, 2025, 12:21 PM

#

verbal oar so should I focus more on python or R?

R is better if you're doing good ol' statistics, otherwise I'd just use python
e.g. semopy's definitely behind lavaan by quite a margin

#

(and the former's not even being updated anymore)

verbal oar May 28, 2025, 12:25 PM

#

thanks, makes sense

#

R and Python are both popular programming languages for data analysis, but they serve different purposes. R is primarily focused on statistical analysis and data visualization, while Python is a general-purpose language with a broader range of applications, including web development and machine learning. Python is often considered easier to learn and more versatile, while R is favored for its specialized statistical capabilities.

#

not exactly true r is for ml too

#

source google search (probably gemini)

jaunty helm May 28, 2025, 12:41 PM

#

verbal oar not exactly true r is for ml too

I don't know anyone who would choose R over python when it comes to ML

zinc patrol May 28, 2025, 1:24 PM

#

guys halp me

#

is my network dying?

#

import numpy as np
import nnfs
from nnfs.datasets import spiral_data

nnfs.init()

np.random.seed(0)

X = [[1, 2, 3, 2.5],
     [2.0, 5.0, -1.0, 2.0],
     [-1.5, 2.7, 3.3, -0.8]]

X, y = spiral_data(100, 3)

class Layer_Dense:
    def __init__(self, n_inputs, n_neurons):
        # defining layers
        self.weights = 0.10 * np.random.randn(n_inputs, n_neurons)
        self.biases = np.zeros((1, n_neurons))
    def forward(self, inputs):
        # dot product
        self.output = np.dot(inputs, self.weights) + self.biases

class Activation_ReLU:
    def forward(self, inputs):
        self.output = np.maximum(0, inputs)


layer1 = Layer_Dense(2, 5)
#layer2 = Layer_Dense(5, 2)

activation1 = Activation_ReLU()

layer1.forward(X)
print(f"layer output: {layer1.output}")

activation1.forward(layer1.output)
print(f"layer output after ReLU: {activation1.output}")
#layer2.forward(layer1.output)

#print(layer2.output) # should be [[ 0.148296   -0.08397602]# [ 0.14100315 -0.01340469]# [ 0.20124979 -0.07290616]]'''

serene scaffold May 28, 2025, 1:24 PM

#

Neural networks don't "die"

zinc patrol May 28, 2025, 1:25 PM

#

output -

   0.00000000e+00]
 [-8.35815910e-04 -7.90404272e-04 -1.33452227e-03  4.65504505e-04
   4.56846210e-05]
 [-2.39994470e-03  5.93469958e-05 -2.24808278e-03  2.03573116e-04
   6.10024377e-04]
 ...
 [ 1.13291524e-01 -1.89262271e-01 -2.06855070e-02  8.11079666e-02
  -6.71350807e-02]
 [ 1.34588361e-01 -1.43197834e-01  3.09493970e-02  5.66337556e-02
  -6.29687458e-02]
 [ 1.07817926e-01 -2.00809643e-01 -3.37579325e-02  8.72561932e-02
  -6.81458861e-02]]
layer output after ReLU: [[0.00000000e+00 0.00000000e+00 0.00000000e+00 0.00000000e+00
  0.00000000e+00]
 [0.00000000e+00 0.00000000e+00 0.00000000e+00 4.65504505e-04    
  4.56846210e-05]
 [0.00000000e+00 5.93469958e-05 0.00000000e+00 2.03573116e-04
  6.10024377e-04]
  6.10024377e-04]
 ...
 ...
 [1.13291524e-01 0.00000000e+00 0.00000000e+00 8.11079666e-02
 [1.13291524e-01 0.00000000e+00 0.00000000e+00 8.11079666e-02
  0.00000000e+00]
 [1.34588361e-01 0.00000000e+00 3.09493970e-02 5.66337556e-02
  0.00000000e+00]
  0.00000000e+00]
 [1.07817926e-01 0.00000000e+00 0.00000000e+00 8.72561932e-02
 [1.07817926e-01 0.00000000e+00 0.00000000e+00 8.72561932e-02
  0.00000000e+00]]
  0.00000000e+00]]```

#

thats a scary amount of zeros im seeing there

#

oh STELERCUS

#

boy am i glad to see you

serene scaffold May 28, 2025, 1:32 PM

#

why?

#

I'm a bastard.

#

@zinc patrol why do you think so many of the outputs end up being 0?

zinc patrol May 28, 2025, 1:37 PM

#

i think its because of the ReLU, but im not sure.

serene scaffold May 28, 2025, 1:38 PM

#

zinc patrol i think its because of the ReLU, but im not sure.

right. the ReLU clips everything below 0

#

you might try a different activation function.

zinc patrol May 28, 2025, 1:38 PM

#

so i WAS right

weary timber May 28, 2025, 1:38 PM

#

hello guyssss

zinc patrol May 28, 2025, 1:38 PM

#

TAKE THAT RELU

weary timber May 28, 2025, 1:38 PM

#

discord still banned in turkey 😢

zinc patrol May 28, 2025, 1:38 PM

#

oops sorry gota bit carried away

weary timber May 28, 2025, 1:38 PM

#

use sigmoid

#

if youre doing mnist

zinc patrol May 28, 2025, 1:38 PM

#

serene scaffold you might try a different activation function.

I'll try sigmoid

zinc patrol May 28, 2025, 1:38 PM

#

weary timber if youre doing mnist

im not

#

im using nnfs

weary timber May 28, 2025, 1:38 PM

#

zinc patrol im not

which set

zinc patrol May 28, 2025, 1:39 PM

#

but thanks for the idea

weary timber May 28, 2025, 1:39 PM

#

whats that

zinc patrol May 28, 2025, 1:39 PM

#

i think ill use hugging face instead tho

zinc patrol May 28, 2025, 1:39 PM

#

weary timber whats that

some random dataset i found online

weary timber May 28, 2025, 1:39 PM

#

what is nnfs bro

#

the full name

jaunty helm May 28, 2025, 1:39 PM

#

zinc patrol I'll try sigmoid

there's also leaky relu

zinc patrol May 28, 2025, 1:39 PM

#

weary timber the full name

neural networks from scratch

zinc patrol May 28, 2025, 1:40 PM

#

jaunty helm there's also leaky relu

I'll try that too

weary timber May 28, 2025, 1:40 PM

#

zinc patrol neural networks from scratch

oh okay

jaunty helm May 28, 2025, 1:40 PM

#

sigmoid fell out of favor in the first place cause vanishing gradients

river cape May 28, 2025, 2:59 PM

#

hi guys so I want to build a ai website builder , where a user gives a prompt and the ai responds back with with necessary code . I want to use open source LLMs which are good for code-generation. Could y'll suggest me some LLMs that are good for this purpose , and also the parameters they have

verbal oar May 28, 2025, 3:04 PM

#

you can visualize instead of staring at numbers

verbal oar May 28, 2025, 3:04 PM

#

zinc patrol output - ```layer output: [[ 0.00000000e+00 0.00000000e+00 0.00000000e+00 0....

.

#

not comfortable way currently

#

I assume just plot functions output, after relu applied

jaunty helm May 28, 2025, 3:21 PM

#

river cape hi guys so I want to build a ai website builder , where a user gives a prompt an...

depends on your hardware
nowadays LLMs come in all sizes

river cape May 28, 2025, 3:21 PM

#

jaunty helm depends on your hardware nowadays LLMs come in all sizes

I have 16gb ram and 4gb vram

jaunty helm May 28, 2025, 3:22 PM

#

river cape I have 16gb ram and 4gb vram

4gb vram isn't a lot so try something like Qwen 3 <4b models

#

use a quantized version so you can fit it in vram

river cape May 28, 2025, 3:23 PM

#

jaunty helm 4gb vram isn't a lot so try something like Qwen 3 <4b models

I am trying to use the deepseek coder v2 lite

#

Does q4 help?

jaunty helm May 28, 2025, 3:25 PM

#

river cape Does q4 help?

if you can stand 10+ seconds per token generation (and potentially longer), then sure
use the GGUF format so you can put your ram to use

river cape May 28, 2025, 3:25 PM

#

jaunty helm if you can stand 10+ seconds per token generation (and potentially longer), then...

Ohhh okay btw where do I learn the hardware for this? do I have serach llm hardware

jaunty helm May 28, 2025, 3:26 PM

#

river cape Ohhh okay btw where do I learn the hardware for this? do I have serach llm hardw...

you can usually estimate from parameter size
like deepseek coder v2 lite is 16b parameters, so if each of those parameters takes 4 bits (half a byte) you'd need around 8gb memory

#

some more for the context
you probably don't want to use Q4_0 nor Q4_1 but something like Q4_K_M, which technically is higher than 4 bits on average

agile cobalt May 28, 2025, 3:28 PM

#

you also need to take into consideration the memory required for the inputs and outputs though, which varies depending on the context window size

for generating an entire website you'd need of a pretty large output

river cape May 28, 2025, 3:30 PM

#

jaunty helm you can usually estimate from parameter size like deepseek coder v2 lite is 16b ...

Got it , but why not q8 , as it would take the same amount of memory

jaunty helm May 28, 2025, 3:30 PM

#

river cape Got it , but why not q8 , as it would take the same amount of memory

q8 would take 2x the memory as q4
or in other words, 16b model at q8 takes about 16gb memory just for the model itself

river cape May 28, 2025, 3:31 PM

#

jaunty helm q8 would take 2x the memory as q4 or in other words, 16b model at q8 takes about...

Ohhhh okayyy I get it

#

So its not possible to get good results with a open-source LLM?

#

Considering my architecture

jaunty helm May 28, 2025, 3:32 PM

#

river cape So its not possible to get good results with a open-source LLM?

well I mean I suggested the Qwen 3 series
they're overfit AF and write super dry, but you don't care about that when you're just gonna code with it

river cape May 28, 2025, 3:33 PM

#

jaunty helm well I mean I suggested the Qwen 3 series they're overfit AF and write super dry...

Trueeeee , is it okay if use a code-specific LLM like codeLlama , wizardcoder

jaunty helm May 28, 2025, 3:34 PM

#

river cape Trueeeee , is it okay if use a code-specific LLM like codeLlama , wizardcoder

the ones you named here are highly out of date, so I wouldn't really consider them

river cape May 28, 2025, 3:34 PM

#

jaunty helm the ones you named here are highly out of date, so I wouldn't really consider th...

Ohhh got it

jaunty helm May 28, 2025, 3:34 PM

#

like I wouldn't be surprised if Qwen 3 4b beats wizardcoder 15b at this point

river cape May 28, 2025, 3:34 PM

#

The recent one would be qwen and deepseek?

jaunty helm May 28, 2025, 3:35 PM

#

river cape The recent one would be qwen and deepseek?

the ones that are probably good at coding, yes

#

llama4 kinda flopped from what I hear

agile cobalt May 28, 2025, 3:36 PM

#

jaunty helm llama4 kinda flopped from what I hear

even before thinking about performance, it's outright impossible to run it in a consumer grade GPU

#

-# well, might be possible with a ridiculously bad performance, but the size is just absurd

river cape May 28, 2025, 3:37 PM

#

jaunty helm llama4 kinda flopped from what I hear

Ohhh that's bad , anyways on my way to download qwen and deepseek thanks

river cape May 28, 2025, 3:38 PM

#

agile cobalt -# well, might be possible with a ridiculously bad performance, but the size is ...

How huge is it?

agile cobalt May 28, 2025, 3:38 PM

#

over 100B total parameters

jaunty helm May 28, 2025, 3:40 PM

#

agile cobalt even before thinking about performance, it's outright impossible to run it in a ...

well it's not the worst in the world due to it being an MoE, so CPU performance will be somewhat bearable on say a mac or ddr5 ram
for individuals though it's probably still better to just use a smaller, dense model

river cape May 28, 2025, 3:40 PM

#

it says likely too large

#

Does it mean , i dont have enough disk space or i dont have enough ram to run it

jaunty helm May 28, 2025, 3:41 PM

#

river cape it says likely too large

yeah that's 32b, and that's qwen2.5
I said Qwen 3 which come in various sizes; something <4b is probably good for your setup

agile cobalt May 28, 2025, 3:41 PM

#

river cape it says likely too large

I feel like you're unlikely to get good results with complex prompts from anything lower than that tbh

river cape May 28, 2025, 3:42 PM

#

jaunty helm yeah that's 32b, and that's qwen2.5 I said [Qwen 3](https://huggingface.co/colle...

I got that , but is it showing that likely too large , bcoz of my ram?

river cape May 28, 2025, 3:43 PM

#

agile cobalt I feel like you're unlikely to get good results with complex prompts from anythi...

so 32B parameter model is like the bottleneck?

jaunty helm May 28, 2025, 3:43 PM

#

one of the strongest contenders for "low" param coding is honestly probably still QwQ 32b

jaunty helm May 28, 2025, 3:44 PM

#

river cape I got that , but is it showing that likely too large , bcoz of my ram?

or vram, idk I don't use lm studio
either way it's showing you a Q2 quant and I doubt it'll be coherent even if you could run it

river cape May 28, 2025, 3:46 PM

#

jaunty helm or vram, idk I don't use lm studio either way it's showing you a Q2 quant and I ...

I do have the choice to select q4 km

jaunty helm May 28, 2025, 3:46 PM

#

river cape I do have the choice to select q4 km

yeah and I doubt you can reasonably run that

river cape May 28, 2025, 3:47 PM

#

jaunty helm yeah and I doubt you can reasonably run that

I seee , looks like I need to switch to APIs most probably

heavy crow May 28, 2025, 3:56 PM

#

are hard edges hard for 1d convlolutions? i.e when working with (simulated) LiDAR the data look like in the attached picture. It is rather high resolution but very sharp. (also attached a closeup)

green pilot May 28, 2025, 3:59 PM

#

Hey woundering if any one knows if its faster to do simple data cleaning with bash in the command line or with pandas ? Its looking like unix might be faster but woundering if anyone has a suggestion as right now im using chuks and setting it to 500_000 and my large files are taking a long time to clean. Im just removing the header and removing ^ symbols

jaunty helm May 28, 2025, 3:59 PM

#

heavy crow are hard edges hard for 1d convlolutions? i.e when working with (simulated) LiDA...

hm... wdym by "hard" exactly?
there are kernels that can detect edges

heavy crow May 28, 2025, 4:00 PM

#

just significantly making it harder for the model to converge

#

or requiring more parameters / larger kernel sizes to capture meaningful features

#

im trying to overfit on this "clean" simulated data before moving to realworld data but am running into real problems fitting the model

jaunty helm May 28, 2025, 4:02 PM

#

green pilot Hey woundering if any one knows if its faster to do simple data cleaning with ba...

maybe?
sed 's/\^//g' data.txt > cleaned.txt
this simply removes all ^, you can try if it's faster
for the header, depends on if you can reliably detect a header ig

jaunty helm May 28, 2025, 4:05 PM

#

heavy crow im trying to overfit on this "clean" simulated data before moving to realworld d...

I'm not so sure on the convergence difficulty, sry

green pilot May 28, 2025, 4:06 PM

#

Yeah i was thinking or running something like that would prefer to keep the pipeline all in python but i think running it like that might be faster. Right now i am pulling from aws bucket as csv -> coverting to txt -> cleaning with bash -> then gzip scp to a new server -> unzip and load into a netezza server . I really wish i could simplify its pipeline 😂😂😂

heavy crow May 28, 2025, 4:09 PM

#

jaunty helm I'm not so sure on the convergence difficulty, sry

all good 🙂 currently fighting with mode collapse anyways haha. probably some other underlying problem

jaunty helm May 28, 2025, 4:14 PM

#

heavy crow all good 🙂 currently fighting with mode collapse anyways haha. probably some ot...

maybe you could try high dilation since you said it's high resolution?

#

so the kernels could see more of the sequence at once without increasing the parameter count by too much

heavy crow May 28, 2025, 4:15 PM

#

Good idea

lapis flax May 28, 2025, 4:24 PM

#

lapis flax Anyone knowledgeable on how to use the subset dataloader in pytorch to just get ...

bumping this in case anyone has an answer

heavy crow May 28, 2025, 4:31 PM

#

jaunty helm maybe you could try high dilation since you said it's high resolution?

Wow. I was already using dialation but only with a factor of 2-6 or so. Looking at the data my features were way larger! (60 datapoints or so for one "feature" / peak or whatever). I've downsampled by a factor of 10 because i really dont need the resolution for these tests and its training much better! Now the dialation actually covers the features

heavy crow May 28, 2025, 5:39 PM

#

@jaunty helm Went from this

#

to

#

ignore the wonky GT data distributions. Thats from my simulation.

#

all that with under 300k params 🙂 definitely viable and a solvable problem i can invest some more time in.

woven prairie May 28, 2025, 5:51 PM

#

If i want to learn maths and stats for data science which is the best option

wintry pagoda May 28, 2025, 6:03 PM

#

I want to create an AI personal assistant using python. I asked Chat gpt to draft me an outline for the process but I am still lost. Where do I start?

dusty forge May 28, 2025, 6:23 PM

#

👋 Hey everyone! Imagine trying to understand healthcare data privacy rules, but it feels like wandering in a confusing forest. That’s why I built MediRAG Guard.
MediRAG Guard is special because It doesn’t just search for keywords, it understands how pieces of information are connected using a unique hierarchical Context Tree. It uses Python, Groq, LangChain and ChromaDB. This helps it give you clearer, more accurate answers.
It’s like having a guide in the forest, showing you the way! 🌳
Check out the demo and see how it works: https://github.com/pr0mila/MediRag-Guard

GitHub

GitHub - pr0mila/MediRag-Guard: MediRAG Guard: A RAG Proof of Conce...

MediRAG Guard: A RAG Proof of Concept that delivers comprehensive, context-aware insights on healthcare data privacy through a novel knowledge tree. - pr0mila/MediRag-Guard

rich moth May 28, 2025, 10:55 PM

#

lapis flax Anyone knowledgeable on how to use the subset dataloader in pytorch to just get ...

Sure I can help!

rich moth May 28, 2025, 11:02 PM

#

lapis flax Anyone knowledgeable on how to use the subset dataloader in pytorch to just get ...

whats the class mapping look like? What one is it exactly theres small medium large

lapis flax May 28, 2025, 11:33 PM

#

rich moth whats the class mapping look like? What one is it exactly theres small medium la...

i'm not sure i understand what you're asking?

opaque condor May 28, 2025, 11:34 PM

#

What's the next step after convelution nets?

lapis flax May 28, 2025, 11:56 PM

#

here is the meat of the network

#

here are the initializations for the weight and bias terms

serene scaffold May 29, 2025, 12:13 AM

#

@lapis flax it's easier for people when you give all the code as text. not as a screenshot.

#

!code

arctic wedgeBOT May 29, 2025, 12:13 AM

#

Formatting code on Discord

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

For long code samples, you can use our pastebin.

opaque condor May 29, 2025, 12:19 AM

#

I'm working on combination on their own networks but what's after convolution?

lapis flax May 29, 2025, 12:24 AM

#

opaque condor I'm working on combination on their own networks but what's after convolution?

are you asking me this or just asking in general?

opaque condor May 29, 2025, 12:36 AM

#

Just in general cuz I mean a convolutional neural network I know how I can give it different image I just have to put in the folder name or at least the folders with their labeled and then train until there's a specific accuracy that I'm loving granted I haven't implemented a threshold yet I've been meaning to get that done but I've been busy planning a few hundred things

rich moth May 29, 2025, 12:56 AM

#

lapis flax i'm not sure i understand what you're asking?

about the MINST fashion dataset

rich moth May 29, 2025, 1:04 AM

#

lapis flax here is the meat of the network

#

Does that help?

lapis flax May 29, 2025, 1:19 AM

#

rich moth

yes this exactly is what i was wondering about

#

im away from comp but will take a look later, do you know if this is covered in any of the torch documentation tutorials?

#

either way, thank you so much!!

opaque condor May 29, 2025, 1:32 AM

#

lapis flax are you asking me this or just asking in general?

Just in general

lapis flax May 29, 2025, 1:32 AM

#

opaque condor Just in general

i commonly see some kind of max pooling layer in convolutional neural nets

opaque condor May 29, 2025, 1:37 AM

#

So audio or video labeler

halcyon fog May 29, 2025, 2:01 AM

#

what would be the best way to detect an image? im thinking of using cv2 but since the object im trying to detect is a card with text and no images and it seems to detect based on noise, idk if it will work

agile cobalt May 29, 2025, 2:06 AM

#

depends on what you're trying to detect?
some OCR engines can be used to identify text bounding boxes

halcyon fog May 29, 2025, 2:07 AM

#

its like, a bunch of these things, im making an ai that plays a roblox game for me, im trying to win a bet with my friends

#

the issue im running into is that they all look the exact same, except for different colours and text

red shadow May 29, 2025, 3:23 AM

#

hello, can someone please help me with the coding🥺 it's more on django and type reaact🥺

grand minnow May 29, 2025, 3:33 AM

#

red shadow hello, can someone please help me with the coding🥺 it's more on django and type...

That sounds more like a #web-development type of question. If you could share it in that channel with your code and error, someone there might help

red shadow May 29, 2025, 3:33 AM

#

thank youu! i appreciate that🫶

#

i will do it rn

glacial root May 29, 2025, 6:12 AM

#

would a custom image processing library (kind of like PIL) be a good project for a computer vision resume

limpid zenith May 29, 2025, 6:24 AM

#

glacial root would a custom image processing library (kind of like PIL) be a good project for...

it would definitely indicate you know the CV algorithms yeah

void stone May 29, 2025, 1:48 PM

#

Has anyone ever taken this course ?
https://www.udemy.com/course/python-coding/?srsltid=AfmBOopm4dzFrPHU-owY2JQZd_YpaJsjwoca1POPrdMjCt0iPq8aS8Xm&couponCode=ST19MT280525G1
I just want to have some opinions about it so that I can decide if I should purchase this course or not

manic lion May 29, 2025, 2:10 PM

#

Is machine learning use python? i am master yoda

#

or machine learning is pure math

void stone May 29, 2025, 2:15 PM

#

From my understanding so far, machine learning is a combination of math (especially statistics and linear algebra) and programming (popular languages include python, c++ that you will combine with R, sql, Scikit-Learn, etc.)

serene scaffold May 29, 2025, 2:16 PM

#

void stone From my understanding so far, machine learning is a combination of math (especia...

machine learning is an application of theoretical math and you implement the algorithms that do it in Python.

#

you mixed up a few things that are orthogonal. R is a programming language for data analysis. SQL is a query language for tabular data. scikit-learn is a python library for statistical machine learning (ie not neural networks).

verbal oar May 29, 2025, 2:53 PM

#

but in scikit-learn you have perceptron

calm thicket May 29, 2025, 3:07 PM

#

sklearn implements very basic neural networks, but that's it

void stone May 29, 2025, 3:10 PM

#

serene scaffold you mixed up a few things that are orthogonal. R is a programming language for d...

Alright, thanks for the clarification, I am still new to it
What you guys think of this : https://www.udemy.com/course/python-coding/?srsltid=AfmBOopm4dzFrPHU-owY2JQZd_YpaJsjwoca1POPrdMjCt0iPq8aS8Xm&couponCode=ST19MT280525G1
I'm not sure if I should buy it or not ?

orchid moat May 29, 2025, 3:11 PM

#

is shiny posit usefull for beginner?

agile cobalt May 29, 2025, 3:17 PM

#

iirc it's popular in R, but not so much for Python

there are Streamlit, Gradio, Dash and many other similar libraries that have widely used around in the python ecosystem for longer, but I guess Shiny also works

#

arguably even Marimo could count as an alternative for it

There are a lot of options, if you want to make a dashboard or extremely simple web app they are all useful regardless of your level of experience, which one you pick is mostly a personal preference

serene grail May 29, 2025, 4:25 PM

#

It seems like it's all about deep neural networks nowadays, are there any tasks where something else is used for the state of the art?

lapis flax May 29, 2025, 4:32 PM

#

i mean, large language models are sorta what a lot of researchers have turned their attention to

serene scaffold May 29, 2025, 4:33 PM

#

lapis flax i mean, large language models are sorta what a lot of researchers have turned th...

there's a lot going on other than large language models.

lapis flax May 29, 2025, 4:37 PM

#

that’s definitely true, perhaps i should specify that in my little corner of the machine learning and ai world, LLM’s are like the ‘hot new thing’ that a lot of people are thinking about and playing around with

verbal oar May 29, 2025, 6:08 PM

#

attention pun not intended

rich moth May 29, 2025, 7:45 PM

#

wrong channe sorry :\

royal talon May 29, 2025, 9:12 PM

#

what are most important topic/s to in ML/Data science?

serene scaffold May 29, 2025, 10:00 PM

#

royal talon what are most important topic/s to in ML/Data science?

Statistics

rich moth May 30, 2025, 1:14 AM

#

You guys wanna see some cool images. I finally got the validation suite done for UCF..

#

93.1% on Cross domain classifcation

hexed maple May 30, 2025, 4:14 AM

#

guys where can i find pretrained LSTM RNN for time series data

opaque sphinx May 30, 2025, 8:36 AM

#

for self learners of data science, where do you guys go online to learn (free materials).
Been wanting to go into this hence choosing this as an elective course for my degree, but after finishing the course I realize I still got so much more to learn as when I go through other people's project I barely understood anything, so wanna go into it more on my own but courses on coursera and edx are not free

serene grail May 30, 2025, 8:59 AM

#

opaque sphinx for self learners of data science, where do you guys go online to learn (free ma...

Kaggle is pretty good
I also like the statquest YouTube channel for learning statistics

verbal oar May 30, 2025, 9:01 AM

#

I see statquest channel is statquest with (name of person which I forgot)?

serene grail May 30, 2025, 9:04 AM

#

verbal oar I see statquest channel is statquest with (name of person which I forgot)?

statquest with Josh Starmer, yeah
You can search "statquest playlist" on YouTube and his playlists will just come up, I think they're organized pretty well

jaunty helm May 30, 2025, 9:15 AM

#

serene grail It seems like it's all about deep neural networks nowadays, are there any tasks ...

it's because it's "easy" as you can just make it bigger™️ and use more data™️ and usually it'll be close to the top (while using a lot of compute)
but take time series for example, I'm pretty sure the general consensus on those is still "for each problem, the best model varies"

serene grail May 30, 2025, 9:16 AM

#

jaunty helm it's because it's "easy" as you can just make it bigger™️ and use more data™️ an...

That makes sense
I should look into time series then, sounds interesting

opaque sphinx May 30, 2025, 9:27 AM

#

serene grail Kaggle is pretty good I also like the statquest YouTube channel for learning sta...

Wait there’s roadmap/tutorials on kaggle? I’ve always thought kaggle was only for providing datasets

jaunty helm May 30, 2025, 9:27 AM

#

serene grail That makes sense I should look into time series then, sounds interesting

and also there's this that says trees still outperform nns on tabular data, though it's a lil old

serene grail May 30, 2025, 9:27 AM

#

opaque sphinx Wait there’s roadmap/tutorials on kaggle? I’ve always thought kaggle was only fo...

kaggle.com/learn is the link I think, they have mini-courses that teach you stuff

opaque sphinx May 30, 2025, 9:27 AM

#

Damn ok will take a look

zinc marsh May 30, 2025, 9:28 AM

#

https://www.youtube.com/watch?v=KI1aNYDFYEI&ab_channel=TickerSymbol%3AYOU

YouTube

Ticker Symbol: YOU

NVIDIA CEO Jensen Huang Leaves Everyone SPEECHLESS (Supercut)

Highlights from the latest #nvidia keynote at COMPUTEX 2025. Topics include @NVIDIA's MASSIVE new Blackwell Ultra GPUs, new products like DGX Spark, DGX Station, and RTX PRO Server, and how they'll power generative AI models like #chatgpt by #openai and #deepseek R1, reshaping artificial intelligence and computing as we know it. Based on all the...

▶ Play video

serene grail May 30, 2025, 9:28 AM

#

jaunty helm and also there's [this](https://arxiv.org/abs/2207.08815) that says trees still ...

Cool, I just started learning random forests so that's exciting!

opaque sphinx May 30, 2025, 9:28 AM

#

My uni didn’t teach stuffs like time series, MLOps, NLP and so on, saw these things on forums and I have no idea how they work

#

Or even they existed

opaque sphinx May 30, 2025, 9:29 AM

#

serene grail Cool, I just started learning random forests so that's exciting!

Thanks sir lizard

real venture May 30, 2025, 12:32 PM

#

is there a subreddit to promote and sell my python app as crypto
i made AIO pc usage tracker ap from ai code editor called cursor

#

sorry if its wrong channel...

cedar tusk May 30, 2025, 1:14 PM

#

opaque sphinx My uni didn’t teach stuffs like time series, MLOps, NLP and so on, saw these thi...

when did uni teach any important stuff related to the industry?

#

no one teaches databases properly, no one teaches any kind of technologies that are used heavily (docker for example)

#

schools are obsolete af

#

no one teaches taxes, no one teaches insurances...

#

then when people exit the schools they gotta find these all by themselves, whats the point of schooling then? Is shakespeare more important than how to live in the society?

#

bah

#

rant over

jaunty helm May 30, 2025, 2:01 PM

#

cedar tusk when did uni teach any important stuff related to the industry?

well people originally went to uni to learn the theoretical stuff for research
eventually every job started to require a degree though :[

verbal oar May 30, 2025, 2:11 PM

#

I had in the past course in machine learning but it was in R not in python, dont know why this was decision for uni course

cedar tusk May 30, 2025, 2:12 PM

#

verbal oar I had in the past course in machine learning but it was in R not in python, dont...

i assume it was theoretical

#

its fine if thats the case

verbal oar May 30, 2025, 2:12 PM

#

I suppose teacher has knowledge of R

cedar tusk May 30, 2025, 2:12 PM

#

the algos dont change

cedar tusk May 30, 2025, 2:12 PM

#

verbal oar I suppose teacher has knowledge of R

most probably

verbal oar May 30, 2025, 2:12 PM

#

theoretical and practical (labs)

cedar tusk May 30, 2025, 2:13 PM

#

i dk man i feel like suffocating in my current masters degree

#

all these theories, no usecases whatsoever majority of the time

verbal oar May 30, 2025, 2:13 PM

#

yes they teach without context

fickle shale May 30, 2025, 2:25 PM

#

I can buy one which one i need to buy first
Hands-On Large Language Models: Language Understanding and Generation
or
AI Engineering: Building Applications with Foundation Models ?

opaque condor May 30, 2025, 2:47 PM

#

Hello could I please have tips on training a small language model

verbal oar May 30, 2025, 3:05 PM

#

can I have custom theme instead of default light theme in jupyter notebook?

#

dont like these green with gray this is ugly for me

#

dont know vs code with jupyter notebook maybe

#

what color scheme do you have for ai/ml?

#

I'm talking about readability and beuaty also

#

ok first question is googleable rather

cedar tusk May 30, 2025, 5:41 PM

#

verbal oar can I have custom theme instead of default light theme in jupyter notebook?

use vs code

#

infinitely better

wet dome May 30, 2025, 11:14 PM

#

Can anyone recommend a machine learning book for a beginner? I'm thinking about either introduction to statical learning or machine learning with python

zinc patrol May 31, 2025, 3:51 AM

#

hey guys

zinc patrol May 31, 2025, 3:52 AM

#

wet dome Can anyone recommend a machine learning book for a beginner? I'm thinking about ...

I'm not sure books are the best for learning machine learning and neural nets, but if you really want a book, I would advice "Neural Networks from Scratch in python" (aka nnfs)

zinc patrol May 31, 2025, 3:53 AM

#

cedar tusk infinitely better

thats an understatement

#

jupyter is too overrated

zinc patrol May 31, 2025, 3:55 AM

#

wet dome Can anyone recommend a machine learning book for a beginner? I'm thinking about ...

it would also be good if you saw some tutorials on youtube and pay close attention

fair solar May 31, 2025, 7:35 AM

#

i remember wanting to read nnfs 3 years ago

#

except my uni covered ML as a course the next sem and NN from scratch in python was literally an assignment

#

feel the maths/algorithm is more important than "python/implementation"
as long as you're decently confident with one programming language, you'll be able to do it from scratch

odd meteor May 31, 2025, 10:34 AM

#

opaque condor Hello could I please have tips on training a small language model

Do you mean training yours from scratch or fine-tuning an already pretrained SLM?

odd meteor May 31, 2025, 10:46 AM

#

cedar tusk i dk man i feel like suffocating in my current masters degree

You just have brave the storm. Find ways to make it fun. I recently saw a video on YouTube where a guy was complaining of the same thing.

He enrolled for an AI masters program in Germany thinking he'll be doing Neural Nets and all that fun stuff but his first semester was just filled with mathematics, statistical proofs 😅

You must combined the applied side of ML (perhaps by learning and building cool stuff on your own) with the theoretical part majorly covered in technischule.

I think it's kinda cool in Germany cos they have Technical Universities (more suited for people interested in Research & PhD) and School of Applied science (for people interested in the engineering part of ML)

opaque condor May 31, 2025, 10:54 AM

#

odd meteor Do you mean training yours from scratch or fine-tuning an already pretrained SLM...

Training one from scratch

odd meteor May 31, 2025, 11:08 AM

#

opaque condor Training one from scratch

I'm going to presume you've sorted out the data collection part already or you have the means achieve that.

Read this https://lelapa.ai/inkubalm-a-small-language-model-for-low-resource-african-languages/

If you find it interesting and would like to dive more deep, then you should download their published research paper on arXiv. (That's literally all you need and perhaps with a little bit of googling here and there should you encounter any implementation they did that's not clear to you or you find it hard to reimplement.)

If you, however, consider this "a lot of work", then searching on YouTube might probably work best for you.

Lelapa

Nhlawulo Shikwambane

InkubaLM: A small language model for low-resource African languages...

As AI practitioners, we are committed to forging an inclusive future through the power of AI. While AI holds the promise of […]

fallow coyote May 31, 2025, 12:45 PM

#

odd meteor You just have brave the storm. Find ways to make it fun. I recently saw a video ...

Tbf when I first started self learning ML/AI I thought I could just get straight onto using the tools but in reality, the mathematics is much more important. Understanding and learning the maths behind ML is so useful; I can actually understand how the ML libraries work, even if its at a surface level understanding.

zinc patrol May 31, 2025, 1:33 PM

#

hai guys

lavish wraith May 31, 2025, 2:49 PM

#

One confusing i my mind is data science is also make dashboard ??

timber blaze May 31, 2025, 2:50 PM

#

what

lavish wraith May 31, 2025, 2:50 PM

#

Could i need to learn all dashboard like ploty ,powerbi and tableau ??

#

I am not understand where should i go i have learned pandas,numpy and matplotlib ,could i also need to learn Excel ,sql

#

Is data science job role also make dashboard or not ??

timber blaze May 31, 2025, 2:53 PM

#

what are you trying to say

opaque condor May 31, 2025, 2:59 PM

#

odd meteor I'm going to presume you've sorted out the data collection part already or you h...

I haven't yet still trying to figure out what books I should probably teach it first or what text should I teach it first

odd meteor May 31, 2025, 3:23 PM

#

opaque condor I haven't yet still trying to figure out what books I should probably teach it f...

Since you're just trying to understand how to build one yourself, I'd say, forget about collecting your own dataset for now and use any open sourced dataset instead.

More so, you might wanna start from fine-tuning a already trained SLM to see how it works before trying to start training from scratch or even with a custom dataset you created yourself.

opaque condor May 31, 2025, 3:27 PM

#

What would you recommend what libraries should use for a small model?

karmic totem May 31, 2025, 3:27 PM

#

Looking for a guy who knows programming, more specifically Python, in DM please, urgently

opaque condor May 31, 2025, 3:29 PM

#

I know to find somebody with somebody who knows how to work with python just asking pytorch is there already a file for small language models or at least medium size models

odd meteor May 31, 2025, 3:30 PM

#

opaque condor What would you recommend what libraries should use for a small model?

PyTorch, Lightning, HuggingFace, Unsloth

opaque condor May 31, 2025, 3:32 PM

#

I know does pie torch have anything for making a small language model just like how it has the data set that you can download into your network

odd meteor May 31, 2025, 4:11 PM

#

opaque condor I know does pie torch have anything for making a small language model just like ...

No it doesn't.

serene scaffold May 31, 2025, 5:20 PM

#

Anyone have a project they want to submit for the project showcase? #announcements message

opaque condor May 31, 2025, 6:05 PM

#

odd meteor No it doesn't.

So where should I get the data set?

odd meteor May 31, 2025, 6:46 PM

#

opaque condor So where should I get the data set?

You'll mostly find one on HuggingFace. The LelapaAI website I shared also added link the train and dev set they used in training their SLM. Check it out

odd meteor May 31, 2025, 6:57 PM

#

lavish wraith I am not understand where should i go i have learned pandas,numpy and matplotlib...

I’d prioritize learning SQL—but only to a practical level. Focus on mastering CRUD operations (Create, Read, Update, Delete) since that’s what you’ll use most often. You don’t need deep expertise like a Database Administrator; just know enough to query data, make updates, and handle basic tasks efficiently.

opaque condor May 31, 2025, 9:20 PM

#

odd meteor You'll mostly find one on HuggingFace. The LelapaAI website I shared also added ...

Which hugging face data set would you recommend?

smoky swan May 31, 2025, 9:21 PM

#

I have a project where you control your mouse with a gyroscope/accelerometer (like from your phone/smartwatch)
a kalman filter isn't enough, do yall suggest a neural network to filter noise?

smoky swan May 31, 2025, 10:54 PM

#

and if you have any, id be more than happy to hear suggestions on resources regarding NNs, im still having to learn. more specifically i thought about a recurrent NN (long term short term or whatever)

desert oar Jun 1, 2025, 12:12 AM

#

smoky swan I have a project where you control your mouse with a gyroscope/accelerometer (li...

Got an example? I assume you have something like xyz time series

smoky swan Jun 1, 2025, 1:09 AM

#

desert oar Got an example? I assume you have something like xyz time series

I can provide my script if you want me to, but the basic idea is as follows:
you collect accelerometer (and gyroscope) data, about every 5ms, so @200hz.
as of right now, it's being pulled from a local server hosted by phyphox on my phone. it does a great job at refining sensor data (kalman filter usw), but it's not enough
data looks like this:

acceleration = [(x1, y1, z1),
                 (x2, y2, z2),
                 (...),
                 (xn, yn, zn)]
rotation = [(x1,y1,z1),...] # you get the drill...

I measure the data live, so I can collect however much I want

tropic ridge Jun 1, 2025, 6:59 AM

#

Hey guys does anyone know where I can get the most recent annual mean cost of commodities like wheat,petrol and other energy sources per country?
I tried using our world in data and well there are a lot of incomplete data sets even for as far back as 2022

void geyser Jun 1, 2025, 8:25 AM

#

I created ai anyone have some idea to improve it and make it better

jaunty helm Jun 1, 2025, 9:03 AM

#

void geyser I created ai anyone have some idea to improve it and make it better

you have to be more specific than that for any help

near fractal Jun 1, 2025, 10:55 AM

#

I want to train an AI image model, where would I find a dataset for images?

north ether Jun 1, 2025, 10:55 AM

#

i want to use csv files and train datasets using tensorflow and stuff
Which version of python and tensorflow should i use?

north ether Jun 1, 2025, 10:55 AM

#

near fractal I want to train an AI image model, where would I find a dataset for images?

kaggle

#

or roboflow

limpid zenith Jun 1, 2025, 10:55 AM

#

north ether i want to use csv files and train datasets using tensorflow and stuff Which ver...

is this your first time using a DL framework?

north ether Jun 1, 2025, 10:55 AM

#

limpid zenith is this your first time using a DL framework?

yep

#

because i know it gets error

#

version not matches etc

limpid zenith Jun 1, 2025, 10:56 AM

#

I wouldn't suggest Tensorflow anymore, it's being phased out by Google. I would use Keras with Jax or Pytorch.
Or PyTorch manually or PyTorch with PyTorch Lightning these days

#

It's also easier for beginners and has friendly API and is the most use one in academia.

late blade Jun 1, 2025, 10:57 AM

#

near fractal I want to train an AI image model, where would I find a dataset for images?

what kind of image model ?

north ether Jun 1, 2025, 10:57 AM

#

limpid zenith It's also easier for beginners and has friendly API and is the most use one in a...

my teacher uses tensorflow so they said us to use that too

late blade Jun 1, 2025, 10:57 AM

#

object detection or image classification ?

near fractal Jun 1, 2025, 10:57 AM

#

late blade what kind of image model ?

generative AI, its for a school project so i just dont want deformed hands 😭

late blade Jun 1, 2025, 10:58 AM

#

near fractal generative AI, its for a school project so i just dont want deformed hands 😭

it is very very expensive

limpid zenith Jun 1, 2025, 10:58 AM

#

north ether my teacher uses tensorflow so they said us to use that too

ah okay, that's different then. It's still out of date.

Anyways what was the error you got?

near fractal Jun 1, 2025, 10:58 AM

#

late blade it is very very expensive

my teacher is permitting me to use the school's gpu array, i think its mostly 1080s but theres a few more modern models

#

unless you mean the data is expensive 😭

late blade Jun 1, 2025, 10:59 AM

#

near fractal my teacher is permitting me to use the school's gpu array, i think its mostly 10...

if you have atleast dozen h100 we can talk 😂

near fractal Jun 1, 2025, 10:59 AM

#

n e v e r m i n d

limpid zenith Jun 1, 2025, 10:59 AM

#

lmao

#

Deep Learning is EXPENSIVE

near fractal Jun 1, 2025, 10:59 AM

#

😭 what did he expect me to do he literally noticed my interest in AI and he said "oh you should try to make your own"

late blade Jun 1, 2025, 11:00 AM

#

you can make non generative image model cheaply and easily

near fractal Jun 1, 2025, 11:00 AM

#

text generative?

late blade Jun 1, 2025, 11:00 AM

#

nah

#

try yolo object detection

near fractal Jun 1, 2025, 11:00 AM

#

yea i have a classification model

late blade Jun 1, 2025, 11:01 AM

#

near fractal yea i have a classification model

improve on that it is easier than generative models

limpid zenith Jun 1, 2025, 11:01 AM

#

Actually you can fine tune your own model, don't train your own from scratch...that's expensive.

Look into Parameter Efficient LoRA

#

and 4 Bit Quantization

late blade Jun 1, 2025, 11:01 AM

#

limpid zenith Actually you can fine tune your own model, don't train your own from scratch...t...

that is another way

limpid zenith Jun 1, 2025, 11:01 AM

#

you can fine tune on a single 4090RTX with this and unsolth

late blade Jun 1, 2025, 11:01 AM

#

limpid zenith and 4 Bit Quantization

4 bit quant works with image models too ? i thought only text was possible

north ether Jun 1, 2025, 11:02 AM

#

limpid zenith ah okay, that's different then. It's still out of date. Anyways what was the e...

ImportError: DLL load failed while importing _pywrap_tensorflow_internal: A dynamic link library (DLL) initialization routine failed.


Failed to load the native TensorFlow runtime.
See https://www.tensorflow.org/install/errors for some common causes and solutions.
If you need help, create an issue at https://github.com/tensorflow/tensorflow/issues and include the entire stack trace above this error message.```

limpid zenith Jun 1, 2025, 11:02 AM

#

it's just quantization, float precision afaik...never tried it with image models...but why wouldn't it work with image models

late blade Jun 1, 2025, 11:03 AM

#

limpid zenith it's just quantization, float precision afaik...never tried it with image models...

just never thought about that

near fractal Jun 1, 2025, 11:03 AM

#

late blade improve on that it is easier than generative models

what about trend models? im decent at math for my age (im 16) and ive recently landed an internship at a quant firm, i want to see if i can make one that does low frequency trading off momentum instead of high freq trading like most quant firms. i think i can do something to smooth out the noise from the stock data by averaging the price per time period and using smth like fourier sequence to fit a sine approximation to the data

near fractal Jun 1, 2025, 11:03 AM

#

limpid zenith Actually you can fine tune your own model, don't train your own from scratch...t...

fine tuning sure yea ill look towards that

late blade Jun 1, 2025, 11:03 AM

#

near fractal what about trend models? im decent at math for my age (im 16) and ive recently l...

i am too dumb for impleamenting Ai models bruh

limpid zenith Jun 1, 2025, 11:04 AM

#

north ether ```from tensorflow.python._pywrap_tensorflow_internal import * ImportError: DLL ...

how did you install this package? was there any error? if not, try again with this

pip install tensorflow --force-reinstall

near fractal Jun 1, 2025, 11:04 AM

#

does anyone here have experience in stock market? does momentum trading provide reliable profit? even just 1% will be fine, i care more about a good sharpe ratio versus a good profit rater

north ether Jun 1, 2025, 11:05 AM

#

limpid zenith how did you install this package? was there any error? if not, try again with th...

using pip

late blade Jun 1, 2025, 11:05 AM

#

near fractal what about trend models? im decent at math for my age (im 16) and ive recently l...

i think adding llm for market news context will make better trades ?

north ether Jun 1, 2025, 11:05 AM

#

limpid zenith how did you install this package? was there any error? if not, try again with th...

it works fine with image dataset but it does not works with csv

limpid zenith Jun 1, 2025, 11:05 AM

#

north ether it works fine with image dataset but it does not works with csv

show full traceback

near fractal Jun 1, 2025, 11:06 AM

#

late blade i think adding llm for market news context will make better trades ?

news context usually is already shown by stock movements. of course, human analysts are used by quant funds since most signals are exhausted the second they show up, but then human analysts also cost a lot. im assuming that LLMs wont be able to provide as strong of a market context as human analysts, but itll be a good starting poiont. however news doesnt cover small firms as much, which is the market id want to target because of the signals there lasting for longer than 1/2 ms

north ether Jun 1, 2025, 11:07 AM

#

limpid zenith show full traceback

  File "C:\Users\laptops galaxy\Desktop\BSAI-5A\venv\Lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 73, i    from tensorflow.python._pywrap_tensorflow_internal import *
ImportError: DLL load failed while importing _pywrap_tensorflow_internal: A dynamic link library (DLL) initialization routine failed.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "c:\Users\laptops galaxy\Desktop\BSAI-5A\ANN\logon\gan_project.py", line 4, in <module>
    import tensorflow as tf
  File "C:\Users\laptops galaxy\Desktop\BSAI-5A\venv\Lib\site-packages\tensorflow\__init__.py", line 40, in <module>      
    from tensorflow.python import pywrap_tensorflow as _pywrap_tensorflow  # pylint: disable=unused-import
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "C:\Users\laptops galaxy\Desktop\BSAI-5A\venv\Lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 88, in <module>
    raise ImportError(
ImportError: Traceback (most recent call last):
  File "C:\Users\laptops galaxy\Desktop\BSAI-5A\venv\Lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 73, in <module>
    from tensorflow.python._pywrap_tensorflow_internal import *
ImportError: DLL load failed while importing _pywrap_tensorflow_internal: A dynamic link library (DLL) initialization routine failed.


Failed to load the native TensorFlow runtime.
See https://www.tensorflow.org/install/errors for some common causes and solutions.
If you need help, create an issue at https://github.com/tensorflow/tensorflow/issues and include the entire stack trace above this error message.```

limpid zenith Jun 1, 2025, 11:08 AM

#

what's your code?

north ether Jun 1, 2025, 11:09 AM

#

limpid zenith what's your code?

Gans Model

📎 message.txt

arctic wedgeBOT Jun 1, 2025, 11:09 AM

#

north ether Gans Model

~~Please react with ✅ to upload your file(s) to our paste bin, which is more accessible for some users.~~

north ether Jun 1, 2025, 11:18 AM

#

north ether Gans Model

@limpid zenith what should i do...

limpid zenith Jun 1, 2025, 11:20 AM

#

north ether <@778585347562930208> what should i do...

https://learn.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist?view=msvc-170#latest-microsoft-visual-c-redistributable-version

Install Visual C Redistributable -> https://aka.ms/vs/17/release/vc_redist.x64.exe

Latest supported Visual C++ Redistributable downloads

This article lists the download links for the latest versions of Visual C++ Redistributable packages.

north ether Jun 1, 2025, 11:21 AM

#

limpid zenith https://learn.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist?view=ms...

okayy

jaunty helm Jun 1, 2025, 11:39 AM

#

near fractal 😭 what did he expect me to do he literally noticed my interest in AI and he sai...

"make your own" is probably not possible, not only do you need lotta compute you also need lotta data
finetuning / making a lora should be more doable

#

as for deformed hands... I've heard that flux mostly fixed that issue? tbf I've never ran it cause I don't have the hardware

north ether Jun 1, 2025, 11:50 AM

#

limpid zenith https://learn.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist?view=ms...

still same issue

limpid zenith Jun 1, 2025, 11:54 AM

#

north ether still same issue

did you reinstall tensorflow?

north ether Jun 1, 2025, 4:40 PM

#

limpid zenith did you reinstall tensorflow?

Yep

odd meteor Jun 1, 2025, 7:00 PM

#

tropic ridge Hey guys does anyone know where I can get the most recent annual mean cost of co...

Maybe check WEF, or better still, try using LLM

tropic ridge Jun 1, 2025, 7:48 PM

#

odd meteor Maybe check WEF, or better still, try using LLM

Thanks man

upper niche Jun 2, 2025, 3:47 AM

#

hi

#

anyone here

serene scaffold Jun 2, 2025, 3:57 AM

#

upper niche anyone here

No.

upper niche Jun 2, 2025, 4:23 AM

#

I was about to ask in svr make_pipeline, and the use of standard scaler, does it only apply standard scaler() on X , or does it also apply to y

#

and also, you sound more... cheerful

past meteor Jun 2, 2025, 6:20 AM

#

upper niche I was about to ask in svr make_pipeline, and the use of standard scaler, does it...

Just on X

grand minnow Jun 2, 2025, 6:24 AM

#

To anyone who's hosting their own LLM, what is the cheapest GPU VPS service available?

#

I want to try and build a live proof of concept

dusty forge Jun 2, 2025, 7:10 AM

#

Introducing MediBeng-Whisper-Tiny! 🚀
We’ve fine-tuned OpenAI Whisper-Tiny on Hugging Face to 𝗧𝗥𝗔𝗡𝗦𝗟𝗔𝗧𝗘 code-switched Bengali-English speech into English. This helps improve doctor-patient transcription and makes clinical records more accurate. 🏥🎙️
Bonus: It’s an easy way to fine-tune Whisper for translation tasks!
Check out the repo for more details and try it out! 
🔗 MediBeng-Whisper-Tiny GitHub Repo : https://github.com/pr0mila/MediBeng-Whisper-Tiny
🔗 Hugging Face Model:hugging_fire: : https://huggingface.co/pr0mila-gh0sh/MediBeng-Whisper-Tiny
If you like it, don’t forget to give a ⭐ and 👍!

#AI #SpeechTranslation #HuggingFace #WhisperTiny #HealthTech #AudioToText

GitHub

GitHub - pr0mila/MediBeng-Whisper-Tiny: MediBeng Whisper Tiny impr...

MediBeng Whisper Tiny improves doctor-patient transcription by training the Whisper Tiny model to translate mixed Bengali-English speech into English, making it easier for analysis, record-keeping...

pr0mila-gh0sh/MediBeng-Whisper-Tiny · Hugging Face

runic parcel Jun 2, 2025, 10:34 AM

#

Can anyone please help me with fine tuning my tesseract ocr
https://discord.com/channels/267624335836053506/1379045149916528691

wintry pagoda Jun 2, 2025, 10:35 AM

#

Can anyone tell me about "JSON handling for data exchange"??

grand minnow Jun 2, 2025, 12:32 PM

#

wintry pagoda Can anyone tell me about "JSON handling for data exchange"??

That depends™️

What do you wanna know about it?

lethal pendant Jun 2, 2025, 12:59 PM

#

is it possible that i take the most advanced ai model from hugging face and tune it to generate youtube thumbnails

like https://pikzels.com

Pikzels

Pikzels - Viral Thumbnails & Titles in Seconds

Ideate, create & enhance viral YouTube thumbnails to skyrocket your views! Join the new era of AI-powered content creation.

serene scaffold Jun 2, 2025, 1:13 PM

#

lethal pendant is it possible that i take the most advanced ai model from hugging face and tune...

If you pick "the most advanced" model, you probably won't have enough GPU memory to fine tune it. But what you said is possible in principle

warm verge Jun 2, 2025, 1:30 PM

#

near fractal news context usually is already shown by stock movements. of course, human analy...

Someone highlighted your message to me which is why I rejoined the server. This message is very much off the mark, if you're interviewing then don't mention any of this

toxic mortar Jun 2, 2025, 1:36 PM

#

Hello why do I get significantly different results between my Google Colab (GPU) and local (CPU) runs?

both of them are using ```py
pipeline = Pipeline([
('tfidf', TfidfVectorizer(sublinear_tf=True, strip_accents='unicode', analyzer='char',
ngram_range=(2, 6), max_features=40_000)),
('clf', XGBClassifier(random_state=42, enable_categorical=True, device="gpu"))
])
le = LabelEncoder()
y_encoded = le.fit_transform(dataset['class'])

pipeline, y_test, y_pred = train_e2e(dataset['text'], y_encoded, pipeline)

#

CPU colab results are the same as the CPU local ones

wooden sail Jun 2, 2025, 2:30 PM

#

toxic mortar CPU colab results are the same as the CPU local ones

do you have a local gpu to also compare with? what immediately comes to mind is that gpus by default use 32 bit floats while cpus tend to use 64. most of the time this isn't an issue, but sometimes it can be. you can also set the bit depth to 32 bits for your cpu runs and see if they now match the gpu result

tropic ridge Jun 2, 2025, 3:05 PM

#

odd meteor Maybe check WEF, or better still, try using LLM

Thanks man

fallow coyote Jun 2, 2025, 3:13 PM

#

whats a good beginner book for linear algebra? Ive tried reading LinearAlgebraDoneRight but reading through it, i feel like im missing out on a lot of the context as my only exposure to linear algebra is vectors and matrices. Ill probably go back to LADR once I now about linear algebra but need the base knowledge first

tender egret Jun 2, 2025, 3:25 PM

#

can someone help me when i m importing another python in the same folder its not getting imported why

serene scaffold Jun 2, 2025, 3:26 PM

#

tender egret can someone help me when i m importing another python in the same folder its not...

what's the error message and the folder structure?

tender egret Jun 2, 2025, 3:30 PM

#

serene scaffold what's the error message and the folder structure?

folder struc
ai/dataset.py
model.py
prediction.py
train.py

error is cominhg moddule not found something like this

wooden sail Jun 2, 2025, 3:58 PM

#

fallow coyote whats a good beginner book for linear algebra? Ive tried reading LinearAlgebraD...

you can try gilbert strang's linear algebra for something based on matrices and vectors and with applied examples

wintry pagoda Jun 2, 2025, 4:05 PM

#

grand minnow That depends™️ What do you wanna know about it?

When do I use it?

verbal oar Jun 2, 2025, 5:59 PM

#

why I see mode 0 50000 instead of mode 50000 in example on simplilearn demo? ignore 0?

#

or maybe this 0 has some meaning

#

hmm I assume its index 0

#

mode() of pandas dataframe

#

see docs?

#

I confusingly thought its bimodal with 2 values but its probably not

#

untold bloom Jun 2, 2025, 6:34 PM

#

yes it's index 0

Always returns Series even if only one value is returned.

fallow coyote Jun 2, 2025, 6:36 PM

#

wooden sail you can try gilbert strang's linear algebra for something based on matrices and ...

Ill have a look. Ive seen that book recommended quite a bit. If I were to use that book as my primary source, what other books should I read for furthering my understanding (particularly in applying linear algebra to ML and other areas of programming/science that require it)?

wooden sail Jun 2, 2025, 6:36 PM

#

fallow coyote Ill have a look. Ive seen that book recommended quite a bit. If I were to use th...

probably a book on ml

fallow coyote Jun 2, 2025, 6:44 PM

#

wooden sail probably a book on ml

Thats actually reminded me of a book which i downloaded called mathematics for ml or something like that (youll know the one as its commonly used)

urban canopy Jun 2, 2025, 7:00 PM

#

I hear AI all the time. But I never hear about the people going into the weights and trying to figure out how it all works. Which is strange considering how common AI is.

iron basalt Jun 2, 2025, 7:01 PM

#

fallow coyote Ill have a look. Ive seen that book recommended quite a bit. If I were to use th...

I'm not sure about books. But IMO one of the strongest increases in understanding on linear algebra's applications is when you understand its connection to graph theory, and representing problems with graphs.

#

This is because both linear algebra and graph theory can be applied in so many cases, and they work nicely together.

iron basalt Jun 2, 2025, 7:02 PM

#

urban canopy I hear AI *all the time*. But I never hear about the people going into the weigh...

What do you mean by "going into the weights?"

sturdy shadow Jun 2, 2025, 7:03 PM

#

urban canopy I hear AI *all the time*. But I never hear about the people going into the weigh...

people dont need to understand all levels of abstraction to meaningfully extract value from something

#

and yeah weights in what sense? the sentence is analogous to "people going into the transistors of electronics": AI covers a huge field, and not all models will even have weights

odd meteor Jun 2, 2025, 7:06 PM

#

tender egret folder struc ai/dataset.py model.py prediction.py train.py error i...

Fixed it yet? If no, I'd like to see how you tried making the import

tender egret Jun 2, 2025, 7:07 PM

#

odd meteor Fixed it yet? If no, I'd like to see how you tried making the import

nah

#

not fixed

tender egret Jun 2, 2025, 7:08 PM

#

odd meteor Fixed it yet? If no, I'd like to see how you tried making the import

its like from dataset import datset . dataset is a tuple name inside dataset.py file

odd meteor Jun 2, 2025, 7:12 PM

#

tender egret its like from dataset import datset . dataset is a tuple name inside dataset.py ...

Cool. Are you using VSCode or Jupyter? Where are you making the import ?

Can you show the structure of your directory?

tender egret Jun 2, 2025, 7:13 PM

#

odd meteor Cool. Are you using VSCode or Jupyter? Where are you making the import ? Can y...

vs code

#

ai/dataset.py
model.py
prediction.py
train.py
structure

urban canopy Jun 2, 2025, 7:13 PM

#

sturdy shadow people dont need to understand all levels of abstraction to meaningfully extract...

I don't expect everyone to play with the weights, since some people just want to use the tool. Different strokes for different folks as they say.

I find the extreme lack of people interested in playing with the weights strange.

Most AI, such as LLMs and generative art has weights. I am also curious about AI that does not have weights of course, as it works in a different way.

odd meteor Jun 2, 2025, 7:14 PM

#

tender egret vs code

Good. So on which script are you making this import?

tender egret Jun 2, 2025, 7:14 PM

#

i fkin used all sort of gpts , ai , llms everything its not working

tender egret Jun 2, 2025, 7:14 PM

#

odd meteor Good. So on which script are you making this import?

train.py

sturdy shadow Jun 2, 2025, 7:15 PM

#

urban canopy I don't expect *everyone* to play with the weights, since some people just want ...

"Most AI has weights" what? maybe the flavour de jour nowadays is genai, but theres many industries and hobbyists who are using more old fashioned ML stuff every day lol

#

adjusting individual weights is basically just backpropagating but less accurately. would say its better to mess around with actual network architecture than spending any time on weights

tender egret Jun 2, 2025, 7:16 PM

#

can someone teach me how to make a neural network like in detail but in an easy way

odd meteor Jun 2, 2025, 7:17 PM

#

tender egret train.py

I'd like to see a full traceback if possible. Meanwhile, inside your ai directory, add an empty __init__. py script there as well

tender egret Jun 2, 2025, 7:17 PM

#

odd meteor I'd like to see a full traceback if possible. Meanwhile, inside your ai director...

i added it didnt worked

urban canopy Jun 2, 2025, 7:19 PM

#

sturdy shadow "Most AI has weights" what? maybe the flavour de jour nowadays is genai, but the...

I am curious how they train the old fashioned AI? Is it more of a hand-made decision tree?

sturdy shadow Jun 2, 2025, 7:21 PM

#

sorry what are you talking about

serene grail Jun 2, 2025, 7:22 PM

#

urban canopy I hear AI *all the time*. But I never hear about the people going into the weigh...

Are you perhaps talking about AI interpretability? That is a thing that some people are working on, sure

urban canopy Jun 2, 2025, 7:22 PM

#

serene grail Are you perhaps talking about AI interpretability? That is a thing that some peo...

Yes I would like to reach out to these people more.

odd meteor Jun 2, 2025, 7:23 PM

#

tender egret i added it didnt worked

Hmmm... Just double check once again from VSCode terminal, that you're indeed inside the ai directory. If not, you have to cd into that directory for it to work.

small path Jun 2, 2025, 7:25 PM

#

Ok so for the past 6 months i have been trying to work on a basic chatbot, but it's failing miserably

What i have tried:

Different types of model:

Chat generation
Rag
Seq2Seq

But none of that works, I'm taking my data from HF about 100k lines of conversational data

My latest model's params were 18M but yet it is still spewing gibberish

Any suggestions?

#

Now we wait ☕

#

Oh also I'm only using pytorch

tender egret Jun 2, 2025, 7:48 PM

#

odd meteor Hmmm... Just double check once again from VSCode terminal, that you're indeed in...

DONE EVERYTHING ITS NOT FKIN WORKING

sturdy shadow Jun 2, 2025, 7:49 PM

#

tender egret DONE EVERYTHING ITS NOT FKIN WORKING

would be very helpful if you provided screenshots, tracebacks, more detailed logs

agile cobalt Jun 2, 2025, 7:50 PM

#

tender egret DONE EVERYTHING ITS NOT FKIN WORKING

if you don't want to debug nor cooperate with people trying to help, just install a package manager like uv and create a new project following its documentation, then copy/paste your code into the template it generates

odd meteor Jun 2, 2025, 7:53 PM

#

tender egret DONE EVERYTHING ITS NOT FKIN WORKING

I get that it can be frustrating when our code isn't going as planned, however, behind every error message, there's always a clue therein.

Can you send the full traceback?

tender egret Jun 2, 2025, 7:59 PM

#

odd meteor I get that it can be frustrating when our code isn't going as planned, however, ...

ya like its that man tbh it took like 5 hours for that still it wasnt working

tender egret Jun 2, 2025, 8:00 PM

#

agile cobalt if you don't want to debug nor cooperate with people trying to help, just instal...

if u think why did i use caps lock . it was toggled for another work but i used it for it

elder flume Jun 2, 2025, 8:00 PM

#

Guys

tender egret Jun 2, 2025, 8:01 PM

#

elder flume Guys

yes sir

elder flume Jun 2, 2025, 8:01 PM

#

Can someone give me a path to be a data science? Bc I want to to this later

tender egret Jun 2, 2025, 8:01 PM

#

elder flume Can someone give me a path to be a data science? Bc I want to to this later

i aint a data science eng sry

#

vivek babu wsp

elder flume Jun 2, 2025, 8:02 PM

#

tender egret i aint a data science eng sry

limpid zenith Jun 2, 2025, 8:03 PM

#

elder flume Can someone give me a path to be a data science? Bc I want to to this later

math, lots of math is a good path to it

study calculus and linear algebra, stats & probability theory
then learn python and or r and then either take a course on data science or watch videos or read books on it
learn to work with kaggle environments and such

elder flume Jun 2, 2025, 8:04 PM

#

Ok that's really cool, I like math

#

Thanks

tender egret Jun 2, 2025, 8:06 PM

#

limpid zenith math, lots of math is a good path to it study calculus and linear algebra, stat...

probability is easy tho

#

buwet calculas and linear algebra are hard asf

limpid zenith Jun 2, 2025, 8:07 PM

#

wdym? probability is basically just combinatorics in disguise with kolmogorov axioms applied to scenarios

#

and combinatorics is difficult

tender egret Jun 2, 2025, 8:08 PM

#

bro tryin be shashi tharoor

elder flume Jun 2, 2025, 8:08 PM

#

limpid zenith math, lots of math is a good path to it study calculus and linear algebra, stat...

Ah and I also have a question. I'll learn all that stuff in 2,3 Years. Should I learn Python, R etc. First??

#

I mean math

#

I'm 7.grader🥲

limpid zenith Jun 2, 2025, 8:09 PM

#

elder flume Ah and I also have a question. I'll learn all that stuff in 2,3 Years. Should I ...

you can both at the same time, study math and python / r

elder flume Jun 2, 2025, 8:12 PM

#

Ight, do u think I can learn many stuff of python and R in 2-3 years(also math) and I can get a job?

#

I can already understand the basics of python and it's very interesting

limpid zenith Jun 2, 2025, 8:14 PM

#

i can't gauruntee anything since the job market is really bad rn, but it;s definitely a good asset to have learned

elder flume Jun 2, 2025, 8:19 PM

#

Ok👍

runic parcel Jun 2, 2025, 8:24 PM

#

Hey guys can anyone help me for finetuning my tesseract for getting better accuracy

#

really need the help

spring field Jun 2, 2025, 9:46 PM

#

runic parcel Hey guys can anyone help me for finetuning my tesseract for getting better accur...

what exactly do you need help with?
which part of the fine-tuning process are you struggling with?
otherwise this seems like a good start: https://github.com/Matleo/Tesseract_fine_tuning_training

runic parcel Jun 2, 2025, 9:47 PM

#

spring field what exactly do you need help with? which part of the fine-tuning process are yo...

Thanks alot for text

#

i will explain to you

#

So like i was trying to finetune, and i got really bad result, did using this doc. https://arcruz0.github.io/posts/finetuning-tess/
so like as said created files with image and its gt.txt value about 200. But when i tried to test it it gave me really bad answers, this is fully wrong and it detect nothing.

#

#

Here are the graphs of loss curve, which shows the result is good

runic parcel Jun 2, 2025, 9:48 PM

#

spring field what exactly do you need help with? which part of the fine-tuning process are yo...

i tired it but it really doesnt seem to work, i really need some help

spring field Jun 2, 2025, 9:55 PM

#

what text are you parsing? can you share your dataset? I'm afraid 200 images might not be enough

#

does your dataset contain data that is similar to that which you are trying to parse or is it just some random generic dataset? I can't tell how some of the images I saw in your dataset in your help post relate to the parsed text in your image above

glacial root Jun 2, 2025, 10:47 PM

#

tender egret DONE EVERYTHING ITS NOT FKIN WORKING

yo chill

#

you gain nothing from crashing out on discord

hoary wigeon Jun 3, 2025, 6:53 AM

#

Has anyone ever used spink - moj python library

an entity-resolution / entity-record deduplication tools. For. e.g. you have customer table and there are duplicate entry of customers i.e. distinct customer id and may have slight error in name and other details being same or slight change in contact...etc

#

I'm facing perfromance reated issue

austere swift Jun 3, 2025, 5:55 PM

#

Does anyone know a source where I can download the "Blackbird" dataset? (https://github.com/mit-aera/Blackbird-Dataset)

The domain mentioned in that repo is no longer up and I can't find any sources for the data anymore. There is a torrent for it on academictorrents but nobody is seeding it right now so it's just stalled on my client.

If that specific dataset can't be found do you guys know of a similar dataset to it? It's a flight perception dataset for drones that includes camera streams, rpm and imu measurements, and ground truths for pose estimation. I'd gladly look into anything that resembles this at a fairly large scale, the datasets i've found so far are very tiny (a few gb at most)

GitHub

GitHub - mit-aera/Blackbird-Dataset

Contribute to mit-aera/Blackbird-Dataset development by creating an account on GitHub.

#

atp I might just start contacting the researchers to see if they have it

spring field Jun 3, 2025, 9:45 PM

#

that's kind of the basics of economics (which is to say, I'm not sure how useful of a hypothesis it is, or if it's even much of a hypothesis in the first place)
https://www.investopedia.com/terms/e/economic-cycle.asp

#

I do otherwise like the idea though, it seems cool and might turn out to be more accurate than a monkey throwing darts after all 😄

exotic star Jun 3, 2025, 10:32 PM

#

Hi guys, i'll be spending 2 months in the USA this summer and i wanna make most of my time by getting into ML, i started learning pandas and numpy a bit so far. How would i learn the most while still learning everything throughly to actually learn it not just scratch on the surface?

weary crown Jun 4, 2025, 1:03 AM

#

is the reason why we use MSE as opposed to MAE for gradient descetnt so we only have 1 global optimum?

serene scaffold Jun 4, 2025, 1:10 AM

#

weary crown is the reason why we use MSE as opposed to MAE for gradient descetnt so we only ...

MSE has a smoother curve that gets even more gradual the closer you are to zero

#

I guess it's not smoother per se but it's definitely more gradual

weary crown Jun 4, 2025, 1:29 AM

#

ah okay that makse sense

#

oh another thing

#

with SGD right you are never reaching the global minimum as opposed to batch - thats fine obv for most purposes but my question is

#

does that
A. Every prediction is slightly slightly off
B. Some predictions are still spot on but most are off

#

pretty sure its A right but could the parameters accidentally line up and make some perfect predictions like in case B or nah

jaunty helm Jun 4, 2025, 3:12 AM

#

weary crown is the reason why we use MSE as opposed to MAE for gradient descetnt so we only ...

even prior to machine learning taking off, there are many statistical reasons for why MSE would be used (that I'm too dumb to understand to explain to people)

iron basalt Jun 4, 2025, 10:48 AM

#

jaunty helm even prior to machine learning taking off, there are many statistical reasons fo...

Assumption of errors being the result of many small independent effects, which when combined results in a Gaussian distribution by central limit theorem. Taking that Gaussian assumption it naturally follows from the objective (argmax_theta L_theta[data] (MLE)).

#

(The square comes from the square in the Gaussian)

#

There is also a geometric understanding to this. Basically you are assuming that the probability density "cloud" is spherically symmetric, and from that the best estimate is the one closest in distance to the observed data, where this distance is the Euclidean distance (L2 (quadratic norm)).

wooden sail Jun 4, 2025, 11:07 AM

#

weary crown is the reason why we use MSE as opposed to MAE for gradient descetnt so we only ...

following up on what squiggle just wrote, using MAE is like assuming that the error in each estimate (each entry of the vector you're looking for) is laplace distributed independently from the other estimates (instead of gaussian distributed)

#

the geometrical interpretation follows in a similar way, except we now think of more general "norm balls" instead of circular symmetry

#

these things would fall under "maximum likelihood estimation," in case you wanna take a deeper look yourself

remote gulch Jun 4, 2025, 12:04 PM

#

I guys i am new to data science and i want to build an nlp project for my portfolio which is not too generic but also give me somethings to learn.

#

Can i please get resources related to that?

#

That's nice

#

its very creative ngl

#

how can i buid it?

#

what tools to use?

#

okok

#

can i get your receipts?

#

I have never done too much shopping😭

#

ok makes sense

#

well i will start shopping then😂

#

Yeah will do that

#

Dm maybe?

serene scaffold Jun 4, 2025, 12:28 PM

#

No--people need to buy it themselves

primal tulip Jun 4, 2025, 12:29 PM

#

serene scaffold No--people need to buy it themselves

Understood. I'll delete my comment. Thanks for the quick reply.

serene scaffold Jun 4, 2025, 12:29 PM

#

it's disrespectful to the authors to send people free copies of their work

primal tulip Jun 4, 2025, 12:31 PM

#

serene scaffold it's disrespectful to the authors to send people free copies of their work

As in, what's AI doing with its ilegally sourced training data to output images?

serene scaffold Jun 4, 2025, 12:32 PM

#

primal tulip As in, what's AI doing with its ilegally sourced training data to output images?

what point are you trying to make? I'm not in charge of that.

primal tulip Jun 4, 2025, 12:34 PM

#

I'm messing a bit with you. No ill intention, just pulling your feet for your profile pic. I worked within a legal tech, that aided fight copyright infringement, sharing something I legally bought is not wrong. It's pretty much a grey area, that's why piracy is really hard to punish, especially in those cases no money transactions is involved

burnt hearth Jun 4, 2025, 12:36 PM

#

Hello I am new in python i am creating a model audio to text, for used openai/wisper large-3 but my audio duration is 10 min or + how that process it been take very long time
and this process run on my local system any buddy tell me if can make faste

primal tulip Jun 4, 2025, 12:37 PM

#

Sorry if this is getting sidetracked. @serene scaffold already helped me answer my question. I'll keep my unrelated posts to a minimum.

@final kiln both. Either helping companies, big or small or individuals that suffered from it.

serene scaffold Jun 4, 2025, 12:37 PM

#

burnt hearth Hello I am new in python i am creating a model audio to text, for used openai/wi...

I'm not sure if there's a way to get good performance with Whisper without a GPU.

runic sundial Jun 4, 2025, 12:37 PM

#

Hey everyone. Just got in to a company that has an ongoing AI Project. Would love to have some advice on what I should focus on learning, but based on the direction of the project it seems we'll be automating a lot of stuff and probably develop some more projects for better quality assurance, do you think learning some Machine Learning or Data Science would help me keep up with the team? Also, are there any courses that you would recommend I take just so I can dip my fingers and pretty much understand the bigger picture of this project that I'll become a part of.

burnt hearth Jun 4, 2025, 12:39 PM

#

serene scaffold I'm not sure if there's a way to get good performance with Whisper without a GPU...

hmmm you are right, is there any platform to use GPU

primal tulip Jun 4, 2025, 12:41 PM

#

burnt hearth hmmm you are right, is there any platform to use GPU

I have tried AWS and GCP. GCP is a bit cheaper in my experience and usecases.
You have like 200~$ for the first two months for free when creating a new account for the first time.

woven prairie Jun 4, 2025, 12:51 PM

#

See I have purchased open ai key

#

I am using there gpt4o model can anyone tell what is the context window size of this model.

primal tulip Jun 4, 2025, 12:52 PM

#

woven prairie I am using there gpt4o model can anyone tell what is the context window size of ...

You have a range you can explicitly define. Give me one second and I'll look it out

woven prairie Jun 4, 2025, 12:52 PM

#

Yes

primal tulip Jun 4, 2025, 12:53 PM

#

From 1 token, to 16384. Default is 2048 if undefined

woven prairie Jun 4, 2025, 12:54 PM

#

2048 tokens by default

jaunty helm Jun 4, 2025, 12:54 PM

#

woven prairie I am using there gpt4o model can anyone tell what is the context window size of ...

128k would be my first guess

woven prairie Jun 4, 2025, 12:54 PM

#

jaunty helm 128k would be my first guess

Yes some in google chanting it

#

What's the truth range

jaunty helm Jun 4, 2025, 12:55 PM

#

woven prairie What's the truth range

128k context
16k is how much it can output at once

woven prairie Jun 4, 2025, 12:56 PM

#

1 lakh 28 thousand is input and output is 16 thousand

jaunty helm Jun 4, 2025, 12:56 PM

#

note that the quality of responses likely degrades before hitting the 128k

woven prairie Jun 4, 2025, 12:56 PM

#

Context window is the total of input and output

primal tulip Jun 4, 2025, 12:57 PM

#

If you're using the API, it should be the same as https://platform.openai.com/playground, shouldn't it?

jaunty helm Jun 4, 2025, 12:58 PM

#

primal tulip If you're using the API, it _should_ be the same as https://platform.openai.com/...

maybe it doesn't allow 128k on playground to prevent abuse?
idrk

primal tulip Jun 4, 2025, 1:02 PM

#

jaunty helm maybe it doesn't allow 128k on playground to prevent abuse? idrk

No, you were right.
https://platform.openai.com/docs/models/gpt-4o

full context window is 128000, and max output is 16Kish tokens

woven prairie Jun 4, 2025, 1:03 PM

#

What parameter in payload sets the context window

burnt hearth Jun 4, 2025, 1:05 PM

#

primal tulip I have tried AWS and GCP. GCP is a bit cheaper in my experience and usecases. Y...

if i use my local machine for in develop case what is minimum rquirement of local system can atleast 10 minute audio,
current system is i5 7th gen, 2GB nvida

#

Because I am a huge model It not only audio to text
it compete transcript model which using as open source model

primal tulip Jun 4, 2025, 1:06 PM

#

woven prairie What parameter in payload sets the context window

I believe this is used in the hidden chain of thoughts layer, we don't know what it's thinking process is due to the way they built it. I'm unaware if they changed it, but the last year I wasn't able to mess with that.

agile cobalt Jun 4, 2025, 1:07 PM

#

woven prairie What parameter in payload sets the context window

it is literally your input (prompt)

the only part of it you can control is the maximum amount of output tokens

primal tulip Jun 4, 2025, 1:07 PM

#

burnt hearth if i use my local machine for in develop case what is minimum rquirement of loca...

AI is really demanding when you want to launch it locally. I don't think you can do much with it, but I'll search hugging face to see if there's a minimum model that you can withstand. What's your NVIDIA?

woven prairie Jun 4, 2025, 1:08 PM

#

agile cobalt it is literally your input (prompt) the only part of it you can control is the ...

Yes by max token I can set the output length

#

Thanks for your response

#

One more thing , how can I improve prompt

agile cobalt Jun 4, 2025, 1:09 PM

#

write better

burnt hearth Jun 4, 2025, 1:09 PM

#

primal tulip AI is really demanding when you want to launch it locally. I don't think you can...

yaa I am trying to use hugging face, NVIDIA graphic card

primal tulip Jun 4, 2025, 1:09 PM

#

woven prairie One more thing , how can I improve prompt

https://www.promptingguide.ai/

Prompt Engineering Guide – Nextra

A Comprehensive Overview of Prompt Engineering

jaunty helm Jun 4, 2025, 1:10 PM

#

woven prairie One more thing , how can I improve prompt

clearly state your requirements
like, in what format is the data passed in
what do you expect it to output

woven prairie Jun 4, 2025, 1:13 PM

#

Ok , thanks I am going to readit .

primal tulip Jun 4, 2025, 1:19 PM

#

burnt hearth yaa I am trying to use hugging face, NVIDIA graphic card

Do you want to do Text To Speech (TTS)?
Also, you also NEED to know about your graphics card model, in some cases you have to download a specific driver to be able to use certain AIs.

I'm assuming you use linux. I found this that you can either run it on a raspberry pi 4 or at linux with python.
https://github.com/rhasspy/piper

This is amazingly fast and lightweight, so I'd check it out if I were you. Also, if you're already at Huggingface looking at a certain model, read their documentation, that's better than anything else.

GitHub

GitHub - rhasspy/piper: A fast, local neural text to speech system

A fast, local neural text to speech system. Contribute to rhasspy/piper development by creating an account on GitHub.

burnt hearth Jun 4, 2025, 1:34 PM

#

primal tulip Do you want to do Text To Speech (TTS)? Also, you also NEED to know about your ...

it good but some of language which i am looking it not available, simple
Audio to Text
Text to Transcript (any lang)
Transcript to Speech

that is my goal for now

I have very long vision regarding this model but currently need to ony these step

and yes if you good model for translation let me know

I use Argos Translate for translation but it not given expected output as compare with Google translation

agile cobalt Jun 4, 2025, 1:35 PM

#

burnt hearth it good but some of language which i am looking it not available, simple Audio ...

what do you mean by "Transcript"?

transcription is the process of converting audio to text

burnt hearth Jun 4, 2025, 1:35 PM

#

agile cobalt what do you mean by "Transcript"? transcription is the process of converting au...

I mean translated - script

#

Like audio in English get text and translated into other lang then use TTS for text to speech with new translated text

agile cobalt Jun 4, 2025, 1:38 PM

#

burnt hearth it good but some of language which i am looking it not available, simple Audio ...

I don't think you'll find many open source TTS models with a descent quality that support more languages than piper though

maybe if you only need of a specific set of European languages, or Chinese/Japanese/Korean, but for other languages it can be very hard to find open source models that don't sound very bad

burnt hearth Jun 4, 2025, 1:39 PM

#

agile cobalt I don't think you'll find many open source TTS models with a descent quality tha...

https://github.com/coqui-ai/TTS

no i am found this and check example

GitHub

GitHub - coqui-ai/TTS: 🐸💬 - a deep learning toolkit for Text-...

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production - coqui-ai/TTS

#

once it done i am work on voice cloning for good sound

agile cobalt Jun 4, 2025, 1:40 PM

#

burnt hearth https://github.com/coqui-ai/TTS no i am found this and check example

it supports way less than languages than piper?
edit; ah you mean Fairseq Massively Multilingual Speech? I would expect for it to sound kinda bad, I cannot even find any samples online though
also their model weights license is a bit awkward

burnt hearth Jun 4, 2025, 1:43 PM

#

okay but i am finding hindi also included

#

if there any other model have support hindi lang let me

versed fractal Jun 4, 2025, 2:25 PM

#

Is open router good for free api

#

@burnt hearth

jaunty helm Jun 4, 2025, 2:26 PM

#

versed fractal Is open router good for free api

50 free calls / day iirc

#

unless you put 10$ into it, then like 1000 / day (you don't have to spend it, just having it in)

versed fractal Jun 4, 2025, 2:26 PM

#

Is 50 free calls a day good

jaunty helm Jun 4, 2025, 2:27 PM

#

versed fractal Is 50 free calls a day good

no

hollow pagoda Jun 4, 2025, 2:27 PM

#

What iirc mean

jaunty helm Jun 4, 2025, 2:27 PM

#

hollow pagoda What iirc mean

if I remember correctly

versed fractal Jun 4, 2025, 2:27 PM

#

If I remember correctly

versed fractal Jun 4, 2025, 2:27 PM

#

jaunty helm no

What is good

hollow pagoda Jun 4, 2025, 2:27 PM

#

I be seeing it and not knowing

jaunty helm Jun 4, 2025, 2:28 PM

#

versed fractal What is good

depends on your usecase? maybe 50 is enough
imo 50 will be spent rather quickly unless it's just some one-off tests

versed fractal Jun 4, 2025, 2:29 PM

#

What is 50 a day

jaunty helm Jun 4, 2025, 2:29 PM

#

versed fractal What is 50 a day

like you can get 50 responses per day

versed fractal Jun 4, 2025, 2:29 PM

#

Only

#

Nah that's bad

#

What is a good free api website

agile cobalt Jun 4, 2025, 2:30 PM

#

Mistral and Google have some pretty nice free tiers, just be aware that any data you send to free APIs is as good as public as far as privacy goes

versed fractal Jun 4, 2025, 2:30 PM

#

Should I use Google gemini

jaunty helm Jun 4, 2025, 2:30 PM

#

I can say that for mistral, you're highly unlikely to hit the limit as an individual

versed fractal Jun 4, 2025, 2:30 PM

#

Or mistral

jaunty helm Jun 4, 2025, 2:31 PM

#

the "limit" is like 1 billion tokens per month

versed fractal Jun 4, 2025, 2:31 PM

#

I think mistral is the best

#

How much is a token btw

agile cobalt Jun 4, 2025, 2:32 PM

#

it varies~~, usually each word is 2~4 tokens in English iirc~~
https://ai.google.dev/gemini-api/docs/tokens?lang=python

For Gemini models, a token is equivalent to about 4 characters. 100 tokens is equal to about 60-80 English words.

versed fractal Jun 4, 2025, 2:32 PM

#

Oh good

#

Is there Arabic (just asking)

jaunty helm Jun 4, 2025, 2:32 PM

#

agile cobalt it varies~~, usually each word is 2~4 tokens in English iirc~~ <https://ai.googl...

I think it's closer to 1-3 now

agile cobalt Jun 4, 2025, 2:33 PM

#

versed fractal Is there Arabic (just asking)

most commercial LLMs work with nearly any language with a significant presence in the internet

if you can find online communities speaking that language odds are it'll work

versed fractal Jun 4, 2025, 2:33 PM

#

agile cobalt it varies~~, usually each word is 2~4 tokens in English iirc~~ <https://ai.googl...

How many tokens a day

jaunty helm Jun 4, 2025, 2:33 PM

#

versed fractal Is there Arabic (just asking)

I think so
mistral's models are very often advertized as having been trained on a variety of languages

agile cobalt Jun 4, 2025, 2:33 PM

#

versed fractal How many tokens a day

I don't remember out of the top of my head, check their pricing/documentation

versed fractal Jun 4, 2025, 2:33 PM

#

agile cobalt most commercial LLMs work with nearly any language with a significant presence i...

Oh so I should u mistrals

versed fractal Jun 4, 2025, 2:33 PM

#

versed fractal Oh so I should u mistrals

Use mistral

agile cobalt Jun 4, 2025, 2:34 PM

#

versed fractal Oh so I should u mistrals

Gemini works just as well? what from that makes it sounds like mistral would be better or gemini worse

versed fractal Jun 4, 2025, 2:34 PM

#

I want something with more tokens

jaunty helm Jun 4, 2025, 2:35 PM

#

versed fractal I want something with more tokens

elaborate?
as in, larger context window? more lenient rate limits?

versed fractal Jun 4, 2025, 2:36 PM

#

jaunty helm elaborate? as in, larger context window? more lenient rate limits?

I would like something that lasts long for many users

agile cobalt Jun 4, 2025, 2:36 PM

#

versed fractal I want something with more tokens

you are billed per token, and after a certain point models tend to work worse the more tokens are in your prompt

in general it is best for the tokenizer to compress your inputs into as few possible tokens as possible

#

(even on a free API, it'll affect response time and may affect rate limits)

versed fractal Jun 4, 2025, 2:37 PM

#

I think there isn't something good free

agile cobalt Jun 4, 2025, 2:37 PM

#

Mistral and Google Gemini do have good free tiers (as long as you disregard data privacy)

versed fractal Jun 4, 2025, 2:38 PM

#

They will use it to train models??

woven prairie Jun 4, 2025, 2:38 PM

#

What the token size for open ai 3-4

agile cobalt Jun 4, 2025, 2:38 PM

#

versed fractal They will use it to train models??

why else would they give out something that expensive for free

versed fractal Jun 4, 2025, 2:38 PM

#

Yeah that's okay I wouldn't send something private to the ai anyway

#

I would use it

#

Should I make the ai for a phone or pc

agile cobalt Jun 4, 2025, 2:39 PM

#

web, then it doesn't matters which device you are using it from

jaunty helm Jun 4, 2025, 2:39 PM

#

versed fractal I would like something that lasts long for many users

again that would depend on how many users there are and how token intensive your product is
just take a look at their respective limits that they clearly state:

mistral: https://docs.mistral.ai/deployment/laplateforme/tier/
gemini: https://ai.google.dev/gemini-api/docs/rate-limits

versed fractal Jun 4, 2025, 2:40 PM

#

Ok

#

So just want an opinion should I make my ai for PC or phone

#

For the first prototype before I start updating

#

I will use python backend for the model and the ai itself

sage gust Jun 4, 2025, 2:43 PM

#

Does anybody here know how I can use my GPU when training with tensorflow? I'm a bit uneducated on setting this up myself and could use some help

toxic pilot Jun 4, 2025, 3:57 PM

#

agile cobalt Mistral and Google Gemini do have good free tiers (as long as you disregard data...

i feel like disregarding data privacy is pretty much a given at this point unless ur hosting ur own LLaMa model or whatever

toxic pilot Jun 4, 2025, 3:58 PM

#

sage gust Does anybody here know how I can use my GPU when training with tensorflow? I'm a...

python3 -m pip install 'tensorflow[and-cuda]'

#

tf.config.list_physical_devices('GPU')

jaunty helm Jun 4, 2025, 4:14 PM

#

and, if you're not required to use tf, pytorch is a much better choice given that most of the attention that was on devving tf has moved on to jax

#

or heck, if you don't need a really low level api just use karras, works with whatever backend you like

sage gust Jun 4, 2025, 4:37 PM

#

We're using Tensorflow in school and it is very frustrating

#

But that's just because I'm really bad at coding

#

Maybe you can help me

#

Im currently working on a project that should recognize the genre classifying it in the 10 genres that GTZAN gives you

#

I trained my own model, and one using VG16 as base model

#

When I test using a file from the training data, it always shows "pop" for my own model as genre, and it always shows "classical" for the VGG16 one

#

And now im stuck

#

mel spectrograms are the training data

#

224x224

#

yeah, first i tried by using the images that gtzan has in its dataset, but my models didnt train at all

#

so i just converted them myselves using the wav files

toxic pilot Jun 4, 2025, 5:42 PM

#

wait i was under the impression that jax was designed to optimize torch/tf/tensor operations

#

i read the message wrong lmao

#

i read it as someone was saying that jax was a drop in replacement for tf

toxic pilot Jun 4, 2025, 5:48 PM

#

toxic pilot i read it as someone was saying that jax was a drop in replacement for tf

so basically i misunderstood the message 💀 mb

#

is it good?

#

jax i mean. did you see a non negligible performance boost

#

ah

#

what about usability?

#

better than TF is the lowest bar out there lmao

#

icl tf syntax is atrocious

#

hmm maybe i’ll check it out

#

pytorch is all right. it’s package management is brain damage tho

#

can’t believe i have to make sure all my versions are compatible when i’m using torch vision, torch audio and torch text

lime grove Jun 4, 2025, 9:00 PM

#

can someone tell me if the prices quoted on this website are accurate?
https://thedigitalprojectmanager.com/reviews/n8n-pricing/

The Digital Project Manager

n8n Pricing Tiers & Costs

Explore n8n's pricing tiers, additional costs, and comparisons with competitors to find the best plan for your needs.

#

$20/mo - $50/mo - $??/mo

agile cobalt Jun 4, 2025, 9:30 PM

#

lime grove can someone tell me if the prices quoted on this website are accurate? https://t...

just check the official website? https://n8n.io/pricing/
they have have a free community version with a very restrictive license

n8n Plans and Pricing - n8n.io

Discover n8n's pricing alternatives for cost-effective workflow automation solutions. Find the perfect plan to maximize your workflows.

wet dome Jun 4, 2025, 9:45 PM

#

Complete beginner to ml here, how do you know what model to use by just looking at a dataset?

#

I want to do some kaggle competitions to learn but how do I know what model to use, like a linear regression or something else

serene scaffold Jun 4, 2025, 9:48 PM

#

wet dome Complete beginner to ml here, how do you know what model to use by just looking ...

You have to decide what the model is intended to do, and then consider what kind of models can learn to do that in terms of the data.

#

The composition of the dataset doesn't necessarily dictate what you can do with it. So "I want to train a model on this dataset" is an incomplete goal. Train it to do what?

wet dome Jun 4, 2025, 9:50 PM

#

serene scaffold You have to decide what the model is intended to do, and then consider what kind...

So what is a linear regression intended to do

#

Cause that's the simplest isn't it

serene scaffold Jun 4, 2025, 9:51 PM

#

wet dome So what is a linear regression intended to do

Linear regression is a process. It's not an action that a model does.

#

"a thing that a model does" would be something like "predict the value of a home given its square footage and number of rooms"

wet dome Jun 4, 2025, 9:52 PM

#

OK so for that example you just gave, from that what model would you use

serene scaffold Jun 4, 2025, 9:53 PM

#

xgboost, maybe?
I do natural language processing.

wet dome Jun 4, 2025, 9:53 PM

#

Is this something you learn about when you learn ml, when to use said model?

serene scaffold Jun 4, 2025, 9:54 PM

#

You develop a sense for it.

wet dome Jun 4, 2025, 9:56 PM

#

Isn't nlp like coding in more human like language or something

serene scaffold Jun 4, 2025, 9:58 PM

#

It's not about coding in human language. It's just "AI and ML as it pertains to human/natural language"

#

ChatGPT is an application of NLP

spring field Jun 4, 2025, 10:04 PM

#

it's not "natural language programming", it's "natural language processing" 😄

coarse valve Jun 4, 2025, 11:01 PM

#

Help me understand...I went to a ML presentation from a vendor today. Our team provided them with a lot of data. The step in their process so far was to build out an unsupervised model to cluster our client base. This is all paraphrasing but essentially they did so feature engineering, parameter hypertesting...cleaning ecc. They then tried Kmean clustering with PCA...testing features i think of 5, 10, 20. I believe the Kmeans was insufficient resulting in ~800 clusters so they tried dbscan + a centroid distance based regrouping they got it to 30 clusters with ...i forget the term but essentially Edit: silhouette score ~~positive rate~~ of .51. We were tasked with analysing the different clusters and assigning labels to these clusters.

My question is...the only outcome from this model would be to then use those clusters/labels as "bucket" end points to then classify new incoming data correct? Some one my team (management) seem that this is the first step in further classifying our clients as like buyers/not ecc. I tried to really drive the point that what we define as labels WILL BE the classification we get...If I am completely off base or misunderstanding please let me know...im not smert

serene scaffold Jun 4, 2025, 11:46 PM

#

coarse valve Help me understand...I went to a ML presentation from a vendor today. Our team ...

With k means clustering, you pick a number for k, and it finds k centroids that represent the k most cohesive clusters it could find

#

You have to decide what significance each cluster has, if any

#

If you have a new instance that you want to classify, it belongs to whichever cluster it has the closest centroid to.

deft goblet Jun 5, 2025, 2:15 AM

#

Hey, I’m a CS grad from UPenn with a tech background. I just built and agent builder that lets you create any AI agent in 3 simple steps (takes about 3 minutes). The adds-on it takes care of the architecture, finding and connecting apis, and you get the full deployable code, turn it into an API, or export it to Lovable to try it out quickly.

If you want to give it a try:
https://search-dream-weaver-kit.lovable.app/

search-dream-weaver-kit

Lovable Generated Project

weary crown Jun 5, 2025, 3:41 AM

#

so im working with a dataset right now

#

from a league of legends API
the data for one of the features appears to be quite off/inaccurate, but i checked the script used to get the data as well as the API docs, and it should work

#

so should i just assume its broken xd cuz i cant find anything online about it talknig about the inaccuracy

lapis sequoia Jun 5, 2025, 5:39 AM

#

Is there any ML Engineer here doing NLP work using Windows natively (no WSL, no dual-boot, no VM), and everything works fine? how did you get everything working smoothly?

glass jetty Jun 5, 2025, 6:32 AM

#

Don't advertise here, see the #rules.

harsh verge Jun 5, 2025, 7:43 AM

#

ello

#

i need help with an error in my code

#

basically im trying to make a loss function for my neural network

#

and idk how to add a forward pass here

#

    #calculates data and regularization of losses
    #given model output and ground truth values
    def calculate(self, output, y):

        #calculate sample losses
        sample_losses = self.forward(output, y)

        #calculate mean loss
        data_loss = np.mean(sample_losses)

        #loss
        return data_loss```

jaunty helm Jun 5, 2025, 10:25 AM

#

bit of a weird one:
time series classification, I know my data is from int16 sensors, and I can see that some parts of it sit at exactly 32767 for an extended period of time
would you try "fixing" the time series somehow? or say ignore it and make a complementing series that's 1 if the data is clipped and 0 otherwise

wooden sail Jun 5, 2025, 10:27 AM

#

jaunty helm bit of a weird one: time series classification, I know my data is from int16 se...

do you know why it is that happens? if you're certain it's missing/erroneous data, then yeah, you can use a mask and treat it as a problem with missing data

jaunty helm Jun 5, 2025, 10:28 AM

#

wooden sail do you know why it is that happens? if you're certain it's missing/erroneous dat...

I'm pretty confident that it's cause the sensors are int16
like the only other value this happens at is -32768

#

and there are no greater/lower values than 32767/-32768

wooden sail Jun 5, 2025, 10:29 AM

#

mhm

#

but i guess the way you would treat it is different if the problem is clipping as opposed to just missing data

#

in either case though, it does make sense to mark the entries

jaunty helm Jun 5, 2025, 10:35 AM

#

wooden sail but i guess the way you would treat it is different if the problem is clipping a...

an example slice of the time series where I think clipping is happening
(technically there are 6 series but I don't think that's too important for this issue)
other than clipping, it looks to have pretty high periodicity
ig what I'm asking is, are there techniques to somehow transform the data so it'd be like if it wasn't clipped?

wooden sail Jun 5, 2025, 10:36 AM

#

aha, now that's a very different question 😛

#

what is this measuring?

jaunty helm Jun 5, 2025, 10:37 AM

#

the one that has the hover is the angular velocity during a hand movement in the x direction

jaunty helm Jun 5, 2025, 10:38 AM

#

wooden sail aha, now that's a very different question 😛

ye I thought this would be a good learning opportunity, cause I've never ever really tried "unscrewing" time series before

wooden sail Jun 5, 2025, 10:38 AM

#

did you write the code that reads the sensor data? or using something found online?

#

phase unwrapping is not really simple if you don't have an accompanying quantity

jaunty helm Jun 5, 2025, 10:39 AM

#

wooden sail did you write the code that reads the sensor data? or using something found onli...

it's a competition setting, so I only receive the data

wooden sail Jun 5, 2025, 10:39 AM

#

aha

#

what are the other curves?

jaunty helm Jun 5, 2025, 10:40 AM

#

wooden sail what are the other curves?

acceleration in xyz and angular velocity in xyz

wooden sail Jun 5, 2025, 10:42 AM

#

you can probably do some sort of fitting that requires a couple of derivatives to be continuous

#

should be doable with splines

#

that's like enforcing smooth motion

jaunty helm Jun 5, 2025, 10:44 AM

#

wooden sail should be doable with splines

hm, so something like the SplineTransformer I assume?
I'll look up on it

wooden sail Jun 5, 2025, 10:47 AM

#

do you know for a fact that the data is periodic? if that's a fair assumption, there are periodic splines you could try. but yeah, you can try masking out the clipped values and interpolating them back in with quadratic or cubic splines

#

there are fancier things you can do, but i think this is a good starting point. maybe someone else has better suggestions

jaunty helm Jun 5, 2025, 10:47 AM

#

alright cool, ty!

wooden sail Jun 5, 2025, 10:48 AM

#

you could grab the differential equations of motion and relate the 3 components of linear and angular velocity to them, then have your network spit a solution to the differential equation

#

that'd be a more inverse kinematics approach, what people nowadays call "physics-informed neural network"

#

or if you have enough data, fr you can probably just mask out that data and train a black box network as is 😛

#

but splines are a very manageable first step and benchmark. polynomial good 💻 🐒

coarse valve Jun 5, 2025, 1:20 PM

#

serene scaffold With k means clustering, you pick a number for k, and it finds k centroids that ...

Thanks. This makes sense as a way to ID clusters that have some business importance from an "unknown" combination of factors, but if we have the "Y" value already...doesn't it make sense to use that as the target in a supervised learning from the start? Or at least see if our classification is stronger that way?>

eager lance Jun 5, 2025, 2:47 PM

#

hey guys where can i learn probability and statistics for data science?

fallow coyote Jun 5, 2025, 2:54 PM

#

I've just found this ML project repository which has a ton of projects. I was wondering if itd be worthwile pinning it. Feel like itd help beginners like me who have understood the maths but not quite know how to use the tool effectively

#

https://github.com/shsarv/Machine-Learning-Projects/tree/main

GitHub

GitHub - shsarv/Machine-Learning-Projects: This repository showcase...

This repository showcases a selection of machine learning projects undertaken to understand and master various ML concepts. Each project reflects commitment to applying theoretical knowledge to pra...

woven prairie Jun 5, 2025, 3:10 PM

#

Hi

#

I need some help in prompting

#

If anyone could help me , please mention me

#

This is the sytem prompt I am giving to open ai gpt4o model .

📎 system_prompt.txt

arctic wedgeBOT Jun 5, 2025, 3:13 PM

#

woven prairie This is the sytem prompt I am giving to open ai gpt4o model .

~~Please react with ✅ to upload your file(s) to our paste bin, which is more accessible for some users.~~

woven prairie Jun 5, 2025, 3:27 PM

#

✅

woven prairie Jun 5, 2025, 4:40 PM

#

Can anyone help me ?

verbal oar Jun 5, 2025, 5:09 PM

#

is simplilearn good or choose other channels and which?

#

about machine learning and data science

#

just want to watch just enough to have better understanding when doing some project

#

dont want to watxh redundant videos also dont want to watch while yt about ml and data science

#

simplilearn has playlist with 160 videos about data science and approx 500 about ai

#

but looks like or not they are overlapibg

woven prairie Jun 5, 2025, 5:15 PM

#

verbal oar is simplilearn good or choose other channels and which?

Bro it is a paid course or free ?

verbal oar Jun 5, 2025, 5:15 PM

#

free

woven prairie Jun 5, 2025, 5:15 PM

#

Are u comfortable in hindi

verbal oar Jun 5, 2025, 5:15 PM

#

no

woven prairie Jun 5, 2025, 5:15 PM

#

Then I don't have any suggestions for you

verbal oar Jun 5, 2025, 5:15 PM

#

i see its also outside of yt too

woven prairie Jun 5, 2025, 5:15 PM

#

You can visit Krish Naik

verbal oar Jun 5, 2025, 5:18 PM

#

I watched ml teach by doing but its not enough

#

from vizuara

#

I want some path like most important parts of machine learning and data science and then straight to practice project

#

or just start project and fill gaps

#

dont want to watch everything and not starting project

#

looks then like tutorial hell

woven prairie Jun 5, 2025, 5:27 PM

#

Bro

#

I don't know about vizuara but there before using any algorithm like linear regression and other you must know the maths behind it , then you can enjoy learning

#

Simply calling the library isn't going to help you

steel spindle Jun 5, 2025, 5:34 PM

#

Can someone explain pathfinding AI to me?

verbal oar Jun 5, 2025, 5:38 PM

#

you have start and target node you find shortest path to reach target with dijkstra or a*

steel spindle Jun 5, 2025, 5:44 PM

#

verbal oar you have start and target node you find shortest path to reach target with dijks...

And what are a* and dijkstra and which is easier?

verbal oar Jun 5, 2025, 5:48 PM

#

algorithms to find shortest path

unkempt apex Jun 5, 2025, 5:48 PM

#

steel spindle And what are a* and dijkstra and which is easier?

try to understand how this works

#

you can also try with pen and paper

verbal oar Jun 5, 2025, 5:49 PM

#

dijkstra is easier, a* has h heuristic

#

dijkstra is taught in discrete math courses

#

a* seems like robotics games rather

#

yes you can do dijkstra with table

#

look also at pseudocodes of them

spring field Jun 5, 2025, 5:59 PM

#

I'm not sure dijkstra is exactly easier, dijkstra is really just a specific heuristic for a* (essentially)

toxic pilot Jun 5, 2025, 6:03 PM

#

@final kiln sorry for ping, but im looking into jax rn. do you have any opinions between flax, vs equinox vs haiku for neural networks?

steel spindle Jun 5, 2025, 6:08 PM

#

Thx

smoky swan Jun 5, 2025, 6:22 PM

#

say i make an RNN to estimate velocity from acceleration that has the usual kinks you get from an accelerometer,
do i run it with a set value of past measurements (like 200) every time i get a new one, or do i just keep feeding it and look at the memory gate?
second one is much faster but i think my PC should be able to handly both, so im looking for the best runtime

also, just a regular RNN isn't enough, ive tried, even hand trained it, so im making an LSTM model

verbal oar Jun 5, 2025, 7:30 PM

#

I meant when you know bfs then dijkstra is easier

opaque condor Jun 5, 2025, 11:38 PM

#

How many photos do I need for each subject for a folder and do they have to be in separate folders or just one mixed up one?

tranquil juniper Jun 6, 2025, 12:14 AM

#

I was wondering if someone was slightly or more experienced with AlphaZero's type of model and could help explain why when i use the code from the most searched youtube tutorial it works, but when i modify it to keras instead of pytorch it doesn't give me any errors and i think i understand the code, but then it always learns the wrong policy as when i test it for a given board state it doesn't give a high probability for the "correct" move that wins them the game. Anyone who could help or point me in the right direction? Here is the pytorch code github: https://github.com/foersterrobert/AlphaZeroFromScratch/blob/main/7.AlphaTweaks.ipynb and here is my code: https://codeshare.io/5X843z

GitHub

AlphaZeroFromScratch/7.AlphaTweaks.ipynb at main · foersterrobert/...

Contribute to foersterrobert/AlphaZeroFromScratch development by creating an account on GitHub.

wet dome Jun 6, 2025, 12:24 AM

#

I've just been learning about linear regressions. Is there a reason we would use gradient descent to help find our coefficients for our model, instead of using the "normal equation" which to me looks like something we can just compute instead of having to iterate over our dataset

lapis sequoia Jun 6, 2025, 12:27 AM

#

What is the most used modul in python for Data Science?

#

And Is there any source I can learn Data Science with? like an Arabic Source or English

magic pasture Jun 6, 2025, 1:46 AM

#

wet dome I've just been learning about linear regressions. Is there a reason we would use...

Hi! You're correct in that sense, yet gradient descent is more preferred because gradient descent handles large datasets easier and more efficiently. If we use the normal equation, we have to compute the inverse of matrix, which can be computationally expensive and can lead to an undesired big O complexity.

cloud cosmos Jun 6, 2025, 2:54 AM

#

Hey im trying to code a transformer and learn about ai coding does anyone have any good tips or things to study?

vocal cove Jun 6, 2025, 12:09 PM

#

I'm trying to define a good cost to push fidelity to be exactly 1+0j.

def loss(
        mps: qtn.MatrixProductState,
        target_mps: qtn.MatrixProductState
    ) -> float:
    fidelity = mps @ target_mps.conj()
    return abs(fidelity - 1)**2

Need something that can get a bit more performance.

#

If you want to try it out (small code, so can just try on your side if you prefer) lmk.

quaint mulch Jun 6, 2025, 3:23 PM

#

wet dome I've just been learning about linear regressions. Is there a reason we would use...

on top of the other answer, we also usually use linear regression as way to teach gradient descent, especially for methods where analytical method is not applicable, like NN

quaint mulch Jun 6, 2025, 3:24 PM

#

cloud cosmos Hey im trying to code a transformer and learn about ai coding does anyone have a...

https://github.com/EleutherAI/cookbook

GitHub

GitHub - EleutherAI/cookbook: Deep learning for dummies. All the pr...

Deep learning for dummies. All the practical details and useful utilities that go into working with real models. - EleutherAI/cookbook

verbal oar Jun 6, 2025, 4:02 PM

#

so nn regressor does automatically linear regression?

#

what about generalized regression nn grnn is it still used, I remember I had about it on ml course

#

I mean I never met this outside of course, like only when googling about it

humble bone Jun 6, 2025, 4:06 PM

#

this channel being about ai; if someone is up to the challenge, could you answer this question

#

how many nodes in the input, hidden and output layer, how is this determined ?

wooden sail Jun 6, 2025, 4:08 PM

#

do you know how matrix multiplication is defined?

humble bone Jun 6, 2025, 4:08 PM

#

col ^rows...?

#

col * rows

verbal oar Jun 6, 2025, 4:09 PM

#

dot product

wooden sail Jun 6, 2025, 4:09 PM

#

and so what is the restriction regarding the sizes of the columns and rows when you multiply two matrices

#

given a matrix A of size m x n and a matrix B of size p x q, if we want to calculate AB, what do we know about the sizes m, n, p, and q?

verbal oar Jun 6, 2025, 4:10 PM

#

size of rows of A must equal size of columns of B or vice versa dont remember

wooden sail Jun 6, 2025, 4:10 PM

#

(this is the answer to your question regarding the number of nodes in the input, output, etc)

humble bone Jun 6, 2025, 4:11 PM

#

ohh ok think i got it

verbal oar Jun 6, 2025, 4:11 PM

#

or it is size of cols of A must equal size of rows of B

#

thats why must transpose matrix

jaunty helm Jun 6, 2025, 4:12 PM

#

verbal oar so nn regressor does automatically linear regression?

NNRegressor? which package has that

#

I only know MLPRegressor which is in sklearn

verbal oar Jun 6, 2025, 4:13 PM

#

no just generally

#

just some neural network regressor

humble bone Jun 6, 2025, 4:13 PM

#

so input layer = 3, hidden layer = 2 and output layer = 1

weak oxide Jun 6, 2025, 4:15 PM

#

I might as well ask here. Does any of you use Darts?

jaunty helm Jun 6, 2025, 4:16 PM

#

verbal oar no just generally

if you assume a nn has a width of 1 for simplicity, you can think of it kinda like input -> linreg -> activation -> linreg -> activation -> ... -> output
edit: well ig a width of 1 would make the inner layers really uninteresting, so just ignore that part

verbal oar Jun 6, 2025, 4:16 PM

#

interesting insight

weak oxide Jun 6, 2025, 4:16 PM

#

I was looking between TSLearn, GluonTS, and Darts for time series forecasting. I used to use manual forecasting, statsmodels, pmdarima but I needed a more united package

#

Manual forecasting meaning like I build like a regressor from sklearn and plot the predictions against actual.

verbal oar Jun 6, 2025, 4:19 PM

#

can also look at some time series R package if you are allowed to use it

#

or if only allowed python then different story

jaunty helm Jun 6, 2025, 4:20 PM

#

weak oxide I was looking between TSLearn, GluonTS, and Darts for time series forecasting. I...

there's also Nixtla which people say is faster I think

weak oxide Jun 6, 2025, 4:20 PM

#

verbal oar can also look at some time series R package if you are allowed to use it

I used R in the past still, but Python it's kinda challenging. My colleagues use Python

#

With R they have model time which is really nice

jaunty helm Jun 6, 2025, 4:21 PM

#

like Nixtla is focused on forecasting while the others include more analysis utilities

verbal oar Jun 6, 2025, 4:21 PM

#

wait you said forecasting not analysis ok I meant ts analysis then

weak oxide Jun 6, 2025, 4:22 PM

#

It's why I joined this server because my colleagues are like pmdarima and manual forecasting all the way. And I'm thinking there got to be a better answer

weak oxide Jun 6, 2025, 4:22 PM

#

verbal oar wait you said forecasting not analysis ok I meant ts analysis then

It's both

weak oxide Jun 6, 2025, 4:22 PM

#

jaunty helm like Nixtla is focused on forecasting while the others include more analysis uti...

I'll take a look

jaunty helm Jun 6, 2025, 4:24 PM

#

weak oxide It's why I joined this server because my colleagues are like pmdarima and manual...

"manual" forecasting is still competitive though
cause you can just include some lag features and turn it into a tabular prediction task, where gradient boosting trees excel

verbal oar Jun 6, 2025, 4:26 PM

#

please not Excel 😂

weak oxide Jun 6, 2025, 4:26 PM

#

jaunty helm "manual" forecasting is still competitive though cause you can just include some...

Yeah I seen colleagues do a manual xgboost and lightgbm, but I was hoping for like a united interface encompassing deep learning, classic forecasting, machine learning, and hierarchal all in one package . I do include lag features though my colleagues get super sticky on the specifics (will not mention)

jaunty helm Jun 6, 2025, 4:28 PM

#

weak oxide Yeah I seen colleagues do a manual xgboost and lightgbm, but I was hoping for li...

there's also sktime (and aeon which is a fork)
I've a rant in here a few weeks back about how I'm not impressed by the performance when compared to just raw dogging it in polars, but that could be because polars is fast and not that sktime is slow

weak oxide Jun 6, 2025, 4:29 PM

#

jaunty helm there's also `sktime` (and `aeon` which is a fork) I've a rant in here a few wee...

It's sad I never learned polars

#

I should but I'm used to pandas already

#

On a personal level too

jaunty helm Jun 6, 2025, 4:30 PM

#

weak oxide Yeah I seen colleagues do a manual xgboost and lightgbm, but I was hoping for li...

deep learning, classic forecasting, machine learning
yeah nothing like that exists, it would cover too broad a scope to not be bloat hell I imagine

weak oxide Jun 6, 2025, 4:30 PM

#

Yikes

#

Oh well

jaunty helm Jun 6, 2025, 4:32 PM

#

well ig technically a good chunk of the forecasting libs mentioned above (sktime, aeon, darts, nixtla, idk others) do include neural network methods
it's just that you can't alter the architecture as easily as say if you were to use pytorch

agile cobalt Jun 6, 2025, 4:32 PM

#

weak oxide I should but I'm used to pandas already

so were most people that use polars
I'd very strongly recommend learning it

weak oxide Jun 6, 2025, 4:32 PM

#

agile cobalt so were most people that use polars I'd very strongly recommend learning it

Does polars have something like pandas.ta

#

It's just convenient for a lot of packages I use

jaunty helm Jun 6, 2025, 4:33 PM

#

you have polars_df.to_pandas() and polars.from_pandas() to switch between them, if your package only takes pandas dataframes

weak oxide Jun 6, 2025, 4:34 PM

#

Like Edgar tools is the best SEC parsing package and they only parse to pandas. Idk I guess I'll try to learn it

verbal oar Jun 6, 2025, 4:34 PM

#

so transformers for ts forecasting is overkill for this case?

weak oxide Jun 6, 2025, 4:35 PM

#

verbal oar so transformers for ts forecasting is overkill for this case?

What's your data looking like

verbal oar Jun 6, 2025, 4:35 PM

#

just ask generally

jaunty helm Jun 6, 2025, 4:35 PM

#

verbal oar so transformers for ts forecasting is overkill for this case?

for now time series remain a field where (un?)fortunately the best model is "it depends on your data"

verbal oar Jun 6, 2025, 4:35 PM

#

oh ok

weak oxide Jun 6, 2025, 4:35 PM

#

jaunty helm for now time series remain a field where (un?)fortunately the best model is "it ...

Yeah

agile cobalt Jun 6, 2025, 4:35 PM

#

weak oxide Does polars have something like pandas.ta

there are some plugins like https://github.com/wukan1986/polars_ta and https://github.com/Yvictor/polars_ta_extension, I have never used either of them myself though

#

polars does have official support for plugins though, which is pretty neat

weak oxide Jun 6, 2025, 4:36 PM

#

How did I never heard about this. They hype up pandas ta with 130 technical indicators wtf

weak oxide Jun 6, 2025, 4:38 PM

#

jaunty helm for now time series remain a field where (un?)fortunately the best model is "it ...

Generally though for government economic data, they usually use ARIMA, SARIMAX

So it depends on the data but also the type of data

#

So my solution based on the chat seems to be Darts and Nixtla. I'll check both of them fully out

jaunty helm Jun 6, 2025, 4:41 PM

#

verbal oar oh ok

it's not even just transformers, sometimes deep learning just loses to a xgboost for example: https://arxiv.org/pdf/2101.02118

weak oxide Jun 6, 2025, 4:42 PM

#

jaunty helm it's not even just transformers, sometimes deep learning just loses to a xgboost...

Me personally you can combine models

weak oxide Jun 6, 2025, 4:45 PM

#

jaunty helm it's not even just transformers, sometimes deep learning just loses to a xgboost...

They didn't show any visualizations sad

desert oar Jun 6, 2025, 6:47 PM

#

smoky swan I can provide my script if you want me to, but the basic idea is as follows: you...

Not sure if you discussed with anyone else yet, but the reason I asked to see an example is because it's hard to know just from this description what kind of noise you have and what "not enough" means.

crimson jackal Jun 6, 2025, 7:05 PM

#

Is it possible to compute the eigenvalus of a matrix symbolically using numpy?

#

Sympy is too slow for larger matrices.

quaint mulch Jun 6, 2025, 7:12 PM

#

darts is the closest.
If darts is not good enough, then manual.

smoky swan Jun 6, 2025, 7:23 PM

#

desert oar Not sure if you discussed with anyone else yet, but the reason I asked to see an...

it's exactly what you expect:

(of course), random deviation from the exact value, especially when accel is high
sensor ringing (a sine curve) after changes in acceleration, especially when stopping, tapping or hitting something
gaps between measurement points contribute a lot to drift after stopping, almost impossible to filter out

all filters had either or multiple of the following problems:

not enough; still drift after some time or after fast movements
cursor moves back to where it came from (like with a vanilla RNN or forgetting integration
cursor starts absolutely tweaking once you move any faster than a slow glide
input felt highly unnatural, cursor was hard to move at all (often happened when you overdid it with filters that else aren't enough)

#

zero velocity updates (set velocity=0 when acceleration=0 and delta_rotation=0) either don't detect or detect too eagerly

exotic star Jun 6, 2025, 9:45 PM

#

hi guys, i dont know how to go about learning the math of ML? im learning the data libraries of python and i wanna start learning the core math of ML as well before i jump into deeper into ML and ML concepts so i can actually understand whats going on at a deeper level. How would i go about this? do u have any resources and guidence?

soft ermine Jun 6, 2025, 11:08 PM

#

exotic star hi guys, i dont know how to go about learning the math of ML? im learning the da...

u still struggling with this?

#

i understand machine learning at the lowest level possible so im sure i can help

exotic star Jun 6, 2025, 11:44 PM

#

soft ermine u still struggling with this?

yea

#

im not sure how to approach actually learning ML but i'll learn pandas/numpy/matplot parallel with math for ML

#

and try to implement the math with python

#

tho im not sure where and how to learn all of those

#

same about ML concepts

soft ermine Jun 6, 2025, 11:50 PM

#

i mean

soft ermine Jun 6, 2025, 11:51 PM

#

exotic star tho im not sure where and how to learn all of those

start with a single 2 input with weight -> 1 output with weight design. understand how that works and then how to retrain it to approximate towards a goal.

once youve done this its just a matter of scaling

#

essentially current ml models require training pools. the more neurons the more accurately that the machine can create an outcome. the more layers the more complex the result can be. understanding how a signal machine neuron works first is essential

exotic star Jun 6, 2025, 11:54 PM

#

soft ermine essentially current ml models require training pools. the more neurons the more ...

ok, i'll start there

#

thanks

soft ermine Jun 6, 2025, 11:56 PM

#

exotic star ok, i'll start there

ok bro good luck. if u need any help along the way lmk

exotic star Jun 6, 2025, 11:57 PM

#

soft ermine ok bro good luck. if u need any help along the way lmk

got it

soft ermine Jun 7, 2025, 12:03 AM

#

the increased bus width is representing the analogue signal to avoid unnecessary digital clock based/space unoptimised processes

spring field Jun 7, 2025, 12:03 AM

#

(oh god, I was imagining something like one made out of wood 😭)

soft ermine Jun 7, 2025, 12:03 AM

#

the ram contains the weights, though the architecture allows for localised ram for each neuron set

soft ermine Jun 7, 2025, 12:03 AM

#

spring field (oh god, I was imagining something like one made out of wood 😭)

wood isnt conductive...

#

ill be physically building it soon once ive got the modularisation down

spring field Jun 7, 2025, 12:04 AM

#

well, I was apparently conflating "physical" with "mechanical", mb

soft ermine Jun 7, 2025, 12:05 AM

#

i mean it will be physical soon. like basically small connectable cubes that can stack onto each other to create a massive neural network

spring field Jun 7, 2025, 12:05 AM

#

(I was thinking something not involving electricity (perhaps, with a hand crank ducky_skull ))

soft ermine Jun 7, 2025, 12:06 AM

#

i mean... i can power it with a hand crank but why would i do that when i can plug it in...

#

also that would involves some crazy gear work 😭

#

i just cant think creatively enough to modularise it effectively

iron basalt Jun 7, 2025, 12:25 AM

#

spring field (I was thinking something not involving electricity (perhaps, with a hand crank ...

I mess around with mechanical computers and such, here is a good starting point for some ideas: https://www.youtube.com/watch?v=s1i-dnAH9Y4 I recommend starting with Lego.

YouTube

Excelsior10000

Mechanical Computer (All Parts) - Basic Mechanisms In Fire Control ...

A 1953 training film for a mechanical fire control computer aboard Navy Ships. Amazing how problems of mathematical computation were solved so elegantly in "permanent" mechanical form, before microprocessors became inexpensive and commonplace.

▶ Play video

#

(I have done NNs)

soft ermine Jun 7, 2025, 12:29 AM

#

iron basalt I mess around with mechanical computers and such, here is a good starting point ...

bro this is so goofy

#

i done it though. ive discovered the perfect method to physically modularised neural networks

jaunty helm Jun 7, 2025, 2:51 AM

#

crimson jackal Sympy is too slow for larger matrices.

have you tried using symengine?

#

it's in c++ rather than python so should be more performant

weak oxide Jun 7, 2025, 3:11 AM

#

quaint mulch darts is the closest. If darts is not good enough, then manual.

Yeah about that, I saw the full documentation and I have to say Darts is awesome.

#

I was able to implement most of what I wanted

#

And then I supplement the rest with pytorch forecasting

#

It doesn't have SARIMAX explicitly but I can change it around to get SARIMAX

vale breach Jun 7, 2025, 11:02 AM

#

guys what is the

#

difference between iloc and loc in pandass

jaunty helm Jun 7, 2025, 11:18 AM

#

vale breach difference between iloc and loc in pandass

iloc takes integers while loc takes labels

vale breach Jun 7, 2025, 11:18 AM

#

wdym?

jaunty helm Jun 7, 2025, 11:20 AM

#

vale breach wdym?

like you do df.iloc[:, 2] to get the 3rd column
or, if the 3rd column is named "fruit," you can do df.loc[:, 'fruit']

vale breach Jun 7, 2025, 11:21 AM

#

jaunty helm like you do `df.iloc[:, 2]` to get the 3rd column or, if the 3rd column is named...

3 column and all the rows

vale breach Jun 7, 2025, 11:21 AM

#

jaunty helm like you do `df.iloc[:, 2]` to get the 3rd column or, if the 3rd column is named...

is it fixed to being a two parameter function or what

#

what I dont understand is the syntax df[[something, something]

#

#

i have this vue dataframe

#

and loc[row, column]

vale breach Jun 7, 2025, 11:25 AM

#

jaunty helm like you do `df.iloc[:, 2]` to get the 3rd column or, if the 3rd column is named...

what if loc only contains one param, wtf?

#

@jaunty helm

jaunty helm Jun 7, 2025, 11:27 AM

#

vale breach what if loc only contains one param, wtf?

the only difference in loc and iloc, is that loc uses labels (str) to access stuff while iloc uses indices (int)
it's df.loc[<row stuff>, <column stuff>], and if you omit the second one it's just df.loc[<row stuff>]

jaunty helm Jun 7, 2025, 11:27 AM

#

vale breach <@208918673178492929>

if you think I type too slowly just see here https://pandas.pydata.org/docs/user_guide/indexing.html

vale breach Jun 7, 2025, 11:27 AM

#

jaunty helm if you think I type too slowly just see here https://pandas.pydata.org/docs/user...

is fine, I gotchu

#

where can I check function definition of loc()?

#

because, if I only have one parameter you know is row

#

because loc(row, col) right

jaunty helm Jun 7, 2025, 11:28 AM

#

https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.loc.html

vale breach Jun 7, 2025, 11:29 AM

#

jaunty helm if you think I type too slowly just see here https://pandas.pydata.org/docs/user...

is df.loc(5, :) equivalent to df.loc(5)

jaunty helm Jun 7, 2025, 11:29 AM

#

vale breach is df.loc(5, \:) equivalent to df.loc(5)

not (), []

vale breach Jun 7, 2025, 11:29 AM

#

yeah

#

is df.loc[5, :] equivalent to df.loc[5]

vale breach Jun 7, 2025, 11:30 AM

#

jaunty helm not `()`, `[]`

ok

#

I think I get it tho, this cose in here

#

this code is saying df.loc[5, :] = [stuff to put in corresponding r,w ]

#

like

#

it will put in the corresponding columns

#

"heinz, armando" will be put in row = 5, column = 1

#

1255 will be put in row 5 column 2

#

'a' will be put in row 5 column 3

#

and "flex" will be put in row 5 column 4

#

I get it I think @jaunty helm

vale breach Jun 7, 2025, 11:32 AM

#

jaunty helm not `()`, `[]`

❤️

#

i appreciate it tho, you broke down the principal differences and I think I start to get it tho

vale breach Jun 7, 2025, 11:33 AM

#

jaunty helm not `()`, `[]`

are you from america?

vale breach Jun 7, 2025, 11:34 AM

#

jaunty helm not `()`, `[]`

how does df.groupby work ?

#

like grouped = df.groupby("columnnnn")

jaunty helm Jun 7, 2025, 11:39 AM

#

vale breach like grouped = df.groupby("columnnnn")

prepares the dataframe for an operation, such that it's applied to each group
like df.sum() gives you the sum of everything, df.groupby('a').sum() would get you the sums of each group

vale breach Jun 7, 2025, 11:39 AM

#

what about df.groupby("column")['something'].max()

#

@jaunty helm

jaunty helm Jun 7, 2025, 11:40 AM

#

vale breach <@208918673178492929>

same

vale breach Jun 7, 2025, 11:40 AM

#

what the flip does the [] do
after the groupby

jaunty helm Jun 7, 2025, 11:40 AM

#

vale breach what the flip does the [] do after the groupby

access a column

vale breach Jun 7, 2025, 11:41 AM

#

#

like I have this df

vale breach Jun 7, 2025, 11:42 AM

#

jaunty helm access a column

is hard

jaunty helm Jun 7, 2025, 11:43 AM

#

jaunty helm if you think I type too slowly just see here https://pandas.pydata.org/docs/user...

the indexing guide here is a lot more complete than what I can give you

vale breach Jun 7, 2025, 11:44 AM

#

i just need to guess which line of code makes the following operation on this df

#

I think it will be df.groupby("asiento")['fila'].sort()

vale breach Jun 7, 2025, 11:45 AM

#

jaunty helm the indexing guide here is a lot more complete than what I can give you

well, certainly the groupby, selects a column that you will be using

vale breach Jun 7, 2025, 11:45 AM

#

jaunty helm prepares the dataframe for an operation, such that it's applied to each group li...

group = column or row

vale breach Jun 7, 2025, 11:54 AM

#

jaunty helm the only difference in `loc` and `iloc`, is that `loc` uses labels (`str`) to ac...

#

you can also do loc on multiple columns

#

which is equivalent to this one

vale breach Jun 7, 2025, 11:56 AM

#

jaunty helm the only difference in `loc` and `iloc`, is that `loc` uses labels (`str`) to ac...

loc is the same thing as using df[label, label]

#

unless I am tripping @jaunty helm

vale breach Jun 7, 2025, 12:06 PM

#

jaunty helm access a column

I think groupby groups by the rows

#

and then we access the column with the []

#

something like that no?

vale breach Jun 7, 2025, 12:07 PM

#

jaunty helm prepares the dataframe for an operation, such that it's applied to each group li...

yeah so in this case we have the row 'a' that is going to be summed entirely or something

#

row 'a' from the dataframe

#

#

for this df we grouped by the animal row and accesed the column age to put the mean of each age of the animal

verbal oar Jun 7, 2025, 2:23 PM

#

groupby i think is like SQL's?

#

and used with some aggregation function

#

do I need some knowledge of hpc and parallel computing for machine learning?

#

or only when someone manages infrastructure of it?

#

so not neccessary

soft ermine Jun 7, 2025, 2:59 PM

#

verbal oar do I need some knowledge of hpc and parallel computing for machine learning?

tbh for machine learning u just need a module to do it and a training set :p

fallow coyote Jun 7, 2025, 4:24 PM

#

are there any books which teach linear algebra and its application in computing? ngl it just seems long learning about linear algebra. I just want to know enough linear algebra for machine learning. Im going to use some A level textbooks to learn matrix basics and then use gilbert strangs book to from chpater 3 onwards but i wouldnt mind using a book that teaches linear algebra ground up and learning to apply it

soft ermine Jun 7, 2025, 4:38 PM

#

fallow coyote are there any books which teach linear algebra and its application in computing?...

bro. linear algebra is insanely easy

fallow coyote Jun 7, 2025, 4:39 PM

#

Cool.

verbal oar Jun 7, 2025, 4:43 PM

#

for example when u calculate characteristic polynomial/equation then its similar to high school algebra

crimson jackal Jun 7, 2025, 4:43 PM

#

Is there any way to use wolfram mathematica inside python to compute eigenvectors symbolically?

serene scaffold Jun 7, 2025, 4:45 PM

#

crimson jackal Is there any way to use wolfram mathematica inside python to compute eigenvector...

I think you can just use sympy for that.

crimson jackal Jun 7, 2025, 4:46 PM

#

serene scaffold I think you can just use sympy for that.

No. I have a matrix 11 by 11 with a lots of zeros, and if i just introduce one variable it will take too much time to compute it

jaunty helm Jun 7, 2025, 4:47 PM

#

crimson jackal Is there any way to use wolfram mathematica inside python to compute eigenvector...

have you tried symengine

crimson jackal Jun 7, 2025, 4:47 PM

#

jaunty helm have you tried symengine

I dont know that one. I will check it.

soft ermine Jun 7, 2025, 4:51 PM

#

crimson jackal No. I have a matrix 11 by 11 with a lots of zeros, and if i just introduce one v...

have you used a tensor for the matrix?

crimson jackal Jun 7, 2025, 4:51 PM

#

soft ermine have you used a tensor for the matrix?

No.

soft ermine Jun 7, 2025, 4:52 PM

#

r.i.p

crimson jackal Jun 7, 2025, 4:52 PM

#

Why?

soft ermine Jun 7, 2025, 4:52 PM

#

gpus are really good at matrix functions c:

#

cuda probably has the protocols to do this efficiently

crimson jackal Jun 7, 2025, 4:54 PM

#

Wait, when you say tensor is not using Matrx(...) right? The question may be dumb. Ahahhah

soft ermine Jun 7, 2025, 4:54 PM

#

tensors are essentially a 3d array

#

an 11x11 matrix could fall into the tensors realm

crimson jackal Jun 7, 2025, 4:55 PM

#

How?

soft ermine Jun 7, 2025, 4:55 PM

#

if ofc it meets the criteria

iron basalt Jun 7, 2025, 4:55 PM

#

crimson jackal No. I have a matrix 11 by 11 with a lots of zeros, and if i just introduce one v...

You have tried sympy but it's too slow?

crimson jackal Jun 7, 2025, 4:56 PM

#

Yea. Tooooo slow.

iron basalt Jun 7, 2025, 4:56 PM

#

Is Mathematica a lot faster for this?

crimson jackal Jun 7, 2025, 4:56 PM

#

Yea, by a lotttttt

soft ermine Jun 7, 2025, 4:56 PM

#

cuda might be the optimal approach here

crimson jackal Jun 7, 2025, 4:57 PM

#

Mathematica can make it in seconds and using sympy may take at least some hours.

#

I may not have cuda.

soft ermine Jun 7, 2025, 4:57 PM

#

cuda should allow this to happen in millisecond latency

iron basalt Jun 7, 2025, 4:57 PM

#

If the difference is this large I doubt it's a hardware issue.

#

Sympy is probably just doing something horribly wrong.

soft ermine Jun 7, 2025, 4:58 PM

#

no its most likely that sympy is using cpu fetch decode execute cycles instead of gpu tensor capabilities

#

which is the wrong approach to digital matrices

iron basalt Jun 7, 2025, 4:59 PM

#

It's an 11x11 matrix.

#

With mostly zeros.

soft ermine Jun 7, 2025, 4:59 PM

#

yeah tensors should be perfect for that

crimson jackal Jun 7, 2025, 4:59 PM

#

I can show you how the matrix is.

iron basalt Jun 7, 2025, 4:59 PM

#

No I mean like 20 year old hardware can demolish that task.

soft ermine Jun 7, 2025, 5:00 PM

#

yeah

iron basalt Jun 7, 2025, 5:00 PM

#

Something is just being done really wrong.

#

No CUDA needed, or anything like that.

crimson jackal Jun 7, 2025, 5:02 PM

#

This is 12 by 12 but its basically to remove one row and one column.

#

I want to substitute all of them except maybe 3 variables.

crimson jackal Jun 7, 2025, 5:03 PM

#

iron basalt Something is just being done really wrong.

I can send the code.

iron basalt Jun 7, 2025, 5:03 PM

#

crimson jackal I can send the code.

If you can create a short MRE for the people here that would be great.

#

Maybe with a smaller simple matrix as an example too of what you want.

crimson jackal Jun 7, 2025, 5:04 PM

#

MRE?

iron basalt Jun 7, 2025, 5:04 PM

#

Minimal reproducible example.

crimson jackal Jun 7, 2025, 5:04 PM

#

Oh okok

#

Yea. Ok, one minute.

verbal oar Jun 7, 2025, 5:05 PM

#

so its sparse

iron basalt Jun 7, 2025, 5:05 PM

#

verbal oar so its sparse

Looks like about 50% sparsity.

verbal oar Jun 7, 2025, 5:06 PM

#

yes right

iron basalt Jun 7, 2025, 5:06 PM

#

So in between, but still worth storing in sparse form. Although this is about symbolic computation, so things are a bit different in how you do things.

#

My guess is that sympy stores its structures in a pretty slow way compared to Mathematica, but this probably does not explain the massive difference.

jaunty helm Jun 7, 2025, 5:15 PM

#

symengine is like sympy but in c++, which is why I suggested it

iron basalt Jun 7, 2025, 5:16 PM

#

crimson jackal This is 12 by 12 but its basically to remove one row and one column.

Something quick to try in Sympy is cse (common subexpression). It can be used to optimize by having it replace all the common subexpressions with symbols so the simplifier does not chug through a ton of unneeded work.

#

Note that anything above 3x3 is going to start getting really bad for symbolic computation on this task, especially if the engine is not making use of it being sparse.

crimson jackal Jun 7, 2025, 5:17 PM

#

import sympy as sp

# Define symbols
F, G, A, B, epsilon, mu = sp.symbols('F G A B epsilon mu')

# Define the matrix
M = sp.Matrix([
    [0, 0, F, 0, 0, 0, 0],
    [0, 0, G, 0, 0, 0, 0],
    [F, G, epsilon, 0, A, 0, B],
    [0, 0, 0, 0, 0, mu, 0],
    [0, 0, A, 0, 0, 0, mu],
    [0, 0, 0, mu, 0, 0, 0],
    [0, 0, B, 0, mu, 0, 0]
])

# Compute symbolic eigenvectors
eigen_data = M.eigenvects()

#

Even doing this is already very bad...

iron basalt Jun 7, 2025, 5:18 PM

#

!code

arctic wedgeBOT Jun 7, 2025, 5:18 PM

#

Formatting code on Discord

Here's how to format Python code on Discord:

```py
print('Hello world!')
```

These are backticks, not quotes. Check this out if you can't find the backtick key.

For long code samples, you can use our pastebin.

crimson jackal Jun 7, 2025, 5:18 PM

#

So yea. I may try first to use symengyne.

crimson jackal Jun 7, 2025, 7:02 PM

#

I cant find the documentation of the functions of symengine.

lilac aspen Jun 7, 2025, 7:12 PM

#

crimson jackal I cant find the documentation of the functions of symengine.

Look in to specialized methods for "sparse matrices". They can be loads faster by skipping all the zeros.

crimson jackal Jun 7, 2025, 7:13 PM

#

lilac aspen Look in to specialized methods for "sparse matrices". They can be loads faster ...

That works if i want it symbolically?

lilac aspen Jun 7, 2025, 7:15 PM

#

I don't know. Like I said, look into it.

But the idea is at least 40 years old. Someone's probably made one that works for Computer Algebra Systems by now, as well as for numerical methods.

crystal pier Jun 7, 2025, 10:58 PM

#

👋🏾 anyone got a project they need help with?

surreal salmon Jun 8, 2025, 4:07 PM

#

Hi