#data-science-and-ml | Python | Page 184

iron basalt May 3, 2026, 2:43 AM

#

(And there is still a lot of things that can be done with that simple ML (it's not all just LLMs and video generation, etc, lots of practical problems to be solved with simple ML))

primal hemlock May 3, 2026, 2:46 AM

#

Could liquid cooling work?

iron basalt May 3, 2026, 2:47 AM

#

(Even if you are doing ML research btw, see LeCun right now for example, you start small and scale up later, make sure the idea works first in small)

iron basalt May 3, 2026, 2:47 AM

#

primal hemlock Could liquid cooling work?

Where does the heat go?

#

Out of the PC, so it keeps running, but it needs to go somewhere.

#

Think about it as moving around the heat.

#

Ideally out of your house.

primal hemlock May 3, 2026, 2:48 AM

#

iron basalt Where does the heat go?

Ice bucket lol

#

Strip down an old fridge and use it

iron basalt May 3, 2026, 2:49 AM

#

Keep replacing the bucket and that can work, if you are up for that 24/7.

#

But yeah, datacenters have this big problem, so they act like one giant PC that is the size of the whole building, and pump out the heat.

#

Liquid cooling.

#

(Locals hate it)

#

You can convert your home into a mini datacenter, if you really don't mind price.

primal hemlock May 3, 2026, 2:51 AM

#

Heat pipe to the neighbors home, problem solved

iron basalt May 3, 2026, 2:51 AM

#

Open their window, pipe from your window, they surely won't notice /s.

primal hemlock May 3, 2026, 2:52 AM

#

Paint it camo

#

“This is not a heat pipe”

#

Bootlegged magnetocaloric effect fridge

primal hemlock May 3, 2026, 4:55 AM

#

Where would you suggest I start with the beginner stuff?

versed pilot May 3, 2026, 10:13 AM

#

For a total ML beginner start with sklearn. Linear regression etc. Maybe XGBoost once you hit the limits of what you can do with sklearn?

grim storm May 3, 2026, 6:35 PM

#

Hello guys anyone worked with anomaly detection on Agriculture sensors ?

warm dune May 4, 2026, 3:18 AM

#

grim storm Hello guys anyone worked with anomaly detection on Agriculture sensors ?

could you explain more? I was genuinely curious

serene scaffold May 4, 2026, 3:21 AM

#

grim storm Hello guys anyone worked with anomaly detection on Agriculture sensors ?

that's really niche. chances are that no one has, but you can probably still get help if you ask your actual question.

warm dune May 4, 2026, 3:27 AM

#

serene scaffold that's really niche. chances are that no one has, but you can probably still get...

wassup bro

serene scaffold May 4, 2026, 3:27 AM

#

I'm just fabulous. what do you think about data science and ML?

warm dune May 4, 2026, 3:28 AM

#

serene scaffold I'm just fabulous. what do you think about data science and ML?

remember we were talking about NLP a few days late?

serene scaffold May 4, 2026, 3:28 AM

#

sure, what about it?

warm dune May 4, 2026, 3:28 AM

#

serene scaffold sure, what about it?

I took some time to study and I think I finally understood embedding and the architecture of transformers

#

like i genuinely could explain to my mother and and she understood

#

now i will to check a little RAG and LangChain

#

just to see what is it

half pulsar May 4, 2026, 3:29 AM

#

grim storm Hello guys anyone worked with anomaly detection on Agriculture sensors ?

Hydroponics related or something?

half pulsar May 4, 2026, 3:30 AM

#

warm dune just to see what is it

Good place to be looking

warm dune May 4, 2026, 3:31 AM

#

half pulsar Good place to be looking

i've heard of RAG, is it too important for ml?

#

langchain i have no ideia for what is it

half pulsar May 4, 2026, 3:35 AM

#

warm dune langchain i have no ideia for what is it

LangChain is a good example of the direction that AI is starting to head in simply so it'd be good for you to learn about, Its just Agentic AI

warm dune May 4, 2026, 3:43 AM

#

half pulsar LangChain is a good example of the direction that AI is starting to head in simp...

so, no math here

#

thank god

serene scaffold May 4, 2026, 3:46 AM

#

warm dune i've heard of RAG, is it too important for ml?

When you do RAG, all the machine learning has already been done

#

The same is true when you do agentic development

#

That's why #agents-and-llms is a separate channel

warm dune May 4, 2026, 3:49 AM

#

serene scaffold When you do RAG, all the machine learning has already been done

just a question

serene scaffold May 4, 2026, 3:49 AM

#

What question?

warm dune May 4, 2026, 3:49 AM

#

transformers will learn the model how to read

#

and RAG will give the context

#

its like that?

serene scaffold May 4, 2026, 3:51 AM

#

The reason we do RAG is because we can't trust LLMs to function as knowledge stores. They tend to form coherent sentences that make sense but are just false.

#

RAG is just the idea of looking up potentially relevant text from a knowledge store, and then putting that text after the user's question, and then letting the LLM generate text from there.

warm dune May 4, 2026, 3:54 AM

#

serene scaffold The reason we do RAG is because we can't trust LLMs to function as knowledge sto...

yes, that's cuz we can have specific contexts, like some game released 1 week late and if the model receives all the data it can from all contexts it would take years to train then we make the model learn to read and write Oh, the rag goes there and gives the context, rules and specific situations, right?

#

i think I understand

serene scaffold May 4, 2026, 3:58 AM

#

warm dune yes, that's cuz we can have specific contexts, like some game released 1 week la...

You can't really train an LLM to understand specific facts, because they just get lost in the sea of all the other text they were trained on.

#

But you can trust them to synthesize information that's immediately available to them

warm dune May 4, 2026, 3:59 AM

#

serene scaffold But you can trust them to synthesize information that's immediately available to...

got it, i'll see more later

#

thks pope

serene scaffold May 4, 2026, 4:00 AM

#

I absolve thee

dull flicker May 4, 2026, 6:32 AM

#

@versed pilot @grim storm my dms are open!

livid oasis May 4, 2026, 7:39 AM

#

i am just curious as a beginner, the libraries like numpy and pandas, how they're used in later on stages of machine learning !!

#

or the 80/20 rule applies here?

jaunty helm May 4, 2026, 8:48 AM

#

livid oasis i am just curious as a beginner, the libraries like numpy and pandas, how they'r...

to do data processing
most of the time you spend in ml is the data processing while 'fancy modeling' takes surprisingly little

livid oasis May 4, 2026, 8:57 AM

#

jaunty helm to do data processing most of the time you spend in ml is the data processing wh...

ohh okayy

#

like data cleaning and pre-processing

warm dune May 4, 2026, 6:13 PM

#

Guys, just a review question, neural networks are feature extractors

That is, the weights of neurons are, in part, vectors that simulate characteristics (after training, with the weights adjusted)

And through the dot product, we can see the similarity of the neuron (which carries a feature) and our input vector (the data) so if that vector has the features our dot product will send an 'intensity' to the next layer

That the next layer will do the same feature simulation, and now it will be kind of a 'feature of the feature', until it reaches the exit layer

And with each layer pass, the result of the 'intensity' that will be passed as an input vector, so we can extract the 'characteristic from the characteristic' and also modify the space, since this intensity ends up becoming the coordinates for a new space, so to speak

serene scaffold May 4, 2026, 6:19 PM

#

warm dune Guys, just a review question, neural networks are feature extractors That is, t...

this sounds essentially correct to me.

#

though it's not really as simple as "this layer identifies one of the features". the feature extraction is something that emerges from the whole network.

warm dune May 4, 2026, 6:22 PM

#

serene scaffold though it's not really as simple as "this layer identifies one of the features"....

sure, thk

stuck swallow May 5, 2026, 10:10 AM

#

mild dirge May 5, 2026, 10:30 AM

#

stuck swallow

almost

unreal condor May 5, 2026, 1:08 PM

#

stuck swallow

When I tweaked a random hyper-parameter in my model:

warm dune May 5, 2026, 3:22 PM

#

guys in the context of transformers, whats the main difference between heads and blocks?

unreal condor May 5, 2026, 3:36 PM

#

warm dune guys in the context of transformers, whats the main difference between heads and...

blocks are just a bunch of layers grouped together, they are not unique to transformers.

#

Head is a special block computed with 3 special matrices Key, Query, Value (divided into 3 from the result of the previous layer) I think these 3 are inspired by the concept of information retrieval. And if you compute this block multiple times in parallel you have multi-head attentions

#

Also, I have given up ML long ago so pls fact check : )

warm dune May 5, 2026, 4:06 PM

#

unreal condor Also, I have given up ML long ago so pls fact check : )

sure, i'll

tawdry heart May 5, 2026, 6:17 PM

#

@warm dune there's a 3b1b vid on transformers which is pr good if u haven't seen it

warm dune May 5, 2026, 6:32 PM

#

tawdry heart <@980601465645199430> there's a 3b1b vid on transformers which is pr good if u h...

going to watch

frigid niche May 5, 2026, 8:41 PM

#

I am currently working on a Language Model that runs on the TI 84 Plus CE. It is 200k parameters! It uses syllables as a tokenization system. I have it running on the actual hardware, but did testing with an emulator first. I should have all of the documentation ready in a few days or so, but I was really excited to share a sneak peek!

iron basalt May 6, 2026, 6:07 AM

#

primal hemlock Where would you suggest I start with the beginner stuff?

https://www.youtube.com/watch?v=C1lhuz6pZC0&list=PLUl4u3cNGP619EG1wp0kT-7rDE_Az5TNd&index=2

YouTube

MIT OpenCourseWare

1. Introduction, Optimization Problems (MIT 6.0002 Intro to Computa...

MIT 6.0002 Introduction to Computational Thinking and Data Science, Fall 2016
View the complete course: http://ocw.mit.edu/6-0002F16
Instructor: John Guttag

Prof. Guttag provides an overview of the course and discusses how we use computational models to understand the world in which we live, in particular he discusses the knapsack problem and g...

▶ Play video

#

https://www.youtube.com/watch?v=Gv9_4yMHFhI

YouTube

StatQuest with Josh Starmer

A Gentle Introduction to Machine Learning

Machine Learning is one of those things that is chock full of hype and confusion terminology. In this StatQuest, we cut through all of that to get at the most basic ideas that make a foundation for the whole thing. These ideas are simple and easy to understand. After watching this StatQuest, you'll be ready to learn all kinds of new and exciting...

▶ Play video

#

After which I would find some resource on deep learning, and after that, pick some topic, such as computer vision. But it's really important to get these foundations covered from resources like that MIT course I linked. Without them you won't really know what it's all based on, and how to correctly evaluate various methods (and how not to do statistics (many ways to mess it up)).

#

Small note on the MIT course, they use outdated Python libraries, specifically PyLab, use matplotlib.pyplot to plot things instead and other replacements for things they do.

warm dune May 6, 2026, 5:30 PM

#

Guys, in the context of fine tuning the model (LLM), such as specializing in a subject, transforming it into a chatbot and more, have a place where I can explore that?

#

a video, article or anything

serene scaffold May 6, 2026, 6:04 PM

#

warm dune Guys, in the context of fine tuning the model (LLM), such as specializing in a s...

you can look into how it's done conceptually, but fine-tuning also requires a lot of data and VRAM, so it might still not be feasible.
this tutorial looks about right to me: https://huggingface.co/blog/dvgodoy/fine-tuning-llm-hugging-face
it's over a year old, but the consensus in the AI community is that fine-tuning LLMs is a waste of time.

warm dune May 6, 2026, 6:10 PM

#

serene scaffold you can look into how it's done conceptually, but fine-tuning also requires a lo...

I saw a video of a guy saying that

Models like GPT and more, are trained with text corns (pre-training)

And in this context, he doesn't know how to respond like an assistant, he would just complete the sentences. Then Fine Tuning would emerge

The Fine Tunnig in the video, the person explained that we would 'train' the model, but only some weights, and gave an example of LORA, who would then make the model respond like an assistant

I was trying to talk about it, I don't know if I used the wrong terms

agile cobalt May 6, 2026, 6:16 PM

#

warm dune I saw a video of a guy saying that 1. Models like GPT and more, are trained wit...

both things are called fine tuning,

going from a pre-trained model into a instruct tuned chat model
further tuning an already instruct-tuned chat model to follow some specific formatting/guidelines
but are entirely different beasts, the format requiring orders of magnitude more data and compute than the later

all major chatbot models like chatgpt, gemini, deepseek, qwen etc. go through some pre-training and fine-tuning, but there is relatively little to gain from further fine tuning models afterwards unless you have some very specific use case

warm dune May 6, 2026, 6:24 PM

#

agile cobalt both things are called fine tuning, - going from a pre-trained model into a inst...

I was watching Kaparthy's video where he creates NanoGPT, and the predictions were based on the text itself. Then I started thinking: "If I use a dataset like Shakespeare's, it won't respond like a chatbot." So I looked into it and discovered fine-tuning LLMs. That I can take a ready-made model like GPT2 and transform it into whatever I want. Since it's already trained to recognize context and more. I saw a comment on Twitter saying that this was the industry standard.

They train the model to learn context from the entire internet -> They do fine-tuning so it acts like a chatbot.

But you said there's little gain, so how would it work to have a greater gain?

#

like to transform a model to a chatbot

agile cobalt May 6, 2026, 6:52 PM

#

warm dune I was watching Kaparthy's video where he creates NanoGPT, and the predictions we...

the open source models labs publish on huggingface and such already do everything we know of that leads to a greater gain

there is little (if anything) to improve that would lead to better general purpose usage, most fine tunes are either trying to remove censorship, improve the performance for niche scenarios at the cost of general performance, or aim for some ultra specific task

warm dune May 6, 2026, 7:13 PM

#

agile cobalt the open source models labs publish on huggingface and such already do everythin...

i search a little more and discover

fine tunning (stf) and alignment (rlfh)

which one its used for transform a model like LLAMA into a chatbot?

agile cobalt May 6, 2026, 7:35 PM

#

both
see the description of https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct for example, preferably also look into the actual papers and technical reports

Model Architecture: Llama 3.1 is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.
(in that release series, -Instruct is the chatbot, https://huggingface.co/meta-llama/Llama-3.1-8B is the base model ; some others invert it by adding a -Base suffix to the base model and no suffix to the instruct model)

frigid niche May 7, 2026, 7:32 AM

#

Hello there everyone! I have recently completed a syllable-level autoregressive language model that runs entirely on a TI-84 Plus CE calculator! It generates original English prose and poetry from a seed phrase, doing all inference on-device with no external hardware! The architecture is something I am really proud of. Rather than working at the word or character level, the model tokenizes language into its phonetic syllable components, onset, nucleus, coda, stress, and word boundary, and predicts each one through six separate factored output heads. The hidden layer is 198 neurons split into two 99-neuron chunks to fit the TI-84's matrix constraints, with 21-dimensional embeddings per component and a context window of 10 syllables. There is also a 16-dimensional discourse state and an 8-dimensional word state that carry meaning across the generation, giving it a sense of narrative continuity! The full input dimension ends up at 874. The biggest challenge was getting inference to run at all on 154KB of RAM. I precompute the token-context H1 contributions ahead of time so the calculator only has to add vectors instead of multiplying full matrices at runtime, and the output weights are repacked in column-major order for a further speedup. Even with all of that, a full generation run takes about 2.5 to 3 hours on the calculator. You also have to keep an eye on it and confirm garbage collection prompts periodically, which I find adds a certain charm to the experience!

I hope that others will find joy, intrigue, or inspiration from this project. If anyone checks it out, please let me know what you think!

https://github.com/exploratorystudios/TILM2

GitHub

GitHub - exploratorystudios/TILM2: Syllable-level autoregressive la...

Syllable-level autoregressive language model that runs on a TI-84 Plus CE calculator. Full pipeline: corpus generation, training, PC inference, and export to native TI variable files. Generates poe...

frigid niche May 7, 2026, 7:50 AM

#

The constraint does not limit the work. It becomes the work.

wintry brook May 7, 2026, 7:54 AM

#

Is there any docs to study ds & ml from ?

grand minnow May 7, 2026, 8:40 AM

#

wintry brook Is there any docs to study ds & ml from ?

Lots in the pinned messages.

wintry brook May 7, 2026, 8:56 AM

#

grand minnow Lots in the pinned messages.

Ohh thanks

limpid zenith May 7, 2026, 11:04 AM

#

frigid niche Hello there everyone! I have recently completed a syllable-level autoregressive ...

you could share in #1468524576479641744

half pulsar May 7, 2026, 11:44 AM

#

frigid niche Hello there everyone! I have recently completed a syllable-level autoregressive ...

Nice project man

devout osprey May 7, 2026, 1:27 PM

#

any resource for pandas aside docs?

#

more of a practical way to learn ?

grand minnow May 7, 2026, 1:46 PM

#

devout osprey more of a practical way to learn ?

Here's a short one https://www.kaggle.com/learn/pandas

Learn Pandas Tutorials

Solve short hands-on challenges to perfect your data manipulation skills.

devout osprey May 7, 2026, 1:47 PM

#

grand minnow Here's a short one https://www.kaggle.com/learn/pandas

Thanks

oak veldt May 7, 2026, 9:09 PM

#

devout osprey any resource for pandas aside docs?

"Python for Data Analysis" by Wes Mckinney. Really good book for Pandas.

gloomy fractal May 8, 2026, 11:04 AM

#

#

i didnt get how the tutor did the last step

#

@\

#

serene scaffold May 8, 2026, 1:03 PM

#

I don't see where x_2 is ever defined? am I blind, @gloomy fractal?

gloomy fractal May 8, 2026, 1:04 PM

#

you could see the 2nd image

gloomy fractal May 8, 2026, 1:05 PM

#

serene scaffold I don't see where x_2 is ever defined? am I blind, <@1194198498451656745>?

but hey, i learned the other way solution

#

of last step

warm dune May 8, 2026, 3:32 PM

#

guys, is there any lib for RAG, or something like that? what is the standard of the industry?

trim dock May 8, 2026, 4:51 PM

#

https://discord.com/channels/267624335836053506/1502347032940118016

trim dock May 8, 2026, 4:52 PM

#

trim dock https://discord.com/channels/267624335836053506/1502347032940118016

Any help?

unreal condor May 8, 2026, 5:49 PM

#

trim dock Any help?

it's locked, also maybe you dataset is imbalance I think

#

you probably have more animal_migration instances in your dataset

trim dock May 8, 2026, 5:56 PM

#

unreal condor you probably have more `animal_migration` instances in your dataset

All are 5000 images

#

I checked mutiple times

unreal condor May 8, 2026, 5:57 PM

#

trim dock All are 5000 images

have you tried other metrics when training ?

#

apart from acc, like Precision, Recall, F1?

trim dock May 8, 2026, 5:59 PM

#

unreal condor have you tried other metrics when training ?

Uhm... no i just know the very basics of this and am just a hobbiysts, but i would try the ones that you have mentioned also i will re-re-check the image distribution you mentioned

Will see these when i receive the free credit things on colab

And i have re-opened the thread
Replies could be late as i am studying!

unreal condor May 8, 2026, 6:02 PM

#

Accuracy is probably the least trustworthy metric tbh

trim dock May 8, 2026, 6:02 PM

#

unreal condor Accuracy is probably the least trustworthy metric tbh

Okay that kinda made sense to me so i went with it

#

Can i ping you here or in threads when i work on it again on colab?

unreal condor May 8, 2026, 6:03 PM

#

whatever works tbh

#

also disclaimer, my ML knowledge is kinda rusty since I have given up ML for a while

#

also, did you test your model with custom inputs?

trim dock May 8, 2026, 6:05 PM

#

unreal condor also disclaimer, my ML knowledge is kinda rusty since I have given up ML for a w...

Would still be infinitely better than mine xD

trim dock May 8, 2026, 6:05 PM

#

unreal condor also, did you test your model with custom inputs?

Yeah, I actually did that its mentioned in the description

#

It was working whenever i gave it a photo from test set but its fails miserably when i doodle myself it 99% misclasifies it as animal_migration no matter what i doodle

unreal condor May 8, 2026, 6:07 PM

#

trim dock Yeah, I actually did that its mentioned in the description

from what I saw, the animal_migration class doesn't have any clear patterns

trim dock May 8, 2026, 6:07 PM

#

Was saying this animal migration 😂

trim dock May 8, 2026, 6:07 PM

#

unreal condor from what I saw, the `animal_migration` class doesn't have any clear patterns

Yeah, its just random vertical line birds you draw in a group

unreal condor May 8, 2026, 6:07 PM

#

unreal condor from what I saw, the `animal_migration` class doesn't have any clear patterns

so random stuffs could be it

unreal condor May 8, 2026, 6:08 PM

#

trim dock Was saying this animal migration 😂

have you re-processed this?

trim dock May 8, 2026, 6:08 PM

#

unreal condor so random stuffs could be it

That clock and many others i doodled were actually pretty good

trim dock May 8, 2026, 6:08 PM

#

unreal condor have you re-processed this?

Yeah i pass it the processed image this is the original

unreal condor May 8, 2026, 6:08 PM

#

trim dock Yeah i pass it the processed image this is the original

can I see the result?

trim dock May 8, 2026, 6:19 PM

#

unreal condor can I see the result?

Yeah, it definitely aint supposed to look like this xD

This was an ant

unreal condor May 8, 2026, 6:21 PM

#

trim dock Yeah, it definitely aint supposed to look like this xD This was an ant

ye now you probably know why it's a animal_migration lol

trim dock May 8, 2026, 6:21 PM

#

I do now understand why it says it is animal_migration

trim dock May 8, 2026, 6:21 PM

#

unreal condor ye now you probably know why it's a `animal_migration` lol

Yeah

trim dock May 8, 2026, 6:22 PM

#

unreal condor ye now you probably know why it's a `animal_migration` lol

How do i fix this?

#

Like i made digit_recognizer once its processed image (28x28) didnt looked like this they still preserved info

unreal condor May 8, 2026, 6:23 PM

#

I think your input is also kinda wrong

#

the object should be white and the background black

unreal condor May 8, 2026, 6:24 PM

#

trim dock Yeah, it definitely aint supposed to look like this xD This was an ant

this isn't the same as your data

#

your data instances are more pixelated too

unreal condor May 8, 2026, 6:25 PM

#

trim dock Like i made digit_recognizer once its processed image (28x28) didnt looked like ...

I have no idea?

trim dock May 8, 2026, 6:25 PM

#

unreal condor your data instances are more pixelated too

Yeah, that i meant too

unreal condor May 8, 2026, 6:26 PM

#

This is one of the toy datasets that has like no application in the real world tbh so your custom inputs will have a hard time to fit in the model :/

trim dock May 8, 2026, 6:27 PM

#

See this is a preproccsed image from my the digit recigniset

trim dock May 8, 2026, 6:27 PM

#

unreal condor This is one of the toy datasets that has like no application in the real world t...

I am trying to make something from this please i dont wanna tell that to anyone

unreal condor May 8, 2026, 6:27 PM

#

could opencv2 do your thing?

trim dock May 8, 2026, 6:27 PM

#

trim dock See this is a preproccsed image from my the digit recigniset

See it retains the info

unreal condor May 8, 2026, 6:28 PM

#

trim dock See this is a preproccsed image from my the digit recigniset

how did you do this?

trim dock May 8, 2026, 6:28 PM

#

unreal condor could opencv2 do your thing?

Probably no it doesnt work natively on android phone

unreal condor May 8, 2026, 6:28 PM

#

trim dock Probably no it doesnt work natively on android phone

wait, you use ur phone to reprocess the images?

trim dock May 8, 2026, 6:29 PM

#

unreal condor how did you do this?

I used the PIL module in the later one right now i am using tf.keras.utils module i think i should use the PIL

trim dock May 8, 2026, 6:30 PM

#

unreal condor wait, you use ur phone to reprocess the images?

Yeah, i program using my phone thats why i have to wait till colab allows me to use their gpu for free xD

unreal condor May 8, 2026, 6:31 PM

#

no? all of this preproccess stuffs don't need GPU?

trim dock May 8, 2026, 6:31 PM

#

unreal condor no? all of this preproccess stuffs don't need GPU?

Yeah it doesnt but training the model sure does

unreal condor May 8, 2026, 6:31 PM

#

use it to preprocess data then

trim dock May 8, 2026, 6:31 PM

#

Tho i trained the digit_recogniser on phone since it was small but not this sht

unreal condor May 8, 2026, 6:32 PM

#

there is also kaggle for free GPU

trim dock May 8, 2026, 6:32 PM

#

unreal condor use it to preprocess data then

I use it to pre-process data when use it doodle using pygame

trim dock May 8, 2026, 6:33 PM

#

unreal condor there is also kaggle for free GPU

Thats nice but i dont wanna go and like double the code base that i gotta handle

unreal condor May 8, 2026, 6:33 PM

#

trim dock Thats nice but i dont wanna go and like double the code base that i gotta handle

use github ?

#

idk, I haven't tried preproccess images like this before

#

but Opencv2 is literally for working with images so you should check it out if can

trim dock May 8, 2026, 6:35 PM

#

unreal condor but Opencv2 is literally for working with images so you should check it out if c...

Hm.... yeah that wont work on android for some reason

#

Thank you for pointing this out tho as i had forgotten about this i will go re-write the preprocess code using PIL since it worked last time

trim dock May 8, 2026, 7:07 PM

#

@unreal condor using PIL to pre-process data worked like a charm its atleast classifying correctly however when it doesnt it throws it in animal_migration category which tbh is frustating but hey one step closer.
Thank you greg!

dense kite May 9, 2026, 6:14 PM

#

I have a Query?

When humans see something, we immediately build mental stories and simulate possible futures. Current AI models generate predictions based on patterns in data, but do not seem to have internal simulation or understanding.

Do you think large neural networks are developing a form of internal world-model or imagination-like process, where they can simulate future outcomes beyond pattern completion? Or is this still fundamentally different from human cognition?

serene scaffold May 9, 2026, 6:40 PM

#

dense kite I have a Query? When humans see something, we immediately build mental stories ...

you're right that current models don't emulate consciousness. Neural networks by themselves can't spontaneously start emulating consciousness, but you could use them in a system that does.
whether this is different from human cognition is a philosophical question that I'm not sure will ever be answered.

abstract wasp May 9, 2026, 6:46 PM

#

Hi do u guys use cursor ide and uv for building ai agents? I’m new to agents, is it better than just using vscode?

serene scaffold May 9, 2026, 6:48 PM

#

abstract wasp Hi do u guys use cursor ide and uv for building ai agents? I’m new to agents, is...

you're looking for #agents-and-llms

iron basalt May 9, 2026, 8:04 PM

#

dense kite I have a Query? When humans see something, we immediately build mental stories ...

World models are a thing in ML, they do already exist, but it's about how to do that well. It's part of the whole. Humans have many different subsystems for specific things, a general system wrapped around all of that (literally), and a meta system (which may or may not be called "consciousness" depending on who you ask / how you define that). What we have with a lot of things right now in AI/ML is basically taking one part of one of those systems, making a very crude approximation or just loosely inspired by it and scaling that up really big. But another big thing is just the high level design goals of these things. Humans for example will do things without being prompted, they will say "IDK" instead of always giving an answer with confidence, they are aligned (to varying degrees) with other humans in terms of goals and "taste," they don't need to be trained all ahead of time on a massive dataset (they learn "online"), they have a meta algorithm applied to the whole population (evolution), etc. A lot of things in AI/ML just don't even have these design goals, they are meant to do some specific job or set of jobs. Very different from a thing that just exists/survives and does stuff on its own (lots of interacting parts / goals). Human cognition involves this dance of all these systems interacting (this is not including the rest of the body which is also part of it all (and also social, etc)).

iron basalt May 9, 2026, 8:51 PM

#

dense kite I have a Query? When humans see something, we immediately build mental stories ...

https://arxiv.org/pdf/1803.10122 Might interest you.

warm dune May 9, 2026, 10:42 PM

#

someone knows a good article for model monitoring?

limpid zenith May 10, 2026, 7:22 AM

#

dense kite I have a Query? When humans see something, we immediately build mental stories ...

there is a huge body of evidence that suggests that there is no world model in LLMs, they're the Myhill-Nerode theorem and similar results show case this

https://arxiv.org/abs/2406.03689v1

arXiv.org

Evaluating the World Model Implicit in a Generative Model

Recent work suggests that large language models may implicitly learn world models. How should we assess this possibility? We formalize this question for the case where the underlying reality is governed by a deterministic finite automaton. This includes problems as diverse as simple logical reasoning, geographic navigation, game-playing, and che...

devout osprey May 10, 2026, 7:31 AM

#

hey i have done with python , numpy and pandas , might look into matplotlib and seaborn later ,
can anyone tell me good resources to go learning ml/dl , and mathmatics required for ml/dl.
??

grand minnow May 10, 2026, 7:34 AM

#

devout osprey hey i have done with python , numpy and pandas , might look into matplotlib and ...

There are short ones on http://kaggle.com/learn

Learn Python, Data Viz, Pandas & More | Tutorials | Kaggle

Practical data skills you can apply immediately: that's what you'll learn in these no-cost courses. They're the fastest (and most fun) way to become a data scientist or improve your current skills.

devout osprey May 10, 2026, 7:36 AM

#

grand minnow There are short ones on http://kaggle.com/learn

thanks, anything other than this , what about maths?

grand minnow May 10, 2026, 7:36 AM

#

devout osprey thanks, anything other than this , what about maths?

Check the pinned messages

devout osprey May 10, 2026, 7:59 AM

#

grand minnow Check the pinned messages

okay , thanks

gloomy fractal May 10, 2026, 3:14 PM

#

anyone active?

serene scaffold May 10, 2026, 3:34 PM

#

gloomy fractal anyone active?

Just ask your question and people can look at it when they visit the channel.

gloomy fractal May 10, 2026, 3:37 PM

#

hi @serene scaffold

#

can i create all linalg concepts from scratch

#

is it a good idea?

gloomy fractal May 10, 2026, 3:38 PM

#

gloomy fractal can i create all linalg concepts from scratch

by writing in python

#

as codes

serene scaffold May 10, 2026, 3:39 PM

#

don't ping people to say hi before you say the thing that you actually want them to read and respond to. that's like calling someone on the phone and then immediately putting them on hold, and is rude

#

you can implement linalg algorithms in python, yes. you'll get worse performance than if you had used numpy.

gloomy fractal May 10, 2026, 3:39 PM

#

oh....mb

gloomy fractal May 10, 2026, 3:40 PM

#

serene scaffold you can implement linalg algorithms in python, yes. you'll get worse performance...

you'll get worse performance than if you had used numpy.
wizard — 21:09

#

?

serene scaffold May 10, 2026, 3:40 PM

#

what about that statement do you find confusing?

gloomy fractal May 10, 2026, 3:41 PM

#

numpy is better for linalg? and doing the other way is worse?

serene scaffold May 10, 2026, 3:41 PM

#

numpy is implemented in C and can do atomic operations in parallel using CPU magic, so it scales much better than pure python.

gloomy fractal May 10, 2026, 3:41 PM

#

i want to build to learn..revision is boring, maybe building helps

serene scaffold May 10, 2026, 3:42 PM

#

but writing something from scratch is a great way to learn, so go ahead and do it in pure python if you think that will help.

gloomy fractal May 10, 2026, 3:42 PM

#

okay

foggy jay May 10, 2026, 5:01 PM

#

Hi everyone

#

I want to build ML projects any suggestions?

serene scaffold May 10, 2026, 5:20 PM

#

foggy jay I want to build ML projects any suggestions?

explore some tabular/CSV data and traing a basic classifier on it. you can use pandas to read and manipulate the data, and train a model from sklearn.

#

I don't expect you to know what all of that means, but you'll be able to figure it out.

grand tulip May 10, 2026, 9:06 PM

#

I have a research project involving the use of camera object detection and Id like to gain a solid understanding of OpenCV before (tool in research might not be OpenCV but at the end of the day they’re all similar) .What are the best ressources ?

left tartan May 10, 2026, 10:29 PM

#

grand tulip I have a research project involving the use of camera object detection and Id li...

Opencv has a good tutorial, I'd start there

#

https://docs.opencv.org/4.x/df/d65/tutorial_table_of_content_introduction.html

mellow spruce May 11, 2026, 3:25 AM

#

serene scaffold May 11, 2026, 3:42 AM

#

mellow spruce

care to elaborate?

half pulsar May 11, 2026, 6:20 AM

#

mellow spruce

I checked your Github and I’m not seeing AGI here. I’m seeing heavily branded LLM/tooling projects with AI-generated imagery and inflated claims.

If there’s real substance, explain it plainly. Otherwise call it an agent framework, not AGI.

gloomy fractal May 11, 2026, 3:50 PM

#

how often OOP is used in ML

#

and in data science

serene scaffold May 11, 2026, 3:54 PM

#

gloomy fractal how often OOP is used in ML

depends on what you think OOP is
you need to understand how classes work in Python to be able to do ML in Python

gloomy fractal May 11, 2026, 3:56 PM

#

serene scaffold depends on what you think OOP is you need to understand how classes work in Pyth...

do i need to go deep?
i mean my educator is teaching the OOP in depth

serene scaffold May 11, 2026, 3:56 PM

#

what are they teaching you about OOP that you feel is depthful?

gloomy fractal May 11, 2026, 3:58 PM

#

most of the dunder methods..,callables, using specific libs, descriptors, enumeration and many more....i covered only classes part in one month

gloomy fractal May 11, 2026, 3:58 PM

#

serene scaffold what are they teaching you about OOP that you feel is depthful?

^

serene scaffold May 11, 2026, 3:59 PM

#

a lot of ML libraries use dunder methods, so it's good to understand them

gloomy fractal May 11, 2026, 4:00 PM

#

so is it good?

serene scaffold May 11, 2026, 4:01 PM

#

yeah

warm dune May 11, 2026, 4:02 PM

#

gloomy fractal how often OOP is used in ML

https://github.com/karpathy/nanoGPT/blob/master/model.py the repo of NanoGPT by kaparthy, maybe help you to see how works

GitHub

nanoGPT/model.py at master · karpathy/nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs. - karpathy/nanoGPT

#

I don't think it gets much more than that about OOP

pseudo sundial May 12, 2026, 11:00 AM

#

Hi,I have 5 years in game industry as an animator is it possible to switch careers to the data field (data engineer or data analyst) at age 27?

half pulsar May 12, 2026, 11:30 AM

#

pseudo sundial Hi,I have 5 years in game industry as an animator is it possible to switch caree...

It is never too late! You can switch anytime you feel like it, just as long as you're happy pursue it!

pseudo sundial May 12, 2026, 11:30 AM

#

half pulsar It is never too late! You can switch anytime you feel like it, just as long as y...

Ayyy thankyouu for your answer 👋👋😄

#

Yeahh I really want to change career into the data fields. So far I join online course @half pulsar hopefully works well 😄😄

half pulsar May 12, 2026, 4:01 PM

#

pseudo sundial Yeahh I really want to change career into the data fields. So far I join online ...

I hope it works out well for you too, Good luck!

ashen echo May 12, 2026, 4:38 PM

#

looking to use PandasAI to do some data analysis, any suggestion for which underlying LLM i should use outside of openAI. I am mostly doing some transforming and analysis of excel files?

true pollen May 12, 2026, 11:52 PM

#

pseudo sundial Hi,I have 5 years in game industry as an animator is it possible to switch caree...

it seems like we're in a same boat, except I'm looking to transition from QA automation engineering

fading wigeon May 13, 2026, 3:33 AM

#

ashen echo looking to use PandasAI to do some data analysis, any suggestion for which under...

Why are you using an LLM for data analysis?

royal raven May 13, 2026, 9:31 AM

#

anyone here tried doing a sentiment analysis for book reviews?

ashen echo May 13, 2026, 11:20 AM

#

fading wigeon Why are you using an LLM for data analysis?

Pandas, is the defacto, but I feel like automating some of my weekly analysis on certain excels I pull down from out crm system, could have certain conclusions automatically, and get a second angle to look at my data sometimes. I think the Pandas AI could help with that. Unless you have another suggestion?

mossy blaze May 13, 2026, 12:32 PM

#

I've made progress on my neuro-symbolic hybrid AI project. My latest work is available here: https://github.com/Julien-Livet/aicpp/tree/dsl_engine

GitHub

GitHub - Julien-Livet/aicpp at dsl_engine

Artificial intelligence with a network of connected neurons - Julien-Livet/aicpp

half pulsar May 13, 2026, 1:11 PM

#

mossy blaze I've made progress on my neuro-symbolic hybrid AI project. My latest work is ava...

I really like how clean and grounded this work is. I especially respect that it reports concrete benchmark results and openly states current limitations instead of overselling the system. The eval results are still limited, but that honesty makes the project feel more credible. The architecture is understandable, testable, and built around a clean separation between LLM-guided proposal and deterministic symbolic verification. Very nice work overall consider me impressed!

fading wigeon May 13, 2026, 1:57 PM

#

ashen echo Pandas, is the defacto, but I feel like automating some of my weekly analysis on...

I guess if you're looking for surface insights maybe it's fine, although imo the hallucination rate is too high for comfort and maybe the pandas AI has reached a point where it will run those analyses for you as opposed to just returning an answer.

brisk lantern May 13, 2026, 2:16 PM

#

what would be a good free platform for building a chatbot for a uni assignment?
we are planning to build an expert system that basically functions as a sorta knowledge base that allows users to ask basic questions and learn more about a specific topic. Overall it is not going to be a very complex system.

warm dune May 13, 2026, 2:36 PM

#

royal raven anyone here tried doing a sentiment analysis for book reviews?

for this specifically no, but I could help maybe

warm dune May 13, 2026, 3:45 PM

#

does anyone know of a good, up-to-date article about model monitoring?

jaunty helm May 14, 2026, 7:29 AM

#

brisk lantern what would be a good free platform for building a chatbot for a uni assignment? ...

"expert system" as in this which has a very specific definition?
or just rag

brisk lantern May 14, 2026, 7:38 AM

#

yep it matches that definition of an expert system

jaunty helm May 14, 2026, 7:44 AM

#

brisk lantern yep it matches that definition of an expert system

streamlit can probably work to quickly build the ui

brisk lantern May 14, 2026, 7:45 AM

#

i see, ill check that out

#

thank you

jaunty helm May 14, 2026, 7:45 AM

#

you can deploy it to a website for free p easily

sharp apex May 14, 2026, 3:36 PM

#

im fresh out of high school and i want to get into data science
is there a roadmap for this field?

#

ive seen people saying python-> SQL -> apache airflow for data science
i know some other PLs so learning python shouldnt be hard, ive learned a thing or two about SQL as well but idk anything abt apache airflow

#

sorry if this is the wrong channel to ask questions

heavy crow May 15, 2026, 6:47 AM

#

I have a question on object detection transformer architectures. Standard softmax attention artificially dilutes attention across multiple objects and forces unnatural focus onto empty backgrounds. E.g if the image doesnt contain any objects it still has to attend somewhere! And if there are many objects or one object is made up of two patches that are far away from each other, it has to split its attention across them. Wouldn't an independent, per-token sigmoid activation fix this by allowing the model to flexibly attend to multiple targets simultaneously or completely ignore the background?

heavy crow May 15, 2026, 7:09 AM

#

Here a plot to visualize, with what i belive happens with softmax on the left and what i would think would happen with sigmoid on the right

AEir0wIZGImgMsfwyeetYIhFyALodJqYRUsNiwI5P9TxwdjJtcBcN-iWnF61Tj7PiA5l-frWSL0Iko7XdMS00Qd7_ox7Ef0TdnD991pDWNECL3PiidgkR_55p-iv_3fn7HQVPid4TxfT7yNRv1NwF9W21SI2QJayZb75C-75sU0wFjcNqB1SMqUpJALyuKT0ILxMWBJTa2z9jB6c-xdwocx2blDOsvo5gFFtYi-oLEIxysbHgEr71vahlNeykt6yjF7MeWAEHb9wsx1fDVxF-7kG_kvFdaAp0aHvw0Y0yvf6hgzWYToFEwQnCZHKNc6yPzr0J3kpGmz9ZfmvyFqsIIrJod8s1600.png

#

it might still attend a bit to the first token because its a bit different than the other background tokens but less than the softmax.

unreal condor May 15, 2026, 11:56 AM

#

heavy crow I have a question on object detection transformer architectures. Standard softma...

Wouldn't an independent, per-token sigmoid activation fix this by allowing the model to flexibly attend to multiple targets simultaneously or completely ignore the background?
wdym by this?

pulsar crow May 15, 2026, 1:28 PM

#

Whate are some Examples of Quantitative Data Analysis Methods?

#

mean, median, mode, standard deviation...

orchid lance May 15, 2026, 4:36 PM

#

I'm looking for guidance to break into tier 1, buy-side quantitative hedge funds. I'm already a quant, but at a lower level (in risk & control side). My resume is probably good enough to get interviews with although I lack the pedigree. It would be nice if anyone can help me understand this industry because I don't really have connections in the space.

#

I'm currently studying the "Green Book" (A Practical Guide to Quantitative Finance Interviews) and doing NeetCode top 300 problems. I don't know if this is enough. I was thinking about also setting up an algorithmic trading bot and building out several machine learning projects to bolster my resume.

heavy crow May 15, 2026, 4:47 PM

#

unreal condor > Wouldn't an independent, per-token sigmoid activation fix this by allowing the...

In normal attention, we use softmax. This normalizes in such a way that the sum is equal to 1. But that means that it always has to equal one. it has to spend attention somewhere. it cant just ignore everything if it belives nothing is there

unreal condor May 15, 2026, 4:50 PM

#

heavy crow In normal attention, we use softmax. This normalizes in such a way that the sum ...

Are you talking about the output of the attention block or the final output of the final layer

heavy crow May 15, 2026, 4:52 PM

#

just from the scaled dot-product attention. so softmax(Q*K/sqrt(d_k))*V

orchid lance May 15, 2026, 4:52 PM

#

pulsar crow Whate are some Examples of Quantitative Data Analysis Methods?

In risk & control side, it doesn't go much farther than that plus a few more concepts. There's also outlier shooting algos, decision trees, RF, ATT/BTT analysis, confusion matrices for testing/validation, LASSO/Ridge regression, rule-based modeling. Time series for these models is almost always on monthly/quarterly/yearly basis, or rolling windows of 3/6/12 months; often, these windows are compared to the same window of the preceding year. All of that is for alert generation for a given model and there tends to be another layer that manages alerts across all models and can alter weights of the feeder models. If you say all of that, you'll 100% break into this field easily haha. That's the cheat sheet.

#

Being a quant in risk and control is like data scientist lite tbh. What I've just mentioned is pretty bottom of the barrel in terms of what other data scientists can do.

#

I was just really hoping someone here knew the process of becoming a quant trader/researcher/strategist at a tier 1 firm. I'm not sure how to differentiate myself and be taken seriously by the interviewers. I'm not even really sure about the interview topics.

unreal condor May 15, 2026, 5:02 PM

#

heavy crow In normal attention, we use softmax. This normalizes in such a way that the sum ...

I can't see how "it's always has to equal one" since the chance of equal one is astronomically low after the input has been passed through so many layers. And "it has to spend its attention somewhere" doesn't sound right because the attention block isn't the final block. And also the phrase "the model can pay attention using the attention mechanism" is kinda overly romanticized. Truth is deep within the layers of a neural net, things work like a blackbox so you shouldn't think of "attention" too literally

heavy crow May 15, 2026, 5:15 PM

#

I can't see how "it's always has to equal one" since the chance of equal one is astronomically low after the input has been passed through so many layers.
Why? softmax ensures this.

unreal condor May 15, 2026, 5:36 PM

#

heavy crow just from the scaled dot-product attention. so softmax(Q\*K/sqrt(d_k))\*V

You mean the sum of all output equal to 1? Also, the output is between 0 and 1 not 1, hence, the word "soft". If the output is strictly rounded to either 0 or 1 it would be called "hardmax"

heavy crow May 15, 2026, 5:42 PM

#

Yes, i mean it sums to 1. That means it cant output zero across the board.

unreal condor May 15, 2026, 5:45 PM

#

why do you want 0?

#

Like I said, it would be hardmax and iirc Andrew Ng explained why softmax is preferred

heavy crow May 15, 2026, 5:48 PM

#

im working with object detection and am noticing that for scenes with no objects the model has a hard time predicting a low background confidence

#

thats why i thought it might be because of the softmax

unreal condor May 15, 2026, 5:50 PM

#

is background a class that need to be classified in your dataset?

heavy crow May 15, 2026, 5:50 PM

#

this is pointcloud data so the model predicts a confidence and a position vote, dataset is about 1:3 balanced for bg vs fg

unreal condor May 15, 2026, 5:52 PM

#

so it's either bg or fg?

#

no other classes?

heavy crow May 15, 2026, 5:55 PM

#

right. So single class segmentation with regression

unreal condor May 15, 2026, 5:57 PM

#

oh, so segmentation

#

I thought you meant object detection like drawing bounding boxes around the objects

heavy crow May 15, 2026, 5:58 PM

#

well it is detection, each token casts a vote for the centroid of the object and its bb

#

but it does this for all points (or a subset) in the scene, which makes the confidence score more of a segmentation task

unreal condor May 15, 2026, 5:59 PM

#

object segmentation and object detection are two different problems tho

heavy crow May 15, 2026, 6:00 PM

#

Here the model does both.

unreal condor May 15, 2026, 6:02 PM

#

Is it a new problem or sth? Combining both detection and segmentation? I quit ML like a long time ago so I don't update myself anymore

heavy crow May 15, 2026, 6:04 PM

#

its similar to this: https://arxiv.org/pdf/1904.09664

unreal condor May 15, 2026, 6:04 PM

#

heavy crow this is pointcloud data so the model predicts a confidence and a position vote, ...

but anyway, I just had a quick lookup. Seems like your dataset is 3d and I had no experience working with this before. But I do think the principle still stands. The best is you just experiment with your method and the result will speak for itself

obtuse acorn May 15, 2026, 7:03 PM

#

anybody able to help me do something more effiently?

#

basically ive got a pandas dataframe that i got from reading json, and its got a column with a list in it

#

and im wanting to compare the lists of each row and store the overlapping data

#

currently im doing this but it doesnt seem very efficent

#

im 99.99% sure theres a better method

#

but my brain isnt coming up with it

serene scaffold May 15, 2026, 7:10 PM

#

@obtuse acorn remember to always share code as text. Not as a screenshot.

I think your code would be faster if you skipped pandas entirely and used sets.

obtuse acorn May 15, 2026, 7:11 PM

#

newData = []
for card in ids:
  for card2 in ids:
    card3 = pd.Series(card)
    card4 = pd.Series(card2)
    
   
    compared = card3[card4.isin(card3)]
    if (compared.count() > 0):
      newData.append(compared)

serene scaffold May 15, 2026, 7:11 PM

#

What type is ids?

obtuse acorn May 15, 2026, 7:14 PM

#

they are strings

#

i think

#

a list of strings

serene scaffold May 15, 2026, 7:18 PM

#

@obtuse acorn I'm busy (at pycon no less) but look into sets and set intersection in python. It's designed to solve this exact problem

viscid wigeon May 15, 2026, 7:50 PM

#

Hey guys, I am a beginner in ML and data science, I want to know what are the concepts that I have to know. For instance, I am a jr web developer and I want to implement a model that predicts disasters, in an weather app

obtuse acorn May 15, 2026, 8:01 PM

#

serene scaffold <@307928742167314433> I'm busy (at pycon no less) but look into sets and set int...

this is still gonna take a while isnt it, theres a lot of sets to compare

serene scaffold May 15, 2026, 8:05 PM

#

obtuse acorn this is still gonna take a while isnt it, theres a lot of sets to compare

It will probably be faster than what you're doing.
You could probably use multiprocessing

obtuse acorn May 15, 2026, 8:23 PM

#

i figured out why it was going so slow

#

i had exported it wrong and it had turned each character of the strings in the list into a set

primal hemlock May 16, 2026, 2:19 AM

#

I just realized how much money this turing pi thing really costs.

#

Damn near a thousand. Is there anything else I could use to learn ML?

#

turingpi.com for reference

iron basalt May 16, 2026, 2:20 AM

#

primal hemlock Damn near a thousand. Is there anything else I could use to learn ML?

A laptop is sufficient. ML has existed since the 50s (with that name, it existed prior to this term being coined).

primal hemlock May 16, 2026, 2:21 AM

#

Alright then

mellow vector May 16, 2026, 3:20 AM

#

primal hemlock I just realized how much money this turing pi thing really costs.

google colab has free gpus you can use, they're throttled by demand but if you plan to go over feed forward or convolution networks (LLM predecessors) it's great and free

primal hemlock May 16, 2026, 4:48 AM

#

mellow vector google colab has free gpus you can use, they're throttled by demand but if you p...

Oh neat, thanks.

crude hedge May 16, 2026, 6:16 AM

#

mellow vector google colab has free gpus you can use, they're throttled by demand but if you p...

i heard kaggle too

versed pilot May 16, 2026, 10:19 AM

#

yes, both kaggle and colab offer GPU and TPU acceleration options

#

But talking of learning ML, you don't have to go straight for GPU, learn the basics first, do some linear regression, look at SK learn etc.

gritty void May 16, 2026, 12:07 PM

#

There are cheaper GPU rent options like lightning.ai or vast.ai, but moving forward with free tier of Kaggle and Collab Pro+ should be first steps.

#

In the long term, buying a GPU with at least 32GB VRAM might me cheapest option tho.

obtuse acorn May 16, 2026, 1:21 PM

#

just marry someone with a powerful gpu

#

smh

gritty void May 16, 2026, 1:23 PM

#

obtuse acorn just marry someone with a powerful gpu

a 5090 will cost less than a wedding, hehe.

obtuse acorn May 16, 2026, 1:24 PM

#

prices of weddings vary widely

fallow coyote May 16, 2026, 4:38 PM

#

Apologies for being off topic (i.e. not talking about python), but how would you lot use Go for DS and ML?

serene scaffold May 16, 2026, 4:41 PM

#

fallow coyote Apologies for being off topic (i.e. not talking about python), but how would yo...

Why do you ask?

fallow coyote May 16, 2026, 4:43 PM

#

Im thinking about learning another programming language along with python so I want to see how. Just want to increase my skillset and see how I can use Go for data science and ML purposes.

serene scaffold May 16, 2026, 4:52 PM

#

fallow coyote Im thinking about learning another programming language along with python so I w...

If you want to learn another language, I wouldn't try to force it to be about data science and ML. Learning another language might help you make more kinds of things in general.

fallow coyote May 16, 2026, 5:27 PM

#

I might do that then. Tbf I was thinking about using Go for more network based projects. Could be useful if I need to quickly setup a network application

versed pilot May 16, 2026, 8:11 PM

#

It's not a language that is often mentioned for DS. Julia, R etc. yes.

serene scaffold May 16, 2026, 8:46 PM

#

versed pilot It's not a language that is often mentioned for DS. Julia, R etc. yes.

What would learning Julia and R afford someone who already knows "fluent python"?

tiny mauve May 17, 2026, 12:56 AM

#

hey I was just wondering if anyone here is a data scientist, if possible I can dm someone for advice on a roadmap, I’ve done my research online but I don’t know anyone with actual expertise in my life n wanted some personal help, if possible I’d appreciate havin a more in depth conversation in dms, I’m 23, restarting my life as a returning student at community college n plan in to transfer to uci after, any words would be greatly appreciated

serene scaffold May 17, 2026, 12:58 AM

#

tiny mauve hey I was just wondering if anyone here is a data scientist, if possible I can d...

I will answer questions as I'm able, but only in the server.
What does UCI stand for? There's too many universities for people to know all the acronyms.

tiny mauve May 17, 2026, 12:58 AM

#

university of Cali, irvine

serene scaffold May 17, 2026, 12:59 AM

#

Did you get a bachelor's in something else previously?

tiny mauve May 17, 2026, 12:59 AM

#

I took a big gap(3 years) and before I wasn’t really focused on school, I was pursuing a side hustle which ended up falling off

#

I’m coming back with a 2.23 gpa, n am trying to figure out a strategy to bring it up to an admissible grade for uc transfer (3.5)

serene scaffold May 17, 2026, 1:03 AM

#

So a few things you should know:

Tech hiring is way down. It might improve by the time you finish a degree. You should look at how much debt you're looking at and what your risk tolerance is.

"Data scientist" has never had a widely agreed upon or consistently applied meaning. You should look at current job listings for various titles and see what skills are being asked for.

tiny mauve May 17, 2026, 1:06 AM

#

I haven’t done too much extensive research yet on job listings for the field, I just figured it would work if I was passionate in business and analytics of the sort, coming back after the gap I figured I sort of had passions for understanding data n stuff along the lines of that

serene scaffold May 17, 2026, 1:07 AM

#

Then I would include "analyst" in the list of job titles that you look for listings for

versed pilot May 17, 2026, 9:05 AM

#

serene scaffold What would learning Julia and R afford someone who already knows "fluent python"...

I've barely used R and I have never used Julia so I can't tell you. I was simply stating that those languages get used for DS, whereas I was not aware of Go being used for DS

cursive cosmos May 17, 2026, 10:09 AM

#

versed pilot I've barely used R and I have never used Julia so I can't tell you. I was simply...

Go is commonly used in enterprise backend for distributed systems where models are deployed

#

speaking from experience of working at large ecom w/ 50m+ MAU

drowsy pollen May 17, 2026, 10:16 AM

#

gritty void There are cheaper GPU rent options like lightning.ai or vast.ai, but moving forw...

whoever ts thanks

#

im really new w ML

obtuse acorn May 17, 2026, 1:59 PM

#

im trying to think what would be the best way to store overlaps between data

#

like the easy way is to just store copies of the overlapping parts

#

but you could instead do something like storing the index of the overlapping parts and just read the data from the array when you need it

vale badge May 18, 2026, 10:13 AM

#

Hey peeps, I've got this graph, (Hue mean average over time) and it's showing some very strange oscillations. If I do a Fourier transform on the data set will that smooth out the whole graph? Also, if I want to find the frequency of the oscillation, and what might be causing it, how would I go about it?

Thanks in advance,

cursive cosmos May 18, 2026, 10:44 AM

#

vale badge Hey peeps, I've got this graph, (Hue mean average over time) and it's showing so...

its very hard to say for certain without knowing origin of the data, but oscillations are natural e.g. in physical systems.

yeah, fourier can help you out cut frequencies under some threshold and "dampen" the signal, you'll have to inspect the data to make sure that it didn't get wrong frequencies either though.

you can also do exponential moving average (EMA), which could be more versatile, since you can more easily iterate over weights. This feels much safer than frequency threshold.

Btw I have implementation of EMA for Adam optimizer in this notebook (there's also a link to the blog post that does an overview of EMA and where exactly it is in Adam): https://github.com/sutskelis/sutskelis_explains_stuff/blob/main/optimizers.ipynb

GitHub

sutskelis_explains_stuff/optimizers.ipynb at main · sutskelis/suts...

Dragon gives interview-friendly prespective on Machine Learning - sutskelis/sutskelis_explains_stuff

wooden sail May 18, 2026, 10:44 AM

#

vale badge Hey peeps, I've got this graph, (Hue mean average over time) and it's showing so...

what are you trying to do? inspecting images/video over time?

#

the fourier transform doesn't do any smoothing, it only gives you an alternative representation of the data. it should be able to tell you something about the nature of the oscillations

versed pilot May 18, 2026, 10:45 AM

#

vale badge Hey peeps, I've got this graph, (Hue mean average over time) and it's showing so...

If the data is in a Pandas dataframe then you can do an autocorrelation plot

#

which should pick up the periodicity, if there is any

#

and you can do rolling mean or rolling median etc. for smoothing

#

This is an autocorrelation plot from a project I'm working on

vale badge May 18, 2026, 10:49 AM

#

wooden sail the fourier transform doesn't do any smoothing, it only gives you an alternative...

Yeah, taking the average hue of video frames over time. And that's what I want actually, since the plot shows a clear curve, but also seems to show something which I think is a secondary frequency on top of it, and I'd like to know what that secondary effect is (See image).

wooden sail May 18, 2026, 10:50 AM

#

so the suggestion of lowpass filtering will probably work there. without knowing anything else about the topic, stuff like lighting changes introduces very sharp transitions

versed pilot May 18, 2026, 10:50 AM

#

That looks like you occasionally have outliers that skew the distribution?

#

Not really periodic, they start very frequent and become more sparse over time

wooden sail May 18, 2026, 10:51 AM

#

maybe a fourier transform can show you if there is a clear chunk of the spectrum that is nice, and other stuff that is noiselike

versed pilot May 18, 2026, 10:51 AM

#

so I wouldn't do either fourier or autocorrelation, I would go back to the raw data before the average

wooden sail May 18, 2026, 10:51 AM

#

but also maybe not. you can try to lowpass and also plot the magnitude spectrum and see if you learn something

#

what you can do is pick out a few of the frames, going by the timestamp, where these spikes occur

versed pilot May 18, 2026, 10:52 AM

#

maby take a small window around 300s and plot all points, or do box and whiskers for selected times etc.

wooden sail May 18, 2026, 10:52 AM

#

see if there is anything explainable causing the variations and whether they need to be addressed

vale badge May 18, 2026, 10:56 AM

#

wooden sail so the suggestion of lowpass filtering will probably work there. without knowing...

My working theory is that it's lighting related. The video I'm recording and analysing is from a webcam, with the lighting being provided from an LED, and I think the wedcam is picking up the flicker, but I've used this same LED under different conditions and not had this effect at all before.

But to summarise what you're all saying:
Go back to the original data and look for trends.
Maybe Lowpass filtering,

And analyse specific frames with notable peaks

Thanks guys

sudden canyon May 18, 2026, 12:00 PM

#

!rule 6 9 @jagged dew We do not allow looking for developers on this server.

arctic wedgeBOT May 18, 2026, 12:00 PM

#

Rules

6. Do not post unapproved advertising.

9. Do not offer or ask for paid work of any kind.

round crystal May 18, 2026, 8:31 PM

#

Open source models are lowk scary like why do I have to download 7600 zigabytes of parameters

fading sedge May 19, 2026, 4:13 AM

#

round crystal Open source models are lowk scary like why do I have to download 7600 zigabytes ...

U just lucky that u don't have to download dataset, only weights.

ionic zealot May 19, 2026, 5:14 AM

#

Hi everyone, I’m starting from zero and my goal is to learn programming first, then move into AI and machine learning. I prefer a desktop PC. What build would you recommend for this path if I want something reliable, upgradeable, and good for the long term?

unreal condor May 19, 2026, 5:50 AM

#

unless you have some serious budget, a normal setup is more than good enough for daily tasks. Just use cloud computing when you have enough knowledge and want to build some large models

vale badge May 19, 2026, 8:16 AM

#

So I looked in depth at the RGB averages the webcam is picking up, and it turns out the pattern matches some small variations in the blue channel that are then just being amplified when expressed as the Hue.

desert laurel May 19, 2026, 8:29 AM

#

Hello everyone 👋
I’m currently a BCA student and I want to build my career in Data Science / AI-ML.
I’m a beginner right now and I’m a bit confused about the roadmap.Could anyone please guide me:
What should I start learning first?
Which skills are most important for beginners?
How should I plan my daily study routine?
And what is the best way to practice and build projects?

hard nest May 19, 2026, 9:19 AM

#

In Google cloud, I have a project billed by slot time, the version is Standard and I have max 400 slots with auto scale. I want to estimate the cost of automatize some queries depending of the frecueny (every hour, every 4 hours...). How do you do it?

versed pilot May 19, 2026, 8:54 PM

#

Not sure about estimating slots, but if you do a dry run of the queries you get the Gibigbytes they process, I thought those kind of convert to $

#

you are not paying flat rate so many $/month for your 400 slots, right?

#

this in cloudshell or any shell with the SDK installed

bq query
--use_legacy_sql=false
--dry_run
'SELECT
COUNTRY,
AIRPORT,
IATA
FROM
project_id.dataset.airports
LIMIT
1000'

#

Or just paste the sql in the console query editor and it should validate it and show the data that will be processed

glass jetty May 20, 2026, 2:48 PM

#

@prime holly I've deleted your message. If you know it's off-topic, don't post it.

prime holly May 20, 2026, 2:49 PM

#

@glass jetty then where can i post it

#

which topic is it

glass jetty May 20, 2026, 2:50 PM

#

prime holly <@88336074710982656> then where can i post it

!ot we have off-topic channels, but really I think nobody can help you with this

arctic wedgeBOT May 20, 2026, 2:50 PM

#

Off-topic channel

#ot2-never-nester’s-nightmare

Please read our off-topic etiquette before participating in conversations.

prime holly May 20, 2026, 2:52 PM

#

glass jetty !ot we have off-topic channels, but really I think nobody can help you with this

ok

solemn depot May 20, 2026, 7:41 PM

#

Hey everyone. I’ve been practicing strict data cleaning and just finished a project matching exact crypto news publication times to 1-minute market data (Kaggle link: https://www.kaggle.com/datasets/yevheniipylypchuk/bitcoin-news-vs-1m-btc-price-action-2025-26).

The hardest part was standardizing the UTC timestamps and handling the exact T0/T+15m delta calculation. If anyone here has experience building backtesting pipelines or scraping financial news, I’d love a quick roast of the methodology in my notebook. Did I miss any obvious edge cases?

viscid wigeon May 20, 2026, 11:50 PM

#

Does someone know a good course for practical computer vision?

iron basalt May 20, 2026, 11:59 PM

#

viscid wigeon Does someone know a good course for practical computer vision?

https://www.youtube.com/watch?v=8jXIAWg_yHU&list=PLjMXczUzEYcHvw5YYSU92WrY8IwhTuq7p

YouTube

Joseph Redmon

The Ancient Secrets of Computer Vision - 01 - Introduction

The Ancient Secrets of Computer Vision

https://pjreddie.com/courses/computer-vision/

An introductory course on computer vision originally held Spring 2018 at the University of Washington.

▶ Play video

viscid wigeon May 21, 2026, 12:34 AM

#

iron basalt https://www.youtube.com/watch?v=8jXIAWg_yHU&list=PLjMXczUzEYcHvw5YYSU92WrY8IwhTu...

Thanks

jaunty helm May 21, 2026, 7:43 AM

#

solemn depot Hey everyone. I’ve been practicing strict data cleaning and just finished a proj...

my notebook

where

warm dune May 21, 2026, 7:04 PM

#

rn I'm studying ML Engineering, but I wanted to expand my knowledge to MLOPs, does anyone have a good course?

serene scaffold May 21, 2026, 7:25 PM

#

warm dune rn I'm studying ML Engineering, but I wanted to expand my knowledge to MLOPs, do...

I don't know of one, but the MLops skills I use the most are almost all related to docker/containers

warm dune May 21, 2026, 7:27 PM

#

serene scaffold I don't know of one, but the MLops skills I use the most are almost all related ...

like, after studying the structures and more (I still have to study a lot) I reached a barrier which is the computational power, I understand transformers, but it is impossible to make a GPT2 alone with my pc, so using already ready models is uam solution, I wanted something like that, to use already ready models and just change some things, I don't know if this would go into MLOps a lot

serene scaffold May 21, 2026, 7:28 PM

#

warm dune like, after studying the structures and more (I still have to study a lot) I rea...

This message does not seem to have anything to do with MLops.

warm dune May 21, 2026, 7:29 PM

#

serene scaffold This message does not seem to have anything to do with MLops.

Can you explain to me briefly what it is?

serene scaffold May 21, 2026, 7:30 PM

#

warm dune Can you explain to me briefly what it is?

When you actually run a model in an application and take care of everything related to that.

#

If you're still training the model you plan to use in an application, you're probably not doing anything related to MLops

hushed light May 21, 2026, 10:27 PM

#

I'm about to jump in to the waters of Machine Learning!!! I have no idea where to start. 🙂 I'm reading the beginner's guide for "Gymnasium" at the moment. I already have a game I've written in C that I want to use for the training. I.e., I want a ML agent to "learn" how to play this game. Presumably the output of this process is some file(s) with data that I can then use to write some kind of AI bot that can play my game, using this generated ML data? Is that the general flow?

serene scaffold May 22, 2026, 12:31 AM

#

hushed light I'm about to jump in to the waters of Machine Learning!!! I have no idea where t...

It depends on what technique you want to use. You don't want to use anything that even closely resembles what "agent" is currently understood to mean

#

The most popular way to make a beginner chess playing bot is to use a heuristic to calculate how favorable one board arrangement is to a given player. Then you consider different possibilities up to n turns ahead and decide how you can get to a better board in the fewest possible turns

#

But this isn't machine learning.

#

I don't recommend making a chess bot as your first ML project.

#

Hmm, why did I think you mentioned chess?

#

Is this a turn based game that you wrote? What I said is still applicable to turn based games with fully exposed state.

hushed light May 22, 2026, 1:25 AM

#

It’s a text-based (console) adventure game. It has 40 rooms with ability to navigate between them. There are treasures to find and use and monsters to fight and kill with different strategies. The goal is to escape to the “victory” room and maximize your score along the way.

serene scaffold May 22, 2026, 1:37 AM

#

hushed light It’s a text-based (console) adventure game. It has 40 rooms with ability to navi...

Interesting idea. I would first try to do this without using an LLM and instead produce all the player's inputs formulaicly.

#

You want to come up with a way to express game state and the player's decisions in some pure form, so that you can have a sequence of turns to train a model on.

hushed light May 22, 2026, 1:45 AM

#

Yea, I’ve refactored the code into a format that is (presumably) compatible with ML training. I have a reset() function to restore the initial game state.I have perform_action() as the method called by the ML during the training loop, etc. I have a defined GameState struct but I have been learning about Observational Space so I will be populating a struct for that, that is passed from the agent to the game engine. Which the game will update based on the agent’s actions.

#

The possible commands are very simple to start with, represented by single characters. I have commands to go in a direction NSEWUD, to Pick up an item, Fight monster or Retreat, etc.

hushed light May 22, 2026, 11:06 PM

#

So what exactly is Gymnasium? Is it just a tool for RL or is it for general purpose ML? What are its outputs? And once you have the outputs, what do you do with them?

iron basalt May 22, 2026, 11:25 PM

#

hushed light So what exactly is Gymnasium? Is it just a tool for RL or is it for general purp...

It's for RL, to have a set of common tasks to compare methods on.

#

Originally OpenAI, but it was abandonware and a mess. Lots of old papers use it and so to have those still be reproducible it was taken over by the Farama Foundation (forked). It has been heavily improved since then, effectively a full rewrite.

hushed light May 22, 2026, 11:31 PM

#

And what about the outputs and how to use them?

iron basalt May 22, 2026, 11:35 PM

#

hushed light And what about the outputs and how to use them?

import gymnasium as gym

# Initialise the environment
env = gym.make("LunarLander-v3", render_mode="human")

# Reset the environment to generate the first observation
observation, info = env.reset(seed=42)
for _ in range(1000):
    # this is where you would insert your policy
    action = env.action_space.sample()

    # step (transition) through the environment with the action
    # receiving the next observation, reward and if the episode has terminated or truncated
    observation, reward, terminated, truncated, info = env.step(action)

    # If the episode has ended then we can reset to start a new episode
    if terminated or truncated:
        observation, info = env.reset()

env.close()

hushed light May 23, 2026, 12:06 AM

#

As I stated, I am going through these tutorials right now. My question is about the end goal of this process. Does Gymnasium create some kind of "model" as its output? And then how would I use this "model" to control the thing I was training it for? Say I train it on how to land the lunar module. And now in my game the human user controls one lander and I want the other lander to be controlled by AI, presumably using the "model" I just trained. How do I use that model in my game?

iron basalt May 23, 2026, 12:52 AM

#

hushed light As I stated, I am going through these tutorials right now. My question is about ...

The model is separate, you bring your own. It only gives you something to train/test on.

#

It does what that code snippet does and nothing else.

#

It runs a virtual environment.

#

"How do I use that model in my game?" You give it observations, and it takes actions.

hushed light May 23, 2026, 1:37 AM

#

How do I "bring my own model" when my whole goal is to create a model I don't have yet? If I want to say, create a model that can land the lunar lander successfully. That doesn't exist at first.

hushed light May 23, 2026, 2:25 AM

#

I guess the first thing I need to know is the precise definitions of "model" and "agent."

iron basalt May 23, 2026, 2:41 AM

#

hushed light How do I "bring my own model" when my whole goal is to create a model I don't ha...

You create a model using training data which is extracted from the gymnasium environment.

warped salmon May 23, 2026, 9:15 AM

#

random rant: I see how useful AI is in fields like robotics but then I see how all the big companies are using it for the dumbest, most wasteful shit

#

like...

gilded depot May 23, 2026, 10:54 AM

#

hushed light As I stated, I am going through these tutorials right now. My question is about ...

you create an untrained model with random weights, maybe gymnasium does that for you though but you can probably pass your own model if you want to use a different architecture. When you run gymnasium, it trains the weights of the model

hushed light May 23, 2026, 8:01 PM

#

Ok, I think the word I am looking for is "Policy." Gymnasium trains to develop a Policy. How do I extract this policy from Gymnasium after I train it? How do I use this Policy in a different application that I write myself?

iron basalt May 23, 2026, 8:15 PM

#

hushed light Ok, I think the word I am looking for is "Policy." Gymnasium trains to develop a...

You need to learn about RL. You can read Reinforcement Learning: An Introduction by Sutton and Barto. It's the standard resource to learn RL.

#

You should already know some calculus and statistics prior to getting into this. Although it's still readable without knowing much of these subjects.

#

https://www.youtube.com/watch?v=C1lhuz6pZC0&list=PLUl4u3cNGP619EG1wp0kT-7rDE_Az5TNd&index=2 For general data science and ML starting knowledge.

YouTube

MIT OpenCourseWare

1. Introduction, Optimization Problems (MIT 6.0002 Intro to Computa...

MIT 6.0002 Introduction to Computational Thinking and Data Science, Fall 2016
View the complete course: http://ocw.mit.edu/6-0002F16
Instructor: John Guttag

Prof. Guttag provides an overview of the course and discusses how we use computational models to understand the world in which we live, in particular he discusses the knapsack problem and g...

▶ Play video

iron basalt May 23, 2026, 8:34 PM

#

hushed light Ok, I think the word I am looking for is "Policy." Gymnasium trains to develop a...

When you play a game, you use a learned policy to make decisions (hence the term "policy"). You are also using what you experienced while playing to update your policy such that it results in better decisions. The game knows nothing of your policy, or that you are a human playing it, it's just a game. So the game won't give you a policy/agent/model/AI/etc. You bring your own to play the game (which could be yourself, a person). Gymnasium is the game. It's just designed to simulate some game, and provide observations and rewards to the user.

hushed light May 23, 2026, 9:04 PM

#

I understand this. I am making the game. I have structured it such that it is trainiable via ML/RL. E.g., I have a reset(), perform_action(), check_game_over(), etc. I have modeled the data in such a way that I have an ObservationSpace struct that I update every game turn. I want to use Gymnasium to "train" on my game. Then I want to capture the Policy created by Gymnasium. I want to export this Policy, however this works. It's a black box to me at the moment. Then I want to implement an agent as part of my own game code that can play the game I wrote and just trained on. From a software perspective I know how to do all this. I just don't know what a Policy export is or what code I need to write in my own app to use it. Presumably it's very similar to the training code in Gymnasium where I start with the initial ObservationState after the first reset(). Then using the Policy data I extracted from Gymnasium, I determine the next action based on the current ObservationSpace. Rinse and repeat.... right?

iron basalt May 23, 2026, 9:09 PM

#

hushed light I understand this. I am making the game. I have structured it such that it is tr...

An example of a policy (not a learned one), is to randomly take some action from the action space each frame. That is a simple policy (env.action_space.sample()). A less simple policy would be taking some action based on what was observed. "Policy created by Gymnasium" - Gymansium does not to create policies. You create policies. If you boot up Tetris it does not spit out a policy, that's not its function. But when playing it, you receive information/data that you could use to craft a policy.

fickle shale May 24, 2026, 6:10 AM

#

how to perform good at datascience casestudies?

#

sometimes i am not able to think like i have an alzheimer!!

gilded depot May 24, 2026, 12:01 PM

#

hushed light I understand this. I am making the game. I have structured it such that it is tr...

the policy is in the weights of the model so you just need to save it

import gymnasium as gym
from stable_baselines3 import PPO

env = gym.make("CartPole-v1")
model = PPO("MlpPolicy", env, verbose=1)
model.learn(total_timesteps=10000)

model.save("ppo_cartpole_model")

# To load it later:
model = PPO.load("ppo_cartpole_model", env=env)

celest sandal Nov 2, 2017, 2:35 PM

#

hi

hearty hazel Nov 2, 2017, 2:38 PM

#

Hi o/

south quest Nov 2, 2017, 4:37 PM

#

Hõla

pearl valve Nov 3, 2017, 4:21 AM

#

Hey there

south quest Nov 3, 2017, 3:58 PM

#

Hõla

small sun Nov 6, 2017, 3:26 AM

#

how good has OCR gotten? i found a decently big data set i'd like to train a net on, but i need to extract the text from about ten hundred images of text on a generally mostly flat background

hearty hazel Nov 6, 2017, 7:08 AM

#

OCR is still tricky

#

I've worked with Tesseract in Python but it really is a bit hit and miss still

#

A lot of tools are deliberately not colour aware, including that one

#

And will convert images to black and white before trying to read them

#

So you need to be careful with light colours in your text

#

Some processing may be required beforehand

hollow kernel Nov 13, 2017, 8:15 PM

#

hello there

#

could anyone help me with processing some images?

#

basically i'm given a set of hundreds of images

#

and i want to convert each of them to a matrix

#

and then to a vector

hearty hazel Nov 13, 2017, 8:21 PM

#

You need to perform sone kind of OCR?

naive swallow Nov 13, 2017, 8:21 PM

#

Can't you use scipy for that sort of thing?

#

scipy.misc.imread()```
^ returns a numpy array

#

You can also use numpy, apparently:

>>> import Image, numpy
>>> numpy.asarray(Image.open('1.jpg').convert('L'))

hollow kernel Nov 13, 2017, 8:24 PM

#

i'm unsure what OCR is, i'm very very new to machine learning, and python

hearty hazel Nov 13, 2017, 8:25 PM

#

Text recognition from images

hollow kernel Nov 13, 2017, 8:25 PM

#

ah, yes that's what i'm trying to do

#

but i don' t necessarilly need help with that portion at the moment

#

it's the preprocessing of a different testing set that i'm working on

#

📎 test_0001.png

#

that's an example of an image

#

there's 150 '9's, 150 8s, etc down to 0

#

so my goal is to convert that image to a 28x28 image, then to a 28x28 matrix, then to a 784 (28*28) length vector

#

where i can test using my current model

hearty hazel Nov 13, 2017, 8:27 PM

#

so, you're trying to join these images together into a grid?

hollow kernel Nov 13, 2017, 8:28 PM

#

that i'm a bit unsure about

#

i understood it more as each image individually

#

this is what i have implemented atm

#

https://www.tensorflow.org/get_started/mnist/beginners

TensorFlow

MNIST For ML Beginners | TensorFlow

hearty hazel Nov 13, 2017, 8:29 PM

#

well I mean, you already can't really crop it to 28x28

#

anyway, if you need to construct or modify images

#

you want pillow

hollow kernel Nov 13, 2017, 8:29 PM

#

and i'd like to use that model to test it

hollow kernel Nov 13, 2017, 8:47 PM

#

if you're still around to help and i could point you towards this link

#

https://datascience.stackexchange.com/questions/5224/how-to-prepare-augment-images-for-neural-network

How to prepare/augment images for neural network?

I would like to use a neural network for image classification. I'll start with pre-trained CaffeNet and train it for my application.

How should I prepare the input images?

In this case, all the

#

that is effectively what i'm aiming to do, to process all the images and try to center the important parts

hearty hazel Nov 13, 2017, 8:51 PM

#

Machine learning is honestly not my area

#

we're kind of lacking on that department here to be honest

elder otter Nov 13, 2017, 8:58 PM

#

So I can help with this @hollow kernel

hollow kernel Nov 13, 2017, 8:58 PM

#

hello

#

that would be lovely lol

elder otter Nov 13, 2017, 8:58 PM

#

When they mean flatten, they just mean arranging the 28x28 image in a vector/array form

hollow kernel Nov 13, 2017, 8:59 PM

#

yes

#

so i have 1500 images

elder otter Nov 13, 2017, 8:59 PM

#

you can use any way you'd like to compress it into the array/vector form

#

as long as it's constant for all images

#

the network itself will identify relevant weights for the image

hollow kernel Nov 13, 2017, 9:01 PM

#

so it's the compressing into an array/vector form that's giving me trouble right now

#

like i said i'm new to python

#

but i have a folder full of images ranging from test_0001 to test_1500

#

so what i'd like to end up with is a 1500x784 array

elder otter Nov 13, 2017, 9:02 PM

#

the simplest way to do it is just join the rows of the 28x28 grid - this would work

hollow kernel Nov 13, 2017, 9:02 PM

#

i think

elder otter Nov 13, 2017, 9:03 PM

#

ya, you want 1500 long list of 784 length lists

#

if using python

hollow kernel Nov 13, 2017, 9:03 PM

#

yes

elder otter Nov 13, 2017, 9:05 PM

#

    compression = []
    for row in len(img):
        compression.extend(row)
    return compression```

#

or idk, probably not called compression

#

but something simple like this is enough

#

it's just that if you're using a basic neural net, especially one that operates on a by pixel basis and doesn't require convolution, it's much simpler to format the inputs in the form of a vector

hollow kernel Nov 13, 2017, 9:07 PM

#

that's effectively what i'm doing i think...

#

if each 28x28 matrix gets flattened into a 784 length vector

#

so pillow has one called resize

#

so if i were working with that

#

    compression = []
    for row in len(img):
        resize.extend(row)
    return compression```

#

?

#

or is that way off?

elder otter Nov 13, 2017, 9:11 PM

#

hmmm i've never worked with pillow before, but http://pillow.readthedocs.io/en/3.1.x/reference/Image.html is what i see?

hollow kernel Nov 13, 2017, 9:11 PM

#

http://pillow.readthedocs.io/en/4.3.x/reference/Image.html

#

i was looking at that one but yes

elder otter Nov 13, 2017, 9:15 PM

#

seems like it might be more like this ```
from PIL import Image

def compress("filepath"):
compression = []
img = Image.open("test1.jgp")
img = Image.resize( (28,28))
for row in Image.getdata(): # not sure about this
compression.extend....

#

tbh i think the best way is just to try it out bc you're working with an Image object, but you want the compression to return as a simple array/vector

#

bc it's all 1s and 0s, and there are no RGB values involved

hearty hazel Nov 13, 2017, 9:17 PM

#

This is some complicated stuff

#

I assure you I'm taking notes :P

elder otter Nov 13, 2017, 9:17 PM

#

noo not complicated at all

#

just preprocessing data aha

#

i've never worked with the pillow module, but as long as you can figure out how to resize the image, transform it into a vector you should be good to go to input into the neural net

hollow kernel Nov 13, 2017, 9:21 PM

#

so that should work

#

but i'm unsure how to do it for the entire data set

#

and i think that's where you put the # not sure about this lol

#

from PIL import Image
import os, sys

path = "/home/joe/Desktop/CSE474/proj3/Test/"
dirs = os.listdir( path )

def resize():
    for item in dirs:
        if os.path.isfile(path+item):
            im = Image.open(path+item)
            f, e = os.path.splitext(path+item)
            imResize = im.resize((28,28), Image.ANTIALIAS)
            imResize.save(f + ' resized.png', 'PNG', quality=90)

resize()

#

that's sort of working

elder otter Nov 13, 2017, 9:35 PM

#

Just for loop across all files

#

For entire dataset

#

The function compress is meant to work for a single image

#

Loop over all images calling compress on each

#

The not sure is bc idk how the pillow image object works

#

this is actually less of a machine learning problem, and more of a how to use python modules problem

ionic summit Nov 17, 2017, 12:27 PM

#

hello, are you aware of any python library that centralize and ease the download and load of machine learning dataset?

lapis sequoia Nov 17, 2017, 12:27 PM

#

✨ Level Up!! ✨

Wolfgang just got to Level 1 - Beginner

ionic summit Nov 17, 2017, 12:28 PM

#

I mean, when you use sklearn, you have access to the "dataset" module for this purpose but for example with mnist, the function only load few examples of the total dataset

modern vapor Dec 9, 2017, 7:54 PM

#

Hello Everyone, does anybody have a link where i can find weather sensitive product dataset or something similar.

hearty hazel Dec 9, 2017, 8:01 PM

#

You'll have to explain what you mean by that I think

eternal falcon Jan 22, 2018, 3:31 AM

#

so i'm just starting out with machine learning and i'm having trouble finding a place to begin so my question is, where do i begin?

#

i have absolutely no education in calculus, my highschool was a joke. i've found that to be a hurdle from what i can tell.

rose quarry Jan 25, 2018, 11:28 PM

#

@zigg https://machinelearningmastery.com/machine-learning-in-python-step-by-step/

Machine Learning Mastery

Your First Machine Learning Project in Python Step-By-Step - Machine Learning Mastery

Do you want to do machine learning using Python, but you’re having trouble getting started? In this post, you will complete your first machine learning project using Python. In this step-by-step tutorial you will: Download and install Python SciPy and get the most useful package for machine learning in Python. Load a dataset and understand …

#

Im not a master in machine learning, in fact I know literally nothing, but this was a pretty cool intro to it

#

https://github.com/collections/machine-learning

GitHub

Getting started with machine learning - GitHub Collection

Today, machine learning—the study of algorithms that make data-based predictions—has found a new audience and a new set of possibilities.

#

theres also this

foggy moss Jan 25, 2018, 11:45 PM

#

calculus is helpful

#

but you really just need to understand the ideas of calculus

#

you dont need to learn how to solve a bunch of differential systems

placid river Jan 26, 2018, 3:25 AM

#

Hey there

undone jackal Jan 27, 2018, 7:33 PM

#

i dont really believe you need calculus for it, but it certainly helps

charred kite Jan 28, 2018, 1:39 PM

#

i found these videos helpful for learning calculus: https://www.youtube.com/playlist?list=PLZHQObOWTQDMsr9K-rj53DwVRMYO3t5Yr

YouTube

Essence of calculus - YouTube

The goal here is to make calculus feel like something that you yourself could have discovered.

#

i havent finished them yet but so far theyre very informative

#

usually when it comes to maths 3Blue1Brown is my go to

undone jackal Jan 29, 2018, 4:53 PM

#

they and numberphile are the best math/number related channels ive seen

#

carykh is pretty great too even though its not pure math

quick willow Jan 31, 2018, 12:46 AM

#

Does tensorflow not work for python3?

#

3.6 rather

spark nimbus Jan 31, 2018, 6:20 AM

#

It should

tight dove Jan 31, 2018, 9:21 AM

#

Hi. Good morning

#

I'm about to download Anaconda for Data Analytics

#

There are two versions available

#

for 2.7 and 3.6 versions of Python

#

Please I need advise for which version I should install

naive swallow Jan 31, 2018, 9:22 AM

#

3.6

hearty hazel Jan 31, 2018, 9:22 AM

#

@tight dove #welcome has an FAQ about that

naive swallow Jan 31, 2018, 9:22 AM

#

no contest :^)

tight dove Jan 31, 2018, 9:27 AM

#

@hearty hazel , @naive swallow Thanks!

#

I hope to ask and contribute here!

hearty hazel Jan 31, 2018, 9:28 AM

#

We look forward to having you \o/

quick willow Jan 31, 2018, 11:32 AM

#

I got it installed with aconda

#

Wasn't working with pip

quick willow Jan 31, 2018, 4:07 PM

#

tensorflow is harder than I imagined GWdarateroLongneckThink

dim beacon Jan 31, 2018, 5:13 PM

#

@quick willow you may want to use a higher-level abstraction such as TFLearn or Keras, which simplify working with NNs/DNNs using Tensorflow (or even other backends like Theano or Torch)

south quest Jan 31, 2018, 5:13 PM

#

I used TFLearn a little

#

It's pretty nice

quick willow Jan 31, 2018, 5:22 PM

#

I'll check it out

spark nimbus Feb 1, 2018, 12:03 AM

#

Does anyone have any good references for Natural Language Processing?

wild oasis Feb 2, 2018, 1:57 PM

#

@spark nimbus I'm also trying to get into NLP but specifically into language classification. If that's what you're interested in then I have found some papers

#

I will probably be doing the language identification with N-grams since that seems like the best approach. I'm currently trying to decide on which Python library to be using for this. TensorFlow / TFLearn? NLTK? TextBlob? Something else?Does anybody know which library is the best?

spark nimbus Feb 2, 2018, 3:02 PM

#

I have never actively worked with machine learning so I wouldn't know

quick willow Feb 2, 2018, 3:35 PM

#

TFLearn is simply a wrapper for Tensorflow

wild oasis Feb 2, 2018, 3:40 PM

#

I was researching this for the past 2 hours and it seems that NLTK is the best thing to use for this. Tensorflow etc. is overkill

novel hornet Feb 4, 2018, 1:53 AM

#

Does anyone have any experience using convolutional nets to read bar graph data?

#

Or know of any papers aroudn the idea?

odd basin Feb 4, 2018, 11:29 AM

#

why there is a high demand for ML programmers?

hearty hazel Feb 4, 2018, 11:44 AM

#

A lot of companies think it's the Next Big Thing

#

in a way, they're not wrong, but I don't think it's as universally useful as they do

lean ledge Feb 4, 2018, 11:45 AM

#

It definitely is the next big thing for large companies with lots of data

hearty hazel Feb 4, 2018, 11:46 AM

#

Yeah, but that isn't everyone :P

lean ledge Feb 4, 2018, 11:50 AM

#

Could definitely apply to everyone though. And definitely could revolutionise not just tech and CS industry, but lots of scientific industries and lots of businesses

odd basin Feb 4, 2018, 11:53 AM

#

ok

#

so to do machine learning I found in coursera you have to understand statistics how far i should learn that subject?

#

at least the fundamentals?

lean ledge Feb 4, 2018, 11:56 AM

#

machine learning isnt about programming

#

its more like a field of maths

#

need knowledge of linear algebra, statistics and some calculus

#

be comfortable with those three

odd basin Feb 4, 2018, 11:57 AM

#

oh

lean ledge Feb 4, 2018, 11:59 AM

#

Heres a sneak peek https://www.youtube.com/watch?v=aircAruvnKk

YouTube

3Blue1Brown

But what *is* a Neural Network? | Chapter 1, deep learning

Subscribe to stay notified about new videos: http://3b1b.co/subscribe Support more videos like this on Patreon: https://www.patreon.com/3blue1brown Special t...

▶ Play video

#

and https://www.youtube.com/watch?v=IHZwWFHWa-w and https://www.youtube.com/watch?v=Ilg3gGewQ5U&t=700s and https://www.youtube.com/watch?v=tIeHLnjs5U8&t=533s

YouTube

3Blue1Brown

Gradient descent, how neural networks learn | Chapter 2, deep learning

Subscribe for more (part 3 will be on backpropagation): http://3b1b.co/subscribe Thanks to everybody supporting on Patreon. https://www.patreon.com/3blue1bro...

▶ Play video

YouTube

3Blue1Brown

What is backpropagation really doing? | Chapter 3, deep learning

What's actually happening to a neural network as it learns? Training data generation + T-shirt at http://3b1b.co/crowdflower Crowdflower does some cool work ...

▶ Play video

YouTube

3Blue1Brown

Backpropagation calculus | Appendix to deep learning chapter 3

This one is a bit more symbol heavy, and that's actually the point. The goal here is to represent in somewhat more formal terms the intuition for how backpro...

▶ Play video

#

You pretty much need a PhD in ML to be considered for a job/research

odd basin Feb 4, 2018, 12:00 PM

#

wowww

#

i dont even have a degree

#

haha

lean ledge Feb 4, 2018, 12:02 PM

#

(thats simply because almost all research fields require a PhD to be taken seriously anyway and most positions for ML happen to be at big companies who have the resources and need of ML want the best people)

#

Same

#

I'm not even an adult yet, I cant imagine spending years more at uni in what feels like a super specific field

odd basin Feb 4, 2018, 12:03 PM

#

well i guess i should look into other field

#

i want to make money

#

fast

#

you know

lean ledge Feb 4, 2018, 12:04 PM

#

almost no technical field in STEM makes money fast. finance or something is what you want to be looking at

#

most things that make lots of money require years of commitment or moving into something that isnt the field (like engineering management rather than engineering)

odd basin Feb 4, 2018, 12:07 PM

#

iam commitment to put years but i never was a good student

#

yeah to make money and get something we have to pur a lot of dedication

#

put*

lapis sequoia Feb 4, 2018, 9:15 PM

#

New to machine learning. How do I start? Are there any good tutorials? Appreciate the help. Thank you!

earnest prawn Feb 4, 2018, 9:18 PM

#

mainly google and docs of the lib you use

novel hornet Feb 4, 2018, 9:19 PM

#

Read o’reilly’s data science from scratch with python

#

And/or machine learning with scikit learn and tensorflow

#

I think both might be available online

quiet gyro Feb 4, 2018, 9:42 PM

#

TensorFlow has a great tutorial series

#

Two versions, one for people new to ML, another for people who know the fundamentals of ML already

lapis sequoia Feb 4, 2018, 10:09 PM

#

Thank you guys!

lean ledge Feb 5, 2018, 12:45 AM

#

Is anyone else annoyed about the number of people trying to "do ML" by watching tutorials and videos that walk them through basic things, leaving them with no mathematical understanding of what they're doing

#

Everyone's trying to do ML without realising the nature of the field because it sounds cool and is the new hot thing

charred kite Feb 5, 2018, 12:46 AM

#

i mean id get annoyed at myself for not knowing it, but not others

#

my general view is 'you do you'

#

i quite like learning maths behind these concepts tho, and as such have pursued learning calculus before even touching any form of learning

lean ledge Feb 5, 2018, 12:49 AM

#

It just annoys me when people try to just jump into super quantitative and large fields without literally any background or research. Got a bunch of people asking how to get into quantum mechanics with high school level maths on the physics server I admined

#

I think part of it is that even if they do learn something basic, it often leads to them pretending they know what they're doing and being overconfident

charred kite Feb 5, 2018, 12:50 AM

#

i do know what you mean there

#

i sometimes get the opposite effect, other people thinking im a master of some things i do (when i am very much not), which i guess can cause people to do that if it happens to them

ripe vessel Feb 5, 2018, 12:53 AM

#

Personally, I'd rather jump into something without knowing what I'm doing in order to learn more about what I'm trying to do, and the solution to my "problem". I know it's not the same for everyone, but I can personally see why people without any backing or prior knowledge would try to jump into a topic like maching learning.

lean ledge Feb 5, 2018, 12:56 AM

#

Intro ML doesn't even have the same requirements as does something like intro QM so it's probably still easier to get into ML. But the fact that people don't even research what it involves and just ask for basic tutorials or YouTube videos on it?? It's a large academic field like any scientific field.

charred kite Feb 5, 2018, 1:00 AM

#

do you mean like people who are only doing it to make something that looks cool (as a sort of boastful act maybe), rather than to learn about it and get better at it?

#

like those who are looking for the easy way, rather than the proper way

lean ledge Feb 5, 2018, 1:12 AM

#

Sort. People that are so ignorant and arrogant enough that they have no clue what the field they want to study involves and have no idea how little they know already

lapis sequoia Feb 5, 2018, 2:26 AM

#

Yes agreed @lean ledge. I asked because I work for a health research. we are starting a new project soon where we are going to use machine learning.

lapis sequoia Feb 5, 2018, 11:49 AM

#

hey guys

#

I have trouble printing results from my classifiers.. I have the code up and running.. I have good accuracy but I'm not sure how to do the confusion matrix

#

and not sure how to print results

#

can someone help?

earnest prawn Feb 5, 2018, 4:24 PM

#

kinda hard to help people without knowing their code

lapis sequoia Feb 5, 2018, 4:27 PM

#

ahh yeah

#

hold on

#

here's my code

#

https://pastebin.com/BuGtXHkk

Pastebin

code - Pastebin.com

#

i have the accuracy.. there's no train test split from what I can observe of it.. I basically tried to fork another code and modify it for my purpose.. the accuracy is good but I need to print the results of the models..and stuff

#

i don't know how

earnest prawn Feb 5, 2018, 4:30 PM

#

uuh never worked with keras dont think i can help you. my only advice would be looking up the docs and maybe do some dir() if you don´t find anything in the docs sry

lapis sequoia Feb 5, 2018, 4:32 PM

#

oki

#

Philosophical question, do you think data science vs web development has bigger potential to benefit humanity in the long run and why? :x

#

depends on how we use them..

#

as with anything else..

dusky agate Feb 5, 2018, 4:36 PM

#

it's a very broad question.

lapis sequoia Feb 5, 2018, 4:36 PM

#

so is the data science hype going to fade or explode out of proportions?

#

it's not a hype.. it'll be a way of life

#

it's not replacing web development..

earnest prawn Feb 5, 2018, 4:38 PM

#

the hype wont fade its just ... like tron says actually

lapis sequoia Feb 5, 2018, 4:38 PM

#

there's no measure of comparison.. it's apples and peanuts

#

fineee

foggy moss Feb 5, 2018, 4:48 PM

#

h u h

#

thats a weird question

#

data science is still in infancy

#

and yet is already critical to so many things

#

in a few years your LG refrigerator will have more data science in it than all the data science in the DoD today

#

i recommend the first couple chapters of the Undoing Project by Michael Lewis

#

and someday

#

i hope in my lifetime

#

ML can give birth to "auto brightness" on a phone

#

that actually works

#

grumpy

austere slate Feb 5, 2018, 9:05 PM

#

+1 @foggy moss hahahaha sooo true

odd basin Feb 10, 2018, 4:40 PM

#

hey guys

#

I dont want to learn R

#

Is there any book like Statistics with python

weak kiln Feb 10, 2018, 5:19 PM

#

he said he didn't want to learn R :D

naive swallow Feb 10, 2018, 5:23 PM

#

...

#

There's a good edX course for it

#

you can audit the course for free

#

https://www.edx.org/course/statistics-probability-data-science-uc-san-diegox-dse210x

edX

Statistics and Probability in Data Science using Python

Using Python, learn statistical and probabilistic approaches to understand and gain insights from data.

tight dove Feb 11, 2018, 9:53 AM

#

Hi guys

#

Please where can I learn Churn Prediction?

lapis sequoia Feb 11, 2018, 4:10 PM

#

@tight dove https://www.dataiku.com/learn/guide/tutorials/churn-prediction.html

tight dove Feb 11, 2018, 7:58 PM

#

@lapis sequoia thanks man

hasty maple Feb 12, 2018, 3:12 PM

#

@lean ledge "Sort. People that are so ignorant and arrogant enough that they have no clue what the field they want to study involves and have no idea how little they know already" Why do you think knowing the math is all so important for using a model of ML? I know what gradient descent does, take partial derivative(gives positive slope) and use that directional information to head to the minimum of the cost function. Even if I can't do the actual partial derivative I am satisfied with this understanding. I do similar abstractions of the concepts and understand them, piece them and apply the algorithms to my problem statements. Is this also considered ignorant/arrogant in your view?

loud crypt Feb 13, 2018, 12:17 PM

#

whats machine learning?

hearty hazel Feb 13, 2018, 12:17 PM

#

https://en.wikipedia.org/wiki/Machine_learning

Machine learning

Machine learning is a field of computer science that gives computers the ability to learn without being explicitly programmed.
The name Machine learning was coined in 1959 by Arthur Samuel. Evolved from the study of pattern recognition and comput...

tepid pagoda Feb 18, 2018, 10:04 AM

#

Math at Morning @..@

📎 photo_2018-02-18_10-41-33.jpg

young blaze Feb 18, 2018, 10:47 AM

#

Hi, guys! Does anybody know a good machine learning course?

#

@earnest prawn are you Dutch by chance? 😄

earnest prawn Feb 18, 2018, 10:50 AM

#

German

young blaze Feb 18, 2018, 10:51 AM

#

oh, lol

#

Niemand also means nobody in Dutch

earnest prawn Feb 18, 2018, 10:52 AM

#

I know

#

Already caused some confusion because of that

young blaze Feb 18, 2018, 10:53 AM

#

Do you know any machine learning?

earnest prawn Feb 18, 2018, 10:53 AM

#

Barely

#

Fun story

#

I actually listened to a two hour presentation of a Dutch professor about machine learning, he was speaking """"""German""""""""

young blaze Feb 18, 2018, 10:54 AM

#

They're almost the same anyway XD

tepid pagoda Feb 18, 2018, 10:55 AM

#

i know a little bit but only in german xD

earnest prawn Feb 18, 2018, 10:55 AM

#

Dutch is raped German

young blaze Feb 18, 2018, 10:55 AM

#

it's actually more the other way around

earnest prawn Feb 18, 2018, 10:55 AM

#

No

#

German is older

young blaze Feb 18, 2018, 10:55 AM

#

German sounds like an angry Dutchman

earnest prawn Feb 18, 2018, 10:55 AM

#

I looked that up during a discussion with another Dutch guy

#

German was there before Dutch so Dutch is raped German

young blaze Feb 18, 2018, 10:56 AM

#

goddamit

#

anyway, do you know some good resources?

#

I'm having a hard time finding them

#

What's even worse is that all resources are in English.

earnest prawn Feb 18, 2018, 10:57 AM

#

Personally not but I am quite sure if you go to search and enter sth like
in: #data-science-and-ml has: link
You will find stuff

young blaze Feb 18, 2018, 10:57 AM

#

okay, thanks!

hasty maple Feb 18, 2018, 6:42 PM

#

@young blaze You could try andrew ng's basic ML course to get started in ML

young blaze Feb 18, 2018, 6:43 PM

#

https://www.coursera.org/learn/machine-learning

Coursera

Machine Learning | Coursera

Machine Learning from Stanford University. Machine learning is the science of getting computers to act without being explicitly programmed. In the past decade, machine learning has given us self-driving cars, practical speech recognition, ...

#

this one?

hasty maple Feb 18, 2018, 6:43 PM

#

Yes

vast oasis Feb 19, 2018, 9:14 PM

#

I have some ML course material for python from my uni

#

Maybe I can share that

#

If anyone would be interested.

hearty hazel Feb 19, 2018, 9:17 PM

#

I'm sure some people would, we're sorely lacking on ML resources here

hasty maple Feb 20, 2018, 4:15 PM

#

https://jgreenemi.github.io/MLPleaseHelp/ It's a resource that holds resources that jgreenemi started on a ML server I am on. Hopefully this can be used as a place to look for some ML resources 😄

hearty hazel Feb 20, 2018, 4:16 PM

#

naive swallow Feb 20, 2018, 4:19 PM

#

Thanks for the resource

#

This channel's pretty dead

hasty maple Feb 20, 2018, 4:20 PM

#

https://www.youtube.com/watch?v=yDLKJtOVx5c&list=PLD0F06AA0D2E8FFBA 15.1, 15.2 in this playlist helped me understand 2nd order optimizers. I didn't have time for seeing other videos but based on personal experience of 15.1 and 15.2 I assume the rest would be good as well.

YouTube

mathematicalmonk

(ML 1.1) Machine learning - overview and applications

Attempt at a definition, and some applications of machine learning. A playlist of these Machine Learning videos is available here: http://www.youtube.com/my_...

▶ Play video

#

Well there are dedicated ML discord servers so I presume most discord ML conversations go there and lesser people visit python server for ML specifics. It's usually those that are reasonably good with basic python coding that jump into ML so seems normal they don't visit python server for ML and hence the resources for ML on python server is kinda less. If that makes sense 😅

vast oasis Feb 20, 2018, 5:56 PM

#

As soon as I'm doing studying I will put all the stuff on my git.

#

Is there a way to set a reminder for myself here?

earnest prawn Feb 20, 2018, 5:56 PM

#

nop

#

maybe implement a feature for the bot 😄

vast oasis Feb 20, 2018, 5:59 PM

#

Can I set a reminder to remind me to build a remind feature for the bot?

earnest prawn Feb 20, 2018, 5:59 PM

#

u could open an issue on github

vast oasis Feb 20, 2018, 5:59 PM

#

😉

#

You mean InfoBot?

earnest prawn Feb 20, 2018, 6:00 PM

#

no

#

self.help()

arctic wedgeBOT Feb 20, 2018, 6:00 PM

#

class Bot:
    bot.info()        # Get information about the bot
class NoCategory:
    bot.help          # Shows this message.

# Type self.help() command for more info on a command.
# You can also type self.help() category for more info on a category.

earnest prawn Feb 20, 2018, 6:00 PM

#

this one

vast oasis Feb 20, 2018, 6:01 PM

#

self.info()

arctic wedgeBOT Feb 20, 2018, 6:01 PM

#

Python Bot

A utility bot designed just for the Python server! Try bot.help() for more info.

Total Users

1727

Git SHA

47136cf

earnest prawn Feb 20, 2018, 6:01 PM

#

no bot commands here

#

just for showing the bot

vast oasis Feb 20, 2018, 6:01 PM

#

Please bot give me your github link.

#

😄

earnest prawn Feb 20, 2018, 6:01 PM

#

oh go to github search for discord-python

vast oasis Feb 20, 2018, 6:01 PM

#

👍

naive swallow Feb 20, 2018, 6:01 PM

#

bot.help()

arctic wedgeBOT Feb 20, 2018, 6:01 PM

#

class Bot:
    bot.info()        # Get information about the bot
class Deployment:
    bot.deploy_site() # Trigger website deployment on the server - will only ...
    bot.redeploy()    # Trigger bot deployment on the server - will only rede...
    bot.uptimes()     # Check the various deployment uptimes for each service
class NoCategory:
    bot.help          # Shows this message.

# Type bot.help() command for more info on a command.
# You can also type bot.help() category for more info on a category.

naive swallow Feb 20, 2018, 6:01 PM

#

pls

earnest prawn Feb 20, 2018, 6:01 PM

#

its the user owning the bot

naive swallow Feb 20, 2018, 6:02 PM

#

bot.help > self.help

arctic wedgeBOT Feb 20, 2018, 6:02 PM

#

No command called ">" found.

void depot Feb 20, 2018, 6:02 PM

#

shouldn't bot.help be bot.help() if it's a function

earnest prawn Feb 20, 2018, 6:02 PM

#

zis one

void depot Feb 20, 2018, 6:02 PM

#

bot.help

arctic wedgeBOT Feb 20, 2018, 6:02 PM

#

class Bot:
    bot.info()        # Get information about the bot
class NoCategory:
    bot.help          # Shows this message.

# Type bot.help command for more info on a command.
# You can also type bot.help category for more info on a category.

earnest prawn Feb 20, 2018, 6:02 PM

#

https://github.com/discord-python

GitHub

Python Discord

A large, friendly Discord server for learners and experienced users alike

void depot Feb 20, 2018, 6:02 PM

#

oh

#

uhhh

naive swallow Feb 20, 2018, 6:02 PM

#

bot.help()

arctic wedgeBOT Feb 20, 2018, 6:02 PM

#

class Bot:
    bot.info()        # Get information about the bot
class Deployment:
    bot.deploy_site() # Trigger website deployment on the server - will only ...
    bot.redeploy()    # Trigger bot deployment on the server - will only rede...
    bot.uptimes()     # Check the various deployment uptimes for each service
class NoCategory:
    bot.help          # Shows this message.

# Type bot.help() command for more info on a command.
# You can also type bot.help() category for more info on a category.

naive swallow Feb 20, 2018, 6:02 PM

#

you can do both

earnest prawn Feb 20, 2018, 6:02 PM

#

nooo

vast oasis Feb 20, 2018, 6:02 PM

#

Yeah, got it.

void depot Feb 20, 2018, 6:02 PM

#

those are two different commands

earnest prawn Feb 20, 2018, 6:02 PM

#

no bot commands

void depot Feb 20, 2018, 6:02 PM

#

they do two different things

naive swallow Feb 20, 2018, 6:02 PM

#

now that's weird GWchadMEGATHINK

vast oasis Feb 20, 2018, 6:04 PM

#

Nobody said no bot commands

naive swallow Feb 20, 2018, 6:04 PM

#

#bot-commands

vast oasis Feb 20, 2018, 6:04 PM

#

Yeah, I was just kidding. "Niemand" translates to "Nobody" in german...

hearty hazel Feb 20, 2018, 6:05 PM

#

@void depot They're the same command.

earnest prawn Feb 20, 2018, 6:05 PM

#

yesss

hearty hazel Feb 20, 2018, 6:05 PM

#

Aperture just has access to more stuff than you

void depot Feb 20, 2018, 6:05 PM

#

ohh ok

earnest prawn Feb 20, 2018, 6:05 PM

#

finally someone noticing its german and not saying its dutch

#

woop

#

thanks @vast oasis

hearty hazel Feb 20, 2018, 6:05 PM

#

Your name is Dutch? I knew that.

#

:>

earnest prawn Feb 20, 2018, 6:05 PM

#

no

vast oasis Feb 20, 2018, 6:05 PM

#

Haha

earnest prawn Feb 20, 2018, 6:05 PM

#

its german

#

f*** u gdude

hearty hazel Feb 20, 2018, 6:05 PM

#

Haha

vast oasis Feb 20, 2018, 6:06 PM

#

trolled

earnest prawn Feb 20, 2018, 6:06 PM

#

does f*** u gude count as swearing?

naive swallow Feb 20, 2018, 6:09 PM

#

#ot0-psvm’s-eternal-disapproval

hasty maple Feb 20, 2018, 6:55 PM

#

inb4 ml is spam land 😂

spring flicker Feb 21, 2018, 10:52 PM

#

ok so i have some questions. if anyone can give me a hand...< noob

#

when i download a github repository. i can never seen to figure out what's what , as in: which file is the ai? also when would i have use for a trainer file? wouldn't i use the ai itself during training?

#

i can't seem to find any tutorials that walk you through navigating a large project.

lean ledge Feb 21, 2018, 10:57 PM

#

Look at the file you're supposed to run. Read it

spring flicker Feb 21, 2018, 10:58 PM

#

they don't specify which file that would be.

dusky agate Feb 21, 2018, 10:58 PM

#

is there a readme.md?

lean ledge Feb 21, 2018, 10:59 PM

#

Well, look for a file with the same name as the project, or something along the lines of "main" or something

spring flicker Feb 21, 2018, 10:59 PM

#

i go through them one by one reading, and for the most part i can say ok this is building a database or this is specifying the parameters of a training environment

#

yeah

#

📎 thingy.jpg

#

so i think cool watch the video. but the video if him writing script that isn't in the repository, and makes no mention of it.

#

i'm not asking hey what is this guy talking about.

charred kite Feb 21, 2018, 11:03 PM

#

the layout may have been guidelines set out by the creators of whatever training/ai module thing he used

spring flicker Feb 21, 2018, 11:04 PM

#

i'm asking is there a resource for learning how this stuff works

charred kite Feb 21, 2018, 11:04 PM

#

i'd suggest taking a look at those

spring flicker Feb 21, 2018, 11:05 PM

#

this is what i was talking about with siraj's videos. @south quest

earnest prawn Feb 23, 2018, 10:08 PM

#

you will (sadly) rarely find help about ml here

lapis sequoia Feb 23, 2018, 10:08 PM

#

oh okay

earnest prawn Feb 23, 2018, 10:08 PM

#

and no need to delete stuff

south quest Feb 23, 2018, 10:08 PM

#

there are various modules for markov chain generation

lapis sequoia Feb 23, 2018, 10:08 PM

#

I thought the question didn't belong here

#

so I deleted it

south quest Feb 23, 2018, 10:08 PM

#

I use this one https://github.com/jsvine/markovify

GitHub

jsvine/markovify

markovify - A simple, extensible Markov chain generator.

lapis sequoia Feb 23, 2018, 10:09 PM

#

thanks

loud crypt Feb 25, 2018, 3:17 PM

#

Theoretically how would a machine learn?

lapis sequoia Feb 25, 2018, 3:18 PM

#

pattern recognition is one example @loud crypt

dim beacon Feb 25, 2018, 4:07 PM

#

@loud crypt cost function minimization, reinforcement methods, backpropagation, clustering, etc.

#

For instance, neural network mathematical models are theoretically based on real biological neural networks, which means that, to some extent, artificial NNs learn the same way as we do

loud crypt Feb 25, 2018, 4:15 PM

#

so you would need to study how the brain functions to make a model of it @dim beacon

dim beacon Feb 25, 2018, 4:15 PM

#

@loud crypt we know how neurons work, for decades

loud crypt Feb 25, 2018, 4:16 PM

#

i mean like learn from books

dim beacon Feb 25, 2018, 4:16 PM

#

You do not need to know about biological details to understand how neural networks work, simple artificial NNs models are mathematically very explicit about it

#

However if biology interests you, do not keep yourself from reading about it anyway 👍

hasty maple Feb 25, 2018, 4:18 PM

#

optimize a cost function, that's how they work

dim beacon Feb 25, 2018, 4:20 PM

#

@hasty maple indeed, but this is true for every ML model, what I wanted to tell when talking about "how NNs work" is how this cost function is optimized, which is using backpropagation (with sometimes some more specific details)

hasty maple Feb 25, 2018, 4:23 PM

#

Ah, I was just giving a simpler shorter answer.

#

It wasn't in relation to your example.

thick siren Feb 28, 2018, 11:36 AM

#

I'm wondering how to make a tensorflow neural network with two inputs, two hidden layers and one output

#

using relu as the activation function

dusky agate Feb 28, 2018, 4:19 PM

#

have you read the TF docs?

hasty maple Feb 28, 2018, 5:21 PM

#

@thick siren https://github.com/MaxPoon/coursera-Deep-Learning/blob/master/Improving-Deep-Neural-Networks-Hyperparameter-Tuning-Regularization-And-Optimization/week3/Tensorflow%2BTutorial.ipynb

#

It's coursera's andrew ng's course tensorflow basics code, maybe go through this to understand how to use tf

thick siren Mar 1, 2018, 3:17 AM

#

Okay thanks @hasty maple

#

Will do

deft harbor Mar 1, 2018, 9:04 PM

#

Enjoying that link, thanks

spare arch Mar 3, 2018, 12:54 AM

#

hello

spark nimbus Mar 3, 2018, 9:39 PM

#

So I was working on a neural net I probably took from somewhere

#

and I'm having issues setting it up

#

if someone's able to help me out, contact me either here or in DM

stone oasis Mar 3, 2018, 10:08 PM

#

@spark nimbus what api? tensowflow? what o/s? using cuda?

spark nimbus Mar 3, 2018, 10:09 PM

#

none of that

#

just own implementation

#

nothing too big

#

📎 unknown.png

#

just some numpy size issues

lapis sequoia Mar 3, 2018, 11:42 PM

#

say

#

this stuff looks cool

earnest prawn Mar 3, 2018, 11:42 PM

#

this stuff looks cool

lapis sequoia Mar 3, 2018, 11:42 PM

#

how do i implement machine learning in my program

earnest prawn Mar 3, 2018, 11:42 PM

#

thats

#

a very broad question

#

considering that

lapis sequoia Mar 3, 2018, 11:43 PM

#

are you an expert at machine learning

#

or AI

earnest prawn Mar 3, 2018, 11:43 PM

#

machine learning can have dozens of usage cases

#

and there are dozens of libs

#

and no im not

lapis sequoia Mar 3, 2018, 11:43 PM

#

oh ok

earnest prawn Mar 3, 2018, 11:43 PM

#

there is (not sure) no one on this server who is

lapis sequoia Mar 3, 2018, 11:43 PM

#

like whats the first project you'd assign me

earnest prawn Mar 3, 2018, 11:43 PM

#

but i guess you can ask stuff anyways

lapis sequoia Mar 3, 2018, 11:43 PM

#

that involves machine learning

earnest prawn Mar 3, 2018, 11:43 PM

#

oh

#

classifier

#

for numbers

#

to be more accurate

lapis sequoia Mar 3, 2018, 11:44 PM

#

what does it do?

earnest prawn Mar 3, 2018, 11:44 PM

#

classify numbers in a fixed size black white picture

#

well

lapis sequoia Mar 3, 2018, 11:44 PM

#

ooooh

#

where do i start

earnest prawn Mar 3, 2018, 11:44 PM

#

it gets a picture and gets the number out of that

#

its like the hello world of ml

lapis sequoia Mar 3, 2018, 11:44 PM

#

i see

earnest prawn Mar 3, 2018, 11:45 PM

#

well you do start by choosing a lib

#

there are.... a LOT

#

pytorch
tensorflow
scikit learn
and and and

#

but those are the most popular

lapis sequoia Mar 3, 2018, 11:46 PM

#

oh okay

#

what do you recommend?

earnest prawn Mar 3, 2018, 11:46 PM

#

oh

#

i dunno

#

i heard tf is more diy scikit is more highlevel and nothing about pytorch

#

tf is by google and now open source
pytorch by facebook and now opensource
and scikit learn well free software from the first day

#

i couldnt tell you much more without doing some research tbh

lapis sequoia Mar 3, 2018, 11:47 PM

#

welll

#

what do i do first

#

oh

#

have u done it before?

earnest prawn Mar 3, 2018, 11:48 PM

#

i have done the number detector in scikit

#

for reason i dont remeber anymore

lapis sequoia Mar 3, 2018, 11:48 PM

#

oh

#

can u show me how to do et

earnest prawn Mar 3, 2018, 11:48 PM

#

but most popular ai stuff you hear about is tf

lapis sequoia Mar 3, 2018, 11:48 PM

#

ah okay

#

well whatever impresses my computer science teacher

earnest prawn Mar 3, 2018, 11:48 PM

#

and two things
no i cant thats not the point of helping
there is a tutorial on their website

lapis sequoia Mar 3, 2018, 11:48 PM

#

i topped my first semester class

#

oh ok

#

can u link me to it?

earnest prawn Mar 3, 2018, 11:50 PM

#

http://scikit-learn.org/stable/tutorial/basic/tutorial.html

lapis sequoia Mar 3, 2018, 11:51 PM

#

📎 Akko_Excited.gif

#

tbh

#

i want to be a programmer

#

yet an artist

earnest prawn Mar 3, 2018, 11:52 PM

#

but?

lapis sequoia Mar 3, 2018, 11:52 PM

#

yet a musician

earnest prawn Mar 3, 2018, 11:52 PM

#

well you are lucky

#

machine learning and a few other programming topics mix that

lapis sequoia Mar 3, 2018, 11:53 PM

#

awww yeeee

#

problem is

#

im shit at being an artist

#

and i never made music b4

#

well i made this small melody on fl studio but thats it lol

#

📎 woosh.png

#

like this is the best i could do

#

lol

earnest prawn Mar 3, 2018, 11:54 PM

#

better than i could ever do

#

but for example there is an ai which can make a van gogh out of a picture

#

or

#

make a bach out of bach examples

lapis sequoia Mar 3, 2018, 11:55 PM

#

yeah

#

ive seen and heard

#

holy fuck its complicated

earnest prawn Mar 4, 2018, 12:00 AM

#

the scikit thingy?

lapis sequoia Mar 4, 2018, 12:02 AM

#

the link u sent me

earnest prawn Mar 4, 2018, 12:03 AM

#

well

#

that is the high level stuff

#

the real machine learning and science stuff is in the functions which get called there

lapis sequoia Mar 4, 2018, 12:04 AM

#

oh

#

well is there anything lower leveled

earnest prawn Mar 4, 2018, 12:04 AM

#

tensorflow

lapis sequoia Mar 4, 2018, 12:05 AM

#

ok

hasty maple Mar 4, 2018, 10:36 AM

#

@spark nimbus The shapes of a and y are not same, numpy broadcasting is the issue, reshape y to y =y.reshape(y.shape[0],1) before delta = a-y and should work I think

spark nimbus Mar 4, 2018, 10:53 AM

#

It worked fine previously though

#

@hasty maple only after repurposing it did it break

hasty maple Mar 4, 2018, 10:55 AM

#

repurposing?

spark nimbus Mar 4, 2018, 10:58 AM

#

It used to be written number recognition

#

Of 16x16 images with grey tints only

hasty maple Mar 4, 2018, 11:00 AM

#

and what is it now?

spark nimbus Mar 4, 2018, 11:02 AM

#

Emotional recognition in sentences

#

Maybe I just screwed up the data format

#

Hold on

#

fml now I have to un-gzip un-pickle my data

hasty maple Mar 4, 2018, 11:06 AM

#

Well I don't think there is an easy way to re purposing an image classifier for sentiment analysis

spark nimbus Mar 4, 2018, 11:16 AM

#

You wanna help reimplementing it?

hasty maple Mar 4, 2018, 11:33 AM

#

I haven't done any NLP work before. so I don't think I wouldn't be of much help I think 😅

#

How big is the data set? if it's small enough I'll download it and when I eventually get to NLP we could compare our implementations :)

spark nimbus Mar 4, 2018, 11:45 AM

#

It's not too big

#

📎 monika.json

#

If empty, use "none"

#

Max input size should be changeable

#

And the amount of neurons should be [input_size, 3/4 input_size, 1/2 input_size, output_size (7)]

hasty maple Mar 4, 2018, 12:30 PM

#

most is empty though, how would the training go when you don't have enough labels 🤔

spark nimbus Mar 4, 2018, 12:47 PM

#

this is all the data the project lead gave me

spark nimbus Mar 4, 2018, 1:24 PM

#

once this works we'll add more data

narrow flare Mar 4, 2018, 8:47 PM

#

can someone give me a brief outline on what machine learning algorithms are?

dusky agate Mar 4, 2018, 8:49 PM

#

There's a bajillion youtube videos about this that can explain better than most people on this server

spark nimbus Mar 4, 2018, 9:14 PM

#

https://www.youtube.com/playlist?list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi

YouTube

Neural networks - YouTube

#

This is the best guide I've seen so far

narrow flare Mar 5, 2018, 12:08 AM

#

Magikarp ur right x) im just being a poo head as usual

serene pier Mar 5, 2018, 5:49 PM

#

does anyone have any experience with online learning?

#

basically retraining models with new data constantly or periodically

thick siren Mar 6, 2018, 1:23 AM

#

https://youtu.be/pqY_Tn2SIVA

YouTube

The Coding Train

5.1: Doodle Classifier: Introduction - Intelligence and Learning

In this series, I build a "doodle classifier" using the Google "Quick, Draw!" dataset and my JavaScript neural network library. 🎥 Next Video: https://youtu.b...

▶ Play video

spark nimbus Mar 6, 2018, 8:18 AM

#

📎 Digraph.gv.png

#

This is fine

#

Iike how despite all weights being random, they all still look the same in each row

hasty maple Mar 6, 2018, 8:20 AM

#

They just show the fully connected layers, not the actual weights

spark nimbus Mar 6, 2018, 8:20 AM

#

Yeah I know they don't show the weights

#

But the value of each neuron in each layer has such a low diff

lapis sequoia Mar 6, 2018, 8:55 AM

#

any of you guys happen to know good algorithms for image upscaling, preferably i want to get something like this

📎 starman.png

#

not sure how this was generated though

hearty hazel Mar 6, 2018, 9:48 AM

#

Have you looked at waifu2x? It's an interesting image upscaler

spark nimbus Mar 6, 2018, 9:57 AM

#

why is backpropagation so hard aaaa

lapis sequoia Mar 6, 2018, 9:59 AM

#

i have, it's quite decent but it's not good at upscaling real-to-life images @hearty hazel

#

this looks like some sort of texture remapping

#

i forget what they call it specifically

#

@spark nimbus it's just math™

thick siren Mar 6, 2018, 10:13 AM

#

@spark nimbus feels bad man

hasty maple Mar 6, 2018, 11:05 AM

#

@spark nimbus http://bigtheta.io/2016/02/27/the-math-behind-backpropagation.html maybe you could go through this to understand backprop, there is one mistake in it though, let's see if you will find out if you go through it :P

spark nimbus Mar 6, 2018, 11:05 AM

#

📎 unknown.png

#

I know the math

#

just not how to implement

hasty maple Mar 6, 2018, 11:06 AM

#

oh

hasty maple Mar 6, 2018, 11:23 AM

#

why not just use an opensource library?

#

that only requires forward propagation,it will do the back prop for you

spark nimbus Mar 6, 2018, 1:22 PM

#

I don't understand shit about the terms they use

spark nimbus Mar 6, 2018, 6:44 PM

#

📎 unknown-19.png

#

Managed to implement these

#

Just the bias left

spark nimbus Mar 6, 2018, 7:59 PM

#

@hasty maple for $delta_{n+1}, should I take the sum of all the weight deltas for this neuron or the average, or what?

📎 unknown.png

#

📎 unknown.png

hasty maple Mar 6, 2018, 8:12 PM

#

You first canculate delta_N the final layer delta given by ( yN - t )* derivative of activation function for that layer

#

@spark nimbus

spark nimbus Mar 6, 2018, 9:59 PM

#

@hasty maple can you look into my code for a bit?

thick siren Mar 7, 2018, 2:48 AM

#

https://www.youtube.com/watch?v=pVgC-7QTr40

YouTube

Two Minute Papers

Building Blocks of AI Interpretability | Two Minute Papers #234

The paper "Building Blocks of Interpretability" is available here: https://distill.pub/2018/building-blocks/ Our Patreon page: https://www.patreon.com/TwoMin...

▶ Play video

thick siren Mar 7, 2018, 7:20 AM

#

Is there any gui tool you can use to make functional neural networks?

#

I can't figure out tensorflow

hasty maple Mar 7, 2018, 8:26 AM

#

@thick siren maybe try keras

#

It's not gui but at a higher abstraction level than tensorflow

thick siren Mar 9, 2018, 10:56 AM

#

ahhh it's so hard

tawdry dock Mar 10, 2018, 4:25 AM

#

hello