#data-science-and-ml
1 messages · Page 167 of 1
Alright, nice
In the main image, its like 135-180 degrees is the goldilocks zone. What these datasets seem to have in common in most cases is they're "evolved" information systems that are subject to selection pressure. They represent stable, prsisten patterns that have survived over time and they're in the optimal learnability zone, complex enough to be interesting , simple enough to process.
Like financial markets evolved through economic selection pressures, biological sequences with 4 billion years of evolutionary optimization, language patterns evolution and communication efficiency, image structures, time series. you get it.
Then you got some outliers. Like Artifically constructed data, pathological cases or ery recent/unfiltered information but its all the wrong angle.
87.1% cross domain classification accuracy baby, the pattern is real, not random
it is amazing, where did you learn it? i learned about models and how it works, i am currently understanding agents, if it is possible for you to tell me how or what should i learn to be able to do that
i would be thankful
Hi
hey 👋
I just think of ideas I like after read about new methods or something I learn. I think about what I have in my arsenal of code and how I connect the best tools together for the right job. You get better the more stuff you make. Agents are a hot direction right now.
Anyone knowledgeable on how to use the subset dataloader in pytorch to just get three classes from the fashionMNIST dataset? I’ve been running around in the documentation and trying things but nothing is clear, and the best way I’ve found is someone else’s method that directly uses the full dataset from its github repo and more ‘manually’ extracts the three classes i’m interested in.
Re: the subset dataloader, I would want to use a boolean mask (or whatever equivalent form of indexing) to just grab all instances of T-shirts, coats, and shirts, and then do my thing from there
Hey gang, How do i practice data science? I wanna invest my time into learning numpy and sci-py and matplotlib but i have nothing to apply myself towards or any data to mess with, any pointers?
the most important thing is that you don't try to learn in terms of libraries. if you go into it thinking "I'm going to learn numpy and then I'll learn pandas and then I'll learn scikit-learn" you won't get anywhere.
focus on learning concepts, and use whatever libraries you need to implement something that applies that concept.
thank you for your response, but knowing this i'm not really sure of where else to go from here, do you suggest a data science book like the one openstax offers? https://openstax.org/details/books/principles-data-science
I've never heard of openstax
!resources data science
The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.
@stuck belfry you can look at this curated list ^
wow this looks sweet, thank you again
also, as a reality check, keep in mind that it's very unlikely that you'd be able to get a job in data science without a degree, so if you're trying to work in this space, you need to do that.
lol i'm aware, i'm learning it for my own self interest
that's good. I think people should always learn about things that interest them, even if there isn't a profit motive.
i'm going to uni starting this august though, gives me time to refine myself before i go into the workforce with data science
or whatever else there might be, i think stuff like numpy is foundational to whatever other applications python has in mathematics as a whole
"data science python" is like its own dialect of python, and many of the distinguishing features of that dialect come from numpy.
hello and welcome to our wonderful data science chat
hello
Im learning data science w python and just data science in general
it's soo cool
i'll be reading the oreilly textbook mentioned on the list through the link you gave, i'll make sure to pass by here every once in a while to leave some hints on my progress, thank you again!
pretty much just a website with free online textbooks, i think it's mainly useful for high school students though
Hi, how would you advise someone who wants to be an AI Engineer and is considering switching from a full-stack developer? And how would you recommend building the resume to land full-time jobs?
I have an intermediate skill level with the MERN stack. I am trying to maximize my chance of landing a job, but I am worried I cannot specialize in any one field. I am anxious about the thought of not pursuing a field and then regretting later for not picking it earlier
Hey guys
how it looks is it used R with python or
R or python?
so should I focus more on python or R?
R is better if you're doing good ol' statistics, otherwise I'd just use python
e.g. semopy's definitely behind lavaan by quite a margin
(and the former's not even being updated anymore)
thanks, makes sense
R and Python are both popular programming languages for data analysis, but they serve different purposes. R is primarily focused on statistical analysis and data visualization, while Python is a general-purpose language with a broader range of applications, including web development and machine learning. Python is often considered easier to learn and more versatile, while R is favored for its specialized statistical capabilities.
not exactly true r is for ml too
source google search (probably gemini)
I don't know anyone who would choose R over python when it comes to ML
guys halp me
is my network dying?
import numpy as np
import nnfs
from nnfs.datasets import spiral_data
nnfs.init()
np.random.seed(0)
X = [[1, 2, 3, 2.5],
[2.0, 5.0, -1.0, 2.0],
[-1.5, 2.7, 3.3, -0.8]]
X, y = spiral_data(100, 3)
class Layer_Dense:
def __init__(self, n_inputs, n_neurons):
# defining layers
self.weights = 0.10 * np.random.randn(n_inputs, n_neurons)
self.biases = np.zeros((1, n_neurons))
def forward(self, inputs):
# dot product
self.output = np.dot(inputs, self.weights) + self.biases
class Activation_ReLU:
def forward(self, inputs):
self.output = np.maximum(0, inputs)
layer1 = Layer_Dense(2, 5)
#layer2 = Layer_Dense(5, 2)
activation1 = Activation_ReLU()
layer1.forward(X)
print(f"layer output: {layer1.output}")
activation1.forward(layer1.output)
print(f"layer output after ReLU: {activation1.output}")
#layer2.forward(layer1.output)
#print(layer2.output) # should be [[ 0.148296 -0.08397602]# [ 0.14100315 -0.01340469]# [ 0.20124979 -0.07290616]]'''
Neural networks don't "die"
output -
0.00000000e+00]
[-8.35815910e-04 -7.90404272e-04 -1.33452227e-03 4.65504505e-04
4.56846210e-05]
[-2.39994470e-03 5.93469958e-05 -2.24808278e-03 2.03573116e-04
6.10024377e-04]
...
[ 1.13291524e-01 -1.89262271e-01 -2.06855070e-02 8.11079666e-02
-6.71350807e-02]
[ 1.34588361e-01 -1.43197834e-01 3.09493970e-02 5.66337556e-02
-6.29687458e-02]
[ 1.07817926e-01 -2.00809643e-01 -3.37579325e-02 8.72561932e-02
-6.81458861e-02]]
layer output after ReLU: [[0.00000000e+00 0.00000000e+00 0.00000000e+00 0.00000000e+00
0.00000000e+00]
[0.00000000e+00 0.00000000e+00 0.00000000e+00 4.65504505e-04
4.56846210e-05]
[0.00000000e+00 5.93469958e-05 0.00000000e+00 2.03573116e-04
6.10024377e-04]
6.10024377e-04]
...
...
[1.13291524e-01 0.00000000e+00 0.00000000e+00 8.11079666e-02
[1.13291524e-01 0.00000000e+00 0.00000000e+00 8.11079666e-02
0.00000000e+00]
[1.34588361e-01 0.00000000e+00 3.09493970e-02 5.66337556e-02
0.00000000e+00]
0.00000000e+00]
[1.07817926e-01 0.00000000e+00 0.00000000e+00 8.72561932e-02
[1.07817926e-01 0.00000000e+00 0.00000000e+00 8.72561932e-02
0.00000000e+00]]
0.00000000e+00]]```
thats a scary amount of zeros im seeing there
oh STELERCUS
boy am i glad to see you
why?
I'm a bastard.
@zinc patrol why do you think so many of the outputs end up being 0?
i think its because of the ReLU, but im not sure.
right. the ReLU clips everything below 0
you might try a different activation function.
so i WAS right
hello guyssss
TAKE THAT RELU
discord still banned in turkey 😢
oops sorry gota bit carried away
I'll try sigmoid
which set
but thanks for the idea
whats that
i think ill use hugging face instead tho
some random dataset i found online
there's also leaky relu
neural networks from scratch
I'll try that too
oh okay
sigmoid fell out of favor in the first place cause vanishing gradients
hi guys so I want to build a ai website builder , where a user gives a prompt and the ai responds back with with necessary code . I want to use open source LLMs which are good for code-generation. Could y'll suggest me some LLMs that are good for this purpose , and also the parameters they have
you can visualize instead of staring at numbers
.
not comfortable way currently
I assume just plot functions output, after relu applied
depends on your hardware
nowadays LLMs come in all sizes
I have 16gb ram and 4gb vram
4gb vram isn't a lot so try something like Qwen 3 <4b models
use a quantized version so you can fit it in vram
I am trying to use the deepseek coder v2 lite
Does q4 help?
if you can stand 10+ seconds per token generation (and potentially longer), then sure
use the GGUF format so you can put your ram to use
Ohhh okay btw where do I learn the hardware for this? do I have serach llm hardware
you can usually estimate from parameter size
like deepseek coder v2 lite is 16b parameters, so if each of those parameters takes 4 bits (half a byte) you'd need around 8gb memory
- some more for the context
- you probably don't want to use
Q4_0norQ4_1but something likeQ4_K_M, which technically is higher than 4 bits on average
you also need to take into consideration the memory required for the inputs and outputs though, which varies depending on the context window size
for generating an entire website you'd need of a pretty large output
Got it , but why not q8 , as it would take the same amount of memory
q8 would take 2x the memory as q4
or in other words, 16b model at q8 takes about 16gb memory just for the model itself
Ohhhh okayyy I get it
So its not possible to get good results with a open-source LLM?
Considering my architecture
well I mean I suggested the Qwen 3 series
they're overfit AF and write super dry, but you don't care about that when you're just gonna code with it
Trueeeee , is it okay if use a code-specific LLM like codeLlama , wizardcoder
the ones you named here are highly out of date, so I wouldn't really consider them
Ohhh got it
like I wouldn't be surprised if Qwen 3 4b beats wizardcoder 15b at this point
The recent one would be qwen and deepseek?
the ones that are probably good at coding, yes
llama4 kinda flopped from what I hear
even before thinking about performance, it's outright impossible to run it in a consumer grade GPU
-# well, might be possible with a ridiculously bad performance, but the size is just absurd
Ohhh that's bad , anyways on my way to download qwen and deepseek thanks
How huge is it?
over 100B total parameters
well it's not the worst in the world due to it being an MoE, so CPU performance will be somewhat bearable on say a mac or ddr5 ram
for individuals though it's probably still better to just use a smaller, dense model
it says likely too large
Does it mean , i dont have enough disk space or i dont have enough ram to run it
yeah that's 32b, and that's qwen2.5
I said Qwen 3 which come in various sizes; something <4b is probably good for your setup
I feel like you're unlikely to get good results with complex prompts from anything lower than that tbh
I got that , but is it showing that likely too large , bcoz of my ram?
so 32B parameter model is like the bottleneck?
one of the strongest contenders for "low" param coding is honestly probably still QwQ 32b
or vram, idk I don't use lm studio
either way it's showing you a Q2 quant and I doubt it'll be coherent even if you could run it
I do have the choice to select q4 km
yeah and I doubt you can reasonably run that
I seee , looks like I need to switch to APIs most probably
are hard edges hard for 1d convlolutions? i.e when working with (simulated) LiDAR the data look like in the attached picture. It is rather high resolution but very sharp. (also attached a closeup)
Hey woundering if any one knows if its faster to do simple data cleaning with bash in the command line or with pandas ? Its looking like unix might be faster but woundering if anyone has a suggestion as right now im using chuks and setting it to 500_000 and my large files are taking a long time to clean. Im just removing the header and removing ^ symbols
hm... wdym by "hard" exactly?
there are kernels that can detect edges
just significantly making it harder for the model to converge
or requiring more parameters / larger kernel sizes to capture meaningful features
im trying to overfit on this "clean" simulated data before moving to realworld data but am running into real problems fitting the model
maybe?
sed 's/\^//g' data.txt > cleaned.txt
this simply removes all ^, you can try if it's faster
for the header, depends on if you can reliably detect a header ig
I'm not so sure on the convergence difficulty, sry
Yeah i was thinking or running something like that would prefer to keep the pipeline all in python but i think running it like that might be faster. Right now i am pulling from aws bucket as csv -> coverting to txt -> cleaning with bash -> then gzip scp to a new server -> unzip and load into a netezza server . I really wish i could simplify its pipeline 😂😂😂
all good 🙂 currently fighting with mode collapse anyways haha. probably some other underlying problem
maybe you could try high dilation since you said it's high resolution?
so the kernels could see more of the sequence at once without increasing the parameter count by too much
Good idea
bumping this in case anyone has an answer
Wow. I was already using dialation but only with a factor of 2-6 or so. Looking at the data my features were way larger! (60 datapoints or so for one "feature" / peak or whatever). I've downsampled by a factor of 10 because i really dont need the resolution for these tests and its training much better! Now the dialation actually covers the features
@jaunty helm Went from this
to
ignore the wonky GT data distributions. Thats from my simulation.
all that with under 300k params 🙂 definitely viable and a solvable problem i can invest some more time in.
If i want to learn maths and stats for data science which is the best option
I want to create an AI personal assistant using python. I asked Chat gpt to draft me an outline for the process but I am still lost. Where do I start?
👋 Hey everyone! Imagine trying to understand healthcare data privacy rules, but it feels like wandering in a confusing forest. That’s why I built MediRAG Guard.
MediRAG Guard is special because It doesn’t just search for keywords, it understands how pieces of information are connected using a unique hierarchical Context Tree. It uses Python, Groq, LangChain and ChromaDB. This helps it give you clearer, more accurate answers.
It’s like having a guide in the forest, showing you the way! 🌳
Check out the demo and see how it works: https://github.com/pr0mila/MediRag-Guard
Sure I can help!
whats the class mapping look like? What one is it exactly theres small medium large
i'm not sure i understand what you're asking?
What's the next step after convelution nets?
here is the meat of the network
here are the initializations for the weight and bias terms
@lapis flax it's easier for people when you give all the code as text. not as a screenshot.
!code
I'm working on combination on their own networks but what's after convolution?
are you asking me this or just asking in general?
Just in general cuz I mean a convolutional neural network I know how I can give it different image I just have to put in the folder name or at least the folders with their labeled and then train until there's a specific accuracy that I'm loving granted I haven't implemented a threshold yet I've been meaning to get that done but I've been busy planning a few hundred things
about the MINST fashion dataset
yes this exactly is what i was wondering about
im away from comp but will take a look later, do you know if this is covered in any of the torch documentation tutorials?
either way, thank you so much!!
Just in general
i commonly see some kind of max pooling layer in convolutional neural nets
So audio or video labeler
what would be the best way to detect an image? im thinking of using cv2 but since the object im trying to detect is a card with text and no images and it seems to detect based on noise, idk if it will work
depends on what you're trying to detect?
some OCR engines can be used to identify text bounding boxes
its like, a bunch of these things, im making an ai that plays a roblox game for me, im trying to win a bet with my friends
the issue im running into is that they all look the exact same, except for different colours and text
hello, can someone please help me with the coding🥺 it's more on django and type reaact🥺
That sounds more like a #web-development type of question. If you could share it in that channel with your code and error, someone there might help
would a custom image processing library (kind of like PIL) be a good project for a computer vision resume
it would definitely indicate you know the CV algorithms yeah
Has anyone ever taken this course ?
https://www.udemy.com/course/python-coding/?srsltid=AfmBOopm4dzFrPHU-owY2JQZd_YpaJsjwoca1POPrdMjCt0iPq8aS8Xm&couponCode=ST19MT280525G1
I just want to have some opinions about it so that I can decide if I should purchase this course or not
From my understanding so far, machine learning is a combination of math (especially statistics and linear algebra) and programming (popular languages include python, c++ that you will combine with R, sql, Scikit-Learn, etc.)
machine learning is an application of theoretical math and you implement the algorithms that do it in Python.
you mixed up a few things that are orthogonal. R is a programming language for data analysis. SQL is a query language for tabular data. scikit-learn is a python library for statistical machine learning (ie not neural networks).
but in scikit-learn you have perceptron
sklearn implements very basic neural networks, but that's it
Alright, thanks for the clarification, I am still new to it
What you guys think of this : https://www.udemy.com/course/python-coding/?srsltid=AfmBOopm4dzFrPHU-owY2JQZd_YpaJsjwoca1POPrdMjCt0iPq8aS8Xm&couponCode=ST19MT280525G1
I'm not sure if I should buy it or not ?
is shiny posit usefull for beginner?
iirc it's popular in R, but not so much for Python
there are Streamlit, Gradio, Dash and many other similar libraries that have widely used around in the python ecosystem for longer, but I guess Shiny also works
arguably even Marimo could count as an alternative for it
There are a lot of options, if you want to make a dashboard or extremely simple web app they are all useful regardless of your level of experience, which one you pick is mostly a personal preference
It seems like it's all about deep neural networks nowadays, are there any tasks where something else is used for the state of the art?
i mean, large language models are sorta what a lot of researchers have turned their attention to
there's a lot going on other than large language models.
that’s definitely true, perhaps i should specify that in my little corner of the machine learning and ai world, LLM’s are like the ‘hot new thing’ that a lot of people are thinking about and playing around with
attention pun not intended
wrong channe sorry :\
what are most important topic/s to in ML/Data science?
Statistics
You guys wanna see some cool images. I finally got the validation suite done for UCF..
93.1% on Cross domain classifcation
guys where can i find pretrained LSTM RNN for time series data
for self learners of data science, where do you guys go online to learn (free materials).
Been wanting to go into this hence choosing this as an elective course for my degree, but after finishing the course I realize I still got so much more to learn as when I go through other people's project I barely understood anything, so wanna go into it more on my own but courses on coursera and edx are not free
Kaggle is pretty good
I also like the statquest YouTube channel for learning statistics
I see statquest channel is statquest with (name of person which I forgot)?
statquest with Josh Starmer, yeah
You can search "statquest playlist" on YouTube and his playlists will just come up, I think they're organized pretty well
it's because it's "easy" as you can just make it bigger™️ and use more data™️ and usually it'll be close to the top (while using a lot of compute)
but take time series for example, I'm pretty sure the general consensus on those is still "for each problem, the best model varies"
That makes sense
I should look into time series then, sounds interesting
Wait there’s roadmap/tutorials on kaggle? I’ve always thought kaggle was only for providing datasets
and also there's this that says trees still outperform nns on tabular data, though it's a lil old
kaggle.com/learn is the link I think, they have mini-courses that teach you stuff
Damn ok will take a look
Highlights from the latest #nvidia keynote at COMPUTEX 2025. Topics include @NVIDIA's MASSIVE new Blackwell Ultra GPUs, new products like DGX Spark, DGX Station, and RTX PRO Server, and how they'll power generative AI models like #chatgpt by #openai and #deepseek R1, reshaping artificial intelligence and computing as we know it. Based on all the...
Cool, I just started learning random forests so that's exciting!
My uni didn’t teach stuffs like time series, MLOps, NLP and so on, saw these things on forums and I have no idea how they work
Or even they existed
Thanks sir lizard
is there a subreddit to promote and sell my python app as crypto
i made AIO pc usage tracker ap from ai code editor called cursor
sorry if its wrong channel...
when did uni teach any important stuff related to the industry?
no one teaches databases properly, no one teaches any kind of technologies that are used heavily (docker for example)
schools are obsolete af
no one teaches taxes, no one teaches insurances...
then when people exit the schools they gotta find these all by themselves, whats the point of schooling then? Is shakespeare more important than how to live in the society?
bah
rant over
well people originally went to uni to learn the theoretical stuff for research
eventually every job started to require a degree though :[
I had in the past course in machine learning but it was in R not in python, dont know why this was decision for uni course
i assume it was theoretical
its fine if thats the case
I suppose teacher has knowledge of R
the algos dont change
most probably
theoretical and practical (labs)
i dk man i feel like suffocating in my current masters degree
all these theories, no usecases whatsoever majority of the time
yes they teach without context
I can buy one which one i need to buy first
Hands-On Large Language Models: Language Understanding and Generation
or
AI Engineering: Building Applications with Foundation Models ?
Hello could I please have tips on training a small language model
can I have custom theme instead of default light theme in jupyter notebook?
dont like these green with gray this is ugly for me
dont know vs code with jupyter notebook maybe
what color scheme do you have for ai/ml?
I'm talking about readability and beuaty also
ok first question is googleable rather
use vs code
infinitely better
Can anyone recommend a machine learning book for a beginner? I'm thinking about either introduction to statical learning or machine learning with python
hey guys
I'm not sure books are the best for learning machine learning and neural nets, but if you really want a book, I would advice "Neural Networks from Scratch in python" (aka nnfs)
it would also be good if you saw some tutorials on youtube and pay close attention
i remember wanting to read nnfs 3 years ago
except my uni covered ML as a course the next sem and NN from scratch in python was literally an assignment
feel the maths/algorithm is more important than "python/implementation"
as long as you're decently confident with one programming language, you'll be able to do it from scratch
Do you mean training yours from scratch or fine-tuning an already pretrained SLM?
You just have brave the storm. Find ways to make it fun. I recently saw a video on YouTube where a guy was complaining of the same thing.
He enrolled for an AI masters program in Germany thinking he'll be doing Neural Nets and all that fun stuff but his first semester was just filled with mathematics, statistical proofs 😅
You must combined the applied side of ML (perhaps by learning and building cool stuff on your own) with the theoretical part majorly covered in technischule.
I think it's kinda cool in Germany cos they have Technical Universities (more suited for people interested in Research & PhD) and School of Applied science (for people interested in the engineering part of ML)
Training one from scratch
I'm going to presume you've sorted out the data collection part already or you have the means achieve that.
Read this https://lelapa.ai/inkubalm-a-small-language-model-for-low-resource-african-languages/
If you find it interesting and would like to dive more deep, then you should download their published research paper on arXiv. (That's literally all you need and perhaps with a little bit of googling here and there should you encounter any implementation they did that's not clear to you or you find it hard to reimplement.)
If you, however, consider this "a lot of work", then searching on YouTube might probably work best for you.
Tbf when I first started self learning ML/AI I thought I could just get straight onto using the tools but in reality, the mathematics is much more important. Understanding and learning the maths behind ML is so useful; I can actually understand how the ML libraries work, even if its at a surface level understanding.
hai guys
One confusing i my mind is data science is also make dashboard ??
what
Could i need to learn all dashboard like ploty ,powerbi and tableau ??
I am not understand where should i go i have learned pandas,numpy and matplotlib ,could i also need to learn Excel ,sql
Is data science job role also make dashboard or not ??
what are you trying to say
I haven't yet still trying to figure out what books I should probably teach it first or what text should I teach it first
Since you're just trying to understand how to build one yourself, I'd say, forget about collecting your own dataset for now and use any open sourced dataset instead.
More so, you might wanna start from fine-tuning a already trained SLM to see how it works before trying to start training from scratch or even with a custom dataset you created yourself.
What would you recommend what libraries should use for a small model?
Looking for a guy who knows programming, more specifically Python, in DM please, urgently
I know to find somebody with somebody who knows how to work with python just asking pytorch is there already a file for small language models or at least medium size models
PyTorch, Lightning, HuggingFace, Unsloth
I know does pie torch have anything for making a small language model just like how it has the data set that you can download into your network
No it doesn't.
Anyone have a project they want to submit for the project showcase? #announcements message
So where should I get the data set?
You'll mostly find one on HuggingFace. The LelapaAI website I shared also added link the train and dev set they used in training their SLM. Check it out
I’d prioritize learning SQL—but only to a practical level. Focus on mastering CRUD operations (Create, Read, Update, Delete) since that’s what you’ll use most often. You don’t need deep expertise like a Database Administrator; just know enough to query data, make updates, and handle basic tasks efficiently.
Which hugging face data set would you recommend?
I have a project where you control your mouse with a gyroscope/accelerometer (like from your phone/smartwatch)
a kalman filter isn't enough, do yall suggest a neural network to filter noise?
and if you have any, id be more than happy to hear suggestions on resources regarding NNs, im still having to learn. more specifically i thought about a recurrent NN (long term short term or whatever)
Got an example? I assume you have something like xyz time series
I can provide my script if you want me to, but the basic idea is as follows:
you collect accelerometer (and gyroscope) data, about every 5ms, so @200hz.
as of right now, it's being pulled from a local server hosted by phyphox on my phone. it does a great job at refining sensor data (kalman filter usw), but it's not enough
data looks like this:
acceleration = [(x1, y1, z1),
(x2, y2, z2),
(...),
(xn, yn, zn)]
rotation = [(x1,y1,z1),...] # you get the drill...
I measure the data live, so I can collect however much I want
Hey guys does anyone know where I can get the most recent annual mean cost of commodities like wheat,petrol and other energy sources per country?
I tried using our world in data and well there are a lot of incomplete data sets even for as far back as 2022
I created ai anyone have some idea to improve it and make it better
you have to be more specific than that for any help
I want to train an AI image model, where would I find a dataset for images?
i want to use csv files and train datasets using tensorflow and stuff
Which version of python and tensorflow should i use?
kaggle
or roboflow
is this your first time using a DL framework?
yep
because i know it gets error
version not matches etc
I wouldn't suggest Tensorflow anymore, it's being phased out by Google. I would use Keras with Jax or Pytorch.
Or PyTorch manually or PyTorch with PyTorch Lightning these days
It's also easier for beginners and has friendly API and is the most use one in academia.
what kind of image model ?
my teacher uses tensorflow so they said us to use that too
object detection or image classification ?
generative AI, its for a school project so i just dont want deformed hands 😭
it is very very expensive
ah okay, that's different then. It's still out of date.
Anyways what was the error you got?
my teacher is permitting me to use the school's gpu array, i think its mostly 1080s but theres a few more modern models
unless you mean the data is expensive 😭
if you have atleast dozen h100 we can talk 😂
n e v e r m i n d
😭 what did he expect me to do he literally noticed my interest in AI and he said "oh you should try to make your own"
you can make non generative image model cheaply and easily
text generative?
yea i have a classification model
improve on that it is easier than generative models
Actually you can fine tune your own model, don't train your own from scratch...that's expensive.
Look into Parameter Efficient LoRA
and 4 Bit Quantization
that is another way
you can fine tune on a single 4090RTX with this and unsolth
4 bit quant works with image models too ? i thought only text was possible
ImportError: DLL load failed while importing _pywrap_tensorflow_internal: A dynamic link library (DLL) initialization routine failed.
Failed to load the native TensorFlow runtime.
See https://www.tensorflow.org/install/errors for some common causes and solutions.
If you need help, create an issue at https://github.com/tensorflow/tensorflow/issues and include the entire stack trace above this error message.```
it's just quantization, float precision afaik...never tried it with image models...but why wouldn't it work with image models
just never thought about that
what about trend models? im decent at math for my age (im 16) and ive recently landed an internship at a quant firm, i want to see if i can make one that does low frequency trading off momentum instead of high freq trading like most quant firms. i think i can do something to smooth out the noise from the stock data by averaging the price per time period and using smth like fourier sequence to fit a sine approximation to the data
fine tuning sure yea ill look towards that
i am too dumb for impleamenting Ai models bruh
how did you install this package? was there any error? if not, try again with this
pip install tensorflow --force-reinstall
does anyone here have experience in stock market? does momentum trading provide reliable profit? even just 1% will be fine, i care more about a good sharpe ratio versus a good profit rater
using pip
i think adding llm for market news context will make better trades ?
it works fine with image dataset but it does not works with csv
show full traceback
news context usually is already shown by stock movements. of course, human analysts are used by quant funds since most signals are exhausted the second they show up, but then human analysts also cost a lot. im assuming that LLMs wont be able to provide as strong of a market context as human analysts, but itll be a good starting poiont. however news doesnt cover small firms as much, which is the market id want to target because of the signals there lasting for longer than 1/2 ms
File "C:\Users\laptops galaxy\Desktop\BSAI-5A\venv\Lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 73, i from tensorflow.python._pywrap_tensorflow_internal import *
ImportError: DLL load failed while importing _pywrap_tensorflow_internal: A dynamic link library (DLL) initialization routine failed.
During handling of the above exception, another exception occurred:
Traceback (most recent call last):
File "c:\Users\laptops galaxy\Desktop\BSAI-5A\ANN\logon\gan_project.py", line 4, in <module>
import tensorflow as tf
File "C:\Users\laptops galaxy\Desktop\BSAI-5A\venv\Lib\site-packages\tensorflow\__init__.py", line 40, in <module>
from tensorflow.python import pywrap_tensorflow as _pywrap_tensorflow # pylint: disable=unused-import
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "C:\Users\laptops galaxy\Desktop\BSAI-5A\venv\Lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 88, in <module>
raise ImportError(
ImportError: Traceback (most recent call last):
File "C:\Users\laptops galaxy\Desktop\BSAI-5A\venv\Lib\site-packages\tensorflow\python\pywrap_tensorflow.py", line 73, in <module>
from tensorflow.python._pywrap_tensorflow_internal import *
ImportError: DLL load failed while importing _pywrap_tensorflow_internal: A dynamic link library (DLL) initialization routine failed.
Failed to load the native TensorFlow runtime.
See https://www.tensorflow.org/install/errors for some common causes and solutions.
If you need help, create an issue at https://github.com/tensorflow/tensorflow/issues and include the entire stack trace above this error message.```
what's your code?
Gans Model
Please react with ✅ to upload your file(s) to our paste bin, which is more accessible for some users.
@limpid zenith what should i do...
Install Visual C Redistributable -> https://aka.ms/vs/17/release/vc_redist.x64.exe
okayy
"make your own" is probably not possible, not only do you need lotta compute you also need lotta data
finetuning / making a lora should be more doable
as for deformed hands... I've heard that flux mostly fixed that issue? tbf I've never ran it cause I don't have the hardware
still same issue
did you reinstall tensorflow?
Yep
Maybe check WEF, or better still, try using LLM
Thanks man
No.
I was about to ask in svr make_pipeline, and the use of standard scaler, does it only apply standard scaler() on X , or does it also apply to y
and also, you sound more... cheerful
Just on X
To anyone who's hosting their own LLM, what is the cheapest GPU VPS service available?
I want to try and build a live proof of concept
Introducing MediBeng-Whisper-Tiny! 🚀
We’ve fine-tuned OpenAI Whisper-Tiny on Hugging Face to 𝗧𝗥𝗔𝗡𝗦𝗟𝗔𝗧𝗘 code-switched Bengali-English speech into English. This helps improve doctor-patient transcription and makes clinical records more accurate. 🏥🎙️
Bonus: It’s an easy way to fine-tune Whisper for translation tasks!
Check out the repo for more details and try it out!
🔗 MediBeng-Whisper-Tiny GitHub Repo
: https://github.com/pr0mila/MediBeng-Whisper-Tiny
🔗 Hugging Face Model:hugging_fire: : https://huggingface.co/pr0mila-gh0sh/MediBeng-Whisper-Tiny
If you like it, don’t forget to give a ⭐ and 👍!
#AI #SpeechTranslation #HuggingFace #WhisperTiny #HealthTech #AudioToText
MediBeng Whisper Tiny improves doctor-patient transcription by training the Whisper Tiny model to translate mixed Bengali-English speech into English, making it easier for analysis, record-keeping...
Can anyone please help me with fine tuning my tesseract ocr
https://discord.com/channels/267624335836053506/1379045149916528691
Can anyone tell me about "JSON handling for data exchange"??
That depends™️
What do you wanna know about it?
is it possible that i take the most advanced ai model from hugging face and tune it to generate youtube thumbnails
like https://pikzels.com
If you pick "the most advanced" model, you probably won't have enough GPU memory to fine tune it. But what you said is possible in principle
Someone highlighted your message to me which is why I rejoined the server. This message is very much off the mark, if you're interviewing then don't mention any of this
Hello why do I get significantly different results between my Google Colab (GPU) and local (CPU) runs?
both of them are using ```py
pipeline = Pipeline([
('tfidf', TfidfVectorizer(sublinear_tf=True, strip_accents='unicode', analyzer='char',
ngram_range=(2, 6), max_features=40_000)),
('clf', XGBClassifier(random_state=42, enable_categorical=True, device="gpu"))
])
le = LabelEncoder()
y_encoded = le.fit_transform(dataset['class'])
pipeline, y_test, y_pred = train_e2e(dataset['text'], y_encoded, pipeline)
CPU colab results are the same as the CPU local ones
do you have a local gpu to also compare with? what immediately comes to mind is that gpus by default use 32 bit floats while cpus tend to use 64. most of the time this isn't an issue, but sometimes it can be. you can also set the bit depth to 32 bits for your cpu runs and see if they now match the gpu result
Thanks man
whats a good beginner book for linear algebra? Ive tried reading LinearAlgebraDoneRight but reading through it, i feel like im missing out on a lot of the context as my only exposure to linear algebra is vectors and matrices. Ill probably go back to LADR once I now about linear algebra but need the base knowledge first
can someone help me when i m importing another python in the same folder its not getting imported why
what's the error message and the folder structure?
folder struc
ai/dataset.py
model.py
prediction.py
train.py
error is cominhg moddule not found something like this
you can try gilbert strang's linear algebra for something based on matrices and vectors and with applied examples
When do I use it?
why I see mode 0 50000 instead of mode 50000 in example on simplilearn demo? ignore 0?
or maybe this 0 has some meaning
hmm I assume its index 0
mode() of pandas dataframe
see docs?
I confusingly thought its bimodal with 2 values but its probably not
yes it's index 0
Always returns Series even if only one value is returned.
Ill have a look. Ive seen that book recommended quite a bit. If I were to use that book as my primary source, what other books should I read for furthering my understanding (particularly in applying linear algebra to ML and other areas of programming/science that require it)?
probably a book on ml
Thats actually reminded me of a book which i downloaded called mathematics for ml or something like that (youll know the one as its commonly used)
I hear AI all the time. But I never hear about the people going into the weights and trying to figure out how it all works. Which is strange considering how common AI is.
I'm not sure about books. But IMO one of the strongest increases in understanding on linear algebra's applications is when you understand its connection to graph theory, and representing problems with graphs.
This is because both linear algebra and graph theory can be applied in so many cases, and they work nicely together.
What do you mean by "going into the weights?"
people dont need to understand all levels of abstraction to meaningfully extract value from something
and yeah weights in what sense? the sentence is analogous to "people going into the transistors of electronics": AI covers a huge field, and not all models will even have weights
Fixed it yet? If no, I'd like to see how you tried making the import
nah
not fixed
its like from dataset import datset . dataset is a tuple name inside dataset.py file
Cool. Are you using VSCode or Jupyter? Where are you making the import ?
Can you show the structure of your directory?
vs code
ai/dataset.py
model.py
prediction.py
train.py
structure
I don't expect everyone to play with the weights, since some people just want to use the tool. Different strokes for different folks as they say.
I find the extreme lack of people interested in playing with the weights strange.
Most AI, such as LLMs and generative art has weights. I am also curious about AI that does not have weights of course, as it works in a different way.
Good. So on which script are you making this import?
i fkin used all sort of gpts , ai , llms everything its not working
"Most AI has weights" what? maybe the flavour de jour nowadays is genai, but theres many industries and hobbyists who are using more old fashioned ML stuff every day lol
adjusting individual weights is basically just backpropagating but less accurately. would say its better to mess around with actual network architecture than spending any time on weights
can someone teach me how to make a neural network like in detail but in an easy way
I'd like to see a full traceback if possible. Meanwhile, inside your ai directory, add an empty __init__. py script there as well
i added it didnt worked
I am curious how they train the old fashioned AI? Is it more of a hand-made decision tree?
sorry what are you talking about
Are you perhaps talking about AI interpretability? That is a thing that some people are working on, sure
Yes I would like to reach out to these people more.
Hmmm... Just double check once again from VSCode terminal, that you're indeed inside the ai directory. If not, you have to cd into that directory for it to work.
Ok so for the past 6 months i have been trying to work on a basic chatbot, but it's failing miserably
What i have tried:
Different types of model:
Chat generation
Rag
Seq2Seq
But none of that works, I'm taking my data from HF about 100k lines of conversational data
My latest model's params were 18M but yet it is still spewing gibberish
Any suggestions?
Now we wait ☕
Oh also I'm only using pytorch
DONE EVERYTHING ITS NOT FKIN WORKING
would be very helpful if you provided screenshots, tracebacks, more detailed logs
if you don't want to debug nor cooperate with people trying to help, just install a package manager like uv and create a new project following its documentation, then copy/paste your code into the template it generates
I get that it can be frustrating when our code isn't going as planned, however, behind every error message, there's always a clue therein.
Can you send the full traceback?
ya like its that man tbh it took like 5 hours for that still it wasnt working
if u think why did i use caps lock . it was toggled for another work but i used it for it
Guys
yes sir
Can someone give me a path to be a data science? Bc I want to to this later
i aint a data science eng sry
vivek babu wsp
math, lots of math is a good path to it
study calculus and linear algebra, stats & probability theory
then learn python and or r and then either take a course on data science or watch videos or read books on it
learn to work with kaggle environments and such
probability is easy tho
buwet calculas and linear algebra are hard asf
wdym? probability is basically just combinatorics in disguise with kolmogorov axioms applied to scenarios
and combinatorics is difficult
bro tryin be shashi tharoor
Ah and I also have a question. I'll learn all that stuff in 2,3 Years. Should I learn Python, R etc. First??
I mean math
I'm 7.grader🥲
you can both at the same time, study math and python / r
Ight, do u think I can learn many stuff of python and R in 2-3 years(also math) and I can get a job?
I can already understand the basics of python and it's very interesting
i can't gauruntee anything since the job market is really bad rn, but it;s definitely a good asset to have learned
Ok👍
Hey guys can anyone help me for finetuning my tesseract for getting better accuracy
really need the help
what exactly do you need help with?
which part of the fine-tuning process are you struggling with?
otherwise this seems like a good start: https://github.com/Matleo/Tesseract_fine_tuning_training
Thanks alot for text
i will explain to you
So like i was trying to finetune, and i got really bad result, did using this doc. https://arcruz0.github.io/posts/finetuning-tess/
so like as said created files with image and its gt.txt value about 200. But when i tried to test it it gave me really bad answers, this is fully wrong and it detect nothing.
Here are the graphs of loss curve, which shows the result is good
i tired it but it really doesnt seem to work, i really need some help
what text are you parsing? can you share your dataset? I'm afraid 200 images might not be enough
does your dataset contain data that is similar to that which you are trying to parse or is it just some random generic dataset? I can't tell how some of the images I saw in your dataset in your help post relate to the parsed text in your image above
yo chill
you gain nothing from crashing out on discord
Has anyone ever used spink - moj python library
an entity-resolution / entity-record deduplication tools. For. e.g. you have customer table and there are duplicate entry of customers i.e. distinct customer id and may have slight error in name and other details being same or slight change in contact...etc
I'm facing perfromance reated issue
Does anyone know a source where I can download the "Blackbird" dataset? (https://github.com/mit-aera/Blackbird-Dataset)
The domain mentioned in that repo is no longer up and I can't find any sources for the data anymore. There is a torrent for it on academictorrents but nobody is seeding it right now so it's just stalled on my client.
If that specific dataset can't be found do you guys know of a similar dataset to it? It's a flight perception dataset for drones that includes camera streams, rpm and imu measurements, and ground truths for pose estimation. I'd gladly look into anything that resembles this at a fairly large scale, the datasets i've found so far are very tiny (a few gb at most)
atp I might just start contacting the researchers to see if they have it
that's kind of the basics of economics (which is to say, I'm not sure how useful of a hypothesis it is, or if it's even much of a hypothesis in the first place)
https://www.investopedia.com/terms/e/economic-cycle.asp
I do otherwise like the idea though, it seems cool and might turn out to be more accurate than a monkey throwing darts after all 😄
Hi guys, i'll be spending 2 months in the USA this summer and i wanna make most of my time by getting into ML, i started learning pandas and numpy a bit so far. How would i learn the most while still learning everything throughly to actually learn it not just scratch on the surface?
is the reason why we use MSE as opposed to MAE for gradient descetnt so we only have 1 global optimum?
MSE has a smoother curve that gets even more gradual the closer you are to zero
I guess it's not smoother per se but it's definitely more gradual
ah okay that makse sense
oh another thing
with SGD right you are never reaching the global minimum as opposed to batch - thats fine obv for most purposes but my question is
does that
A. Every prediction is slightly slightly off
B. Some predictions are still spot on but most are off
pretty sure its A right but could the parameters accidentally line up and make some perfect predictions like in case B or nah
even prior to machine learning taking off, there are many statistical reasons for why MSE would be used (that I'm too dumb to understand to explain to people)
Assumption of errors being the result of many small independent effects, which when combined results in a Gaussian distribution by central limit theorem. Taking that Gaussian assumption it naturally follows from the objective (argmax_theta L_theta[data] (MLE)).
(The square comes from the square in the Gaussian)
There is also a geometric understanding to this. Basically you are assuming that the probability density "cloud" is spherically symmetric, and from that the best estimate is the one closest in distance to the observed data, where this distance is the Euclidean distance (L2 (quadratic norm)).
following up on what squiggle just wrote, using MAE is like assuming that the error in each estimate (each entry of the vector you're looking for) is laplace distributed independently from the other estimates (instead of gaussian distributed)
the geometrical interpretation follows in a similar way, except we now think of more general "norm balls" instead of circular symmetry
these things would fall under "maximum likelihood estimation," in case you wanna take a deeper look yourself
I guys i am new to data science and i want to build an nlp project for my portfolio which is not too generic but also give me somethings to learn.
Can i please get resources related to that?
That's nice
its very creative ngl
how can i buid it?
what tools to use?
okok
can i get your receipts?
I have never done too much shopping😭
ok makes sense
well i will start shopping then😂
Yeah will do that
Dm maybe?
No--people need to buy it themselves
Understood. I'll delete my comment. Thanks for the quick reply.
it's disrespectful to the authors to send people free copies of their work
As in, what's AI doing with its ilegally sourced training data to output images?
what point are you trying to make? I'm not in charge of that.
I'm messing a bit with you. No ill intention, just pulling your feet for your profile pic. I worked within a legal tech, that aided fight copyright infringement, sharing something I legally bought is not wrong. It's pretty much a grey area, that's why piracy is really hard to punish, especially in those cases no money transactions is involved
Hello I am new in python i am creating a model audio to text, for used openai/wisper large-3 but my audio duration is 10 min or + how that process it been take very long time
and this process run on my local system any buddy tell me if can make faste
Sorry if this is getting sidetracked. @serene scaffold already helped me answer my question. I'll keep my unrelated posts to a minimum.
@final kiln both. Either helping companies, big or small or individuals that suffered from it.
I'm not sure if there's a way to get good performance with Whisper without a GPU.
Hey everyone. Just got in to a company that has an ongoing AI Project. Would love to have some advice on what I should focus on learning, but based on the direction of the project it seems we'll be automating a lot of stuff and probably develop some more projects for better quality assurance, do you think learning some Machine Learning or Data Science would help me keep up with the team? Also, are there any courses that you would recommend I take just so I can dip my fingers and pretty much understand the bigger picture of this project that I'll become a part of.
hmmm you are right, is there any platform to use GPU
I have tried AWS and GCP. GCP is a bit cheaper in my experience and usecases.
You have like 200~$ for the first two months for free when creating a new account for the first time.
See I have purchased open ai key
I am using there gpt4o model can anyone tell what is the context window size of this model.
You have a range you can explicitly define. Give me one second and I'll look it out
Yes
From 1 token, to 16384. Default is 2048 if undefined
2048 tokens by default
128k would be my first guess
Yes some in google chanting it
What's the truth range
128k context
16k is how much it can output at once
1 lakh 28 thousand is input and output is 16 thousand
note that the quality of responses likely degrades before hitting the 128k
Context window is the total of input and output
If you're using the API, it should be the same as https://platform.openai.com/playground, shouldn't it?
maybe it doesn't allow 128k on playground to prevent abuse?
idrk
No, you were right.
https://platform.openai.com/docs/models/gpt-4o
full context window is 128000, and max output is 16Kish tokens
What parameter in payload sets the context window
if i use my local machine for in develop case what is minimum rquirement of local system can atleast 10 minute audio,
current system is i5 7th gen, 2GB nvida
Because I am a huge model It not only audio to text
it compete transcript model which using as open source model
I believe this is used in the hidden chain of thoughts layer, we don't know what it's thinking process is due to the way they built it. I'm unaware if they changed it, but the last year I wasn't able to mess with that.
it is literally your input (prompt)
the only part of it you can control is the maximum amount of output tokens
AI is really demanding when you want to launch it locally. I don't think you can do much with it, but I'll search hugging face to see if there's a minimum model that you can withstand. What's your NVIDIA?
Yes by max token I can set the output length
Thanks for your response
One more thing , how can I improve prompt
write better
yaa I am trying to use hugging face, NVIDIA graphic card
A Comprehensive Overview of Prompt Engineering
clearly state your requirements
like, in what format is the data passed in
what do you expect it to output
Ok , thanks I am going to readit .
Do you want to do Text To Speech (TTS)?
Also, you also NEED to know about your graphics card model, in some cases you have to download a specific driver to be able to use certain AIs.
I'm assuming you use linux. I found this that you can either run it on a raspberry pi 4 or at linux with python.
https://github.com/rhasspy/piper
This is amazingly fast and lightweight, so I'd check it out if I were you. Also, if you're already at Huggingface looking at a certain model, read their documentation, that's better than anything else.
it good but some of language which i am looking it not available, simple
Audio to Text
Text to Transcript (any lang)
Transcript to Speech
that is my goal for now
I have very long vision regarding this model but currently need to ony these step
and yes if you good model for translation let me know
I use Argos Translate for translation but it not given expected output as compare with Google translation
what do you mean by "Transcript"?
transcription is the process of converting audio to text
I mean translated - script
Like audio in English get text and translated into other lang then use TTS for text to speech with new translated text
I don't think you'll find many open source TTS models with a descent quality that support more languages than piper though
maybe if you only need of a specific set of European languages, or Chinese/Japanese/Korean, but for other languages it can be very hard to find open source models that don't sound very bad
https://github.com/coqui-ai/TTS
no i am found this and check example
once it done i am work on voice cloning for good sound
it supports way less than languages than piper?
edit; ah you mean Fairseq Massively Multilingual Speech? I would expect for it to sound kinda bad, I cannot even find any samples online though
also their model weights license is a bit awkward
okay but i am finding hindi also included
if there any other model have support hindi lang let me
50 free calls / day iirc
unless you put 10$ into it, then like 1000 / day (you don't have to spend it, just having it in)
Is 50 free calls a day good
no
What iirc mean
if I remember correctly
If I remember correctly
What is good
I be seeing it and not knowing
depends on your usecase? maybe 50 is enough
imo 50 will be spent rather quickly unless it's just some one-off tests
What is 50 a day
like you can get 50 responses per day
Mistral and Google have some pretty nice free tiers, just be aware that any data you send to free APIs is as good as public as far as privacy goes
Should I use Google gemini
I can say that for mistral, you're highly unlikely to hit the limit as an individual
Or mistral
the "limit" is like 1 billion tokens per month
it varies~~, usually each word is 2~4 tokens in English iirc~~
https://ai.google.dev/gemini-api/docs/tokens?lang=python
For Gemini models, a token is equivalent to about 4 characters. 100 tokens is equal to about 60-80 English words.
I think it's closer to 1-3 now
most commercial LLMs work with nearly any language with a significant presence in the internet
if you can find online communities speaking that language odds are it'll work
How many tokens a day
I think so
mistral's models are very often advertized as having been trained on a variety of languages
I don't remember out of the top of my head, check their pricing/documentation
Oh so I should u mistrals
Use mistral
Gemini works just as well? what from that makes it sounds like mistral would be better or gemini worse
I want something with more tokens
elaborate?
as in, larger context window? more lenient rate limits?
I would like something that lasts long for many users
you are billed per token, and after a certain point models tend to work worse the more tokens are in your prompt
in general it is best for the tokenizer to compress your inputs into as few possible tokens as possible
(even on a free API, it'll affect response time and may affect rate limits)
I think there isn't something good free
Mistral and Google Gemini do have good free tiers (as long as you disregard data privacy)
They will use it to train models??
What the token size for open ai 3-4
why else would they give out something that expensive for free
Yeah that's okay I wouldn't send something private to the ai anyway
I would use it
Should I make the ai for a phone or pc
web, then it doesn't matters which device you are using it from
again that would depend on how many users there are and how token intensive your product is
just take a look at their respective limits that they clearly state:
Ok
So just want an opinion should I make my ai for PC or phone
For the first prototype before I start updating
I will use python backend for the model and the ai itself
Does anybody here know how I can use my GPU when training with tensorflow? I'm a bit uneducated on setting this up myself and could use some help
i feel like disregarding data privacy is pretty much a given at this point unless ur hosting ur own LLaMa model or whatever
python3 -m pip install 'tensorflow[and-cuda]'
tf.config.list_physical_devices('GPU')
and, if you're not required to use tf, pytorch is a much better choice given that most of the attention that was on devving tf has moved on to jax
or heck, if you don't need a really low level api just use karras, works with whatever backend you like
We're using Tensorflow in school and it is very frustrating
But that's just because I'm really bad at coding
Maybe you can help me
Im currently working on a project that should recognize the genre classifying it in the 10 genres that GTZAN gives you
I trained my own model, and one using VG16 as base model
When I test using a file from the training data, it always shows "pop" for my own model as genre, and it always shows "classical" for the VGG16 one
And now im stuck
mel spectrograms are the training data
224x224
yeah, first i tried by using the images that gtzan has in its dataset, but my models didnt train at all
so i just converted them myselves using the wav files
wait i was under the impression that jax was designed to optimize torch/tf/tensor operations
i read the message wrong lmao
i read it as someone was saying that jax was a drop in replacement for tf
so basically i misunderstood the message 💀 mb
is it good?
jax i mean. did you see a non negligible performance boost
ah
what about usability?
better than TF is the lowest bar out there lmao
icl tf syntax is atrocious
hmm maybe i’ll check it out
pytorch is all right. it’s package management is brain damage tho
can’t believe i have to make sure all my versions are compatible when i’m using torch vision, torch audio and torch text
can someone tell me if the prices quoted on this website are accurate?
https://thedigitalprojectmanager.com/reviews/n8n-pricing/
$20/mo - $50/mo - $??/mo
just check the official website? https://n8n.io/pricing/
they have have a free community version with a very restrictive license
Complete beginner to ml here, how do you know what model to use by just looking at a dataset?
I want to do some kaggle competitions to learn but how do I know what model to use, like a linear regression or something else
You have to decide what the model is intended to do, and then consider what kind of models can learn to do that in terms of the data.
The composition of the dataset doesn't necessarily dictate what you can do with it. So "I want to train a model on this dataset" is an incomplete goal. Train it to do what?
So what is a linear regression intended to do
Cause that's the simplest isn't it
Linear regression is a process. It's not an action that a model does.
"a thing that a model does" would be something like "predict the value of a home given its square footage and number of rooms"
OK so for that example you just gave, from that what model would you use
xgboost, maybe?
I do natural language processing.
Is this something you learn about when you learn ml, when to use said model?
You develop a sense for it.
Isn't nlp like coding in more human like language or something
It's not about coding in human language. It's just "AI and ML as it pertains to human/natural language"
ChatGPT is an application of NLP
it's not "natural language programming", it's "natural language processing" 😄
Help me understand...I went to a ML presentation from a vendor today. Our team provided them with a lot of data. The step in their process so far was to build out an unsupervised model to cluster our client base. This is all paraphrasing but essentially they did so feature engineering, parameter hypertesting...cleaning ecc. They then tried Kmean clustering with PCA...testing features i think of 5, 10, 20. I believe the Kmeans was insufficient resulting in ~800 clusters so they tried dbscan + a centroid distance based regrouping they got it to 30 clusters with ...i forget the term but essentially Edit: silhouette score positive rate of .51. We were tasked with analysing the different clusters and assigning labels to these clusters.
My question is...the only outcome from this model would be to then use those clusters/labels as "bucket" end points to then classify new incoming data correct? Some one my team (management) seem that this is the first step in further classifying our clients as like buyers/not ecc. I tried to really drive the point that what we define as labels WILL BE the classification we get...If I am completely off base or misunderstanding please let me know...im not smert
With k means clustering, you pick a number for k, and it finds k centroids that represent the k most cohesive clusters it could find
You have to decide what significance each cluster has, if any
If you have a new instance that you want to classify, it belongs to whichever cluster it has the closest centroid to.
Hey, I’m a CS grad from UPenn with a tech background. I just built and agent builder that lets you create any AI agent in 3 simple steps (takes about 3 minutes). The adds-on it takes care of the architecture, finding and connecting apis, and you get the full deployable code, turn it into an API, or export it to Lovable to try it out quickly.
If you want to give it a try:
https://search-dream-weaver-kit.lovable.app/
so im working with a dataset right now
from a league of legends API
the data for one of the features appears to be quite off/inaccurate, but i checked the script used to get the data as well as the API docs, and it should work
so should i just assume its broken xd cuz i cant find anything online about it talknig about the inaccuracy
Is there any ML Engineer here doing NLP work using Windows natively (no WSL, no dual-boot, no VM), and everything works fine? how did you get everything working smoothly?
ello
i need help with an error in my code
basically im trying to make a loss function for my neural network
and idk how to add a forward pass here
#calculates data and regularization of losses
#given model output and ground truth values
def calculate(self, output, y):
#calculate sample losses
sample_losses = self.forward(output, y)
#calculate mean loss
data_loss = np.mean(sample_losses)
#loss
return data_loss```
bit of a weird one:
time series classification, I know my data is from int16 sensors, and I can see that some parts of it sit at exactly 32767 for an extended period of time
would you try "fixing" the time series somehow? or say ignore it and make a complementing series that's 1 if the data is clipped and 0 otherwise
do you know why it is that happens? if you're certain it's missing/erroneous data, then yeah, you can use a mask and treat it as a problem with missing data
I'm pretty confident that it's cause the sensors are int16
like the only other value this happens at is -32768
and there are no greater/lower values than 32767/-32768
mhm
but i guess the way you would treat it is different if the problem is clipping as opposed to just missing data
in either case though, it does make sense to mark the entries
an example slice of the time series where I think clipping is happening
(technically there are 6 series but I don't think that's too important for this issue)
other than clipping, it looks to have pretty high periodicity
ig what I'm asking is, are there techniques to somehow transform the data so it'd be like if it wasn't clipped?
the one that has the hover is the angular velocity during a hand movement in the x direction
ye I thought this would be a good learning opportunity, cause I've never ever really tried "unscrewing" time series before
did you write the code that reads the sensor data? or using something found online?
phase unwrapping is not really simple if you don't have an accompanying quantity
it's a competition setting, so I only receive the data
acceleration in xyz and angular velocity in xyz
you can probably do some sort of fitting that requires a couple of derivatives to be continuous
should be doable with splines
that's like enforcing smooth motion
hm, so something like the SplineTransformer I assume?
I'll look up on it
do you know for a fact that the data is periodic? if that's a fair assumption, there are periodic splines you could try. but yeah, you can try masking out the clipped values and interpolating them back in with quadratic or cubic splines
there are fancier things you can do, but i think this is a good starting point. maybe someone else has better suggestions
alright cool, ty!
you could grab the differential equations of motion and relate the 3 components of linear and angular velocity to them, then have your network spit a solution to the differential equation
that'd be a more inverse kinematics approach, what people nowadays call "physics-informed neural network"
or if you have enough data, fr you can probably just mask out that data and train a black box network as is 😛
but splines are a very manageable first step and benchmark. polynomial good 💻 🐒
Thanks. This makes sense as a way to ID clusters that have some business importance from an "unknown" combination of factors, but if we have the "Y" value already...doesn't it make sense to use that as the target in a supervised learning from the start? Or at least see if our classification is stronger that way?>
hey guys where can i learn probability and statistics for data science?
I've just found this ML project repository which has a ton of projects. I was wondering if itd be worthwile pinning it. Feel like itd help beginners like me who have understood the maths but not quite know how to use the tool effectively
Hi
I need some help in prompting
If anyone could help me , please mention me
This is the sytem prompt I am giving to open ai gpt4o model .
Please react with ✅ to upload your file(s) to our paste bin, which is more accessible for some users.
✅
Can anyone help me ?
is simplilearn good or choose other channels and which?
about machine learning and data science
just want to watch just enough to have better understanding when doing some project
dont want to watxh redundant videos also dont want to watch while yt about ml and data science
simplilearn has playlist with 160 videos about data science and approx 500 about ai
but looks like or not they are overlapibg
Bro it is a paid course or free ?
free
Are u comfortable in hindi
no
Then I don't have any suggestions for you
i see its also outside of yt too
You can visit Krish Naik
I watched ml teach by doing but its not enough
from vizuara
I want some path like most important parts of machine learning and data science and then straight to practice project
or just start project and fill gaps
dont want to watch everything and not starting project
looks then like tutorial hell
Bro
I don't know about vizuara but there before using any algorithm like linear regression and other you must know the maths behind it , then you can enjoy learning
Simply calling the library isn't going to help you
Can someone explain pathfinding AI to me?
you have start and target node you find shortest path to reach target with dijkstra or a*
And what are a* and dijkstra and which is easier?
algorithms to find shortest path
try to understand how this works
you can also try with pen and paper
dijkstra is easier, a* has h heuristic
dijkstra is taught in discrete math courses
a* seems like robotics games rather
yes you can do dijkstra with table
look also at pseudocodes of them
I'm not sure dijkstra is exactly easier, dijkstra is really just a specific heuristic for a* (essentially)
@final kiln sorry for ping, but im looking into jax rn. do you have any opinions between flax, vs equinox vs haiku for neural networks?
Thx
say i make an RNN to estimate velocity from acceleration that has the usual kinks you get from an accelerometer,
do i run it with a set value of past measurements (like 200) every time i get a new one, or do i just keep feeding it and look at the memory gate?
second one is much faster but i think my PC should be able to handly both, so im looking for the best runtime
also, just a regular RNN isn't enough, ive tried, even hand trained it, so im making an LSTM model
I meant when you know bfs then dijkstra is easier
How many photos do I need for each subject for a folder and do they have to be in separate folders or just one mixed up one?
I was wondering if someone was slightly or more experienced with AlphaZero's type of model and could help explain why when i use the code from the most searched youtube tutorial it works, but when i modify it to keras instead of pytorch it doesn't give me any errors and i think i understand the code, but then it always learns the wrong policy as when i test it for a given board state it doesn't give a high probability for the "correct" move that wins them the game. Anyone who could help or point me in the right direction? Here is the pytorch code github: https://github.com/foersterrobert/AlphaZeroFromScratch/blob/main/7.AlphaTweaks.ipynb and here is my code: https://codeshare.io/5X843z
I've just been learning about linear regressions. Is there a reason we would use gradient descent to help find our coefficients for our model, instead of using the "normal equation" which to me looks like something we can just compute instead of having to iterate over our dataset
What is the most used modul in python for Data Science?
And Is there any source I can learn Data Science with? like an Arabic Source or English
Hi! You're correct in that sense, yet gradient descent is more preferred because gradient descent handles large datasets easier and more efficiently. If we use the normal equation, we have to compute the inverse of matrix, which can be computationally expensive and can lead to an undesired big O complexity.
Hey im trying to code a transformer and learn about ai coding does anyone have any good tips or things to study?
I'm trying to define a good cost to push fidelity to be exactly 1+0j.
def loss(
mps: qtn.MatrixProductState,
target_mps: qtn.MatrixProductState
) -> float:
fidelity = mps @ target_mps.conj()
return abs(fidelity - 1)**2
Need something that can get a bit more performance.
If you want to try it out (small code, so can just try on your side if you prefer) lmk.
on top of the other answer, we also usually use linear regression as way to teach gradient descent, especially for methods where analytical method is not applicable, like NN
so nn regressor does automatically linear regression?
what about generalized regression nn grnn is it still used, I remember I had about it on ml course
I mean I never met this outside of course, like only when googling about it
this channel being about ai; if someone is up to the challenge, could you answer this question
how many nodes in the input, hidden and output layer, how is this determined ?
do you know how matrix multiplication is defined?
dot product
and so what is the restriction regarding the sizes of the columns and rows when you multiply two matrices
given a matrix A of size m x n and a matrix B of size p x q, if we want to calculate AB, what do we know about the sizes m, n, p, and q?
size of rows of A must equal size of columns of B or vice versa dont remember
(this is the answer to your question regarding the number of nodes in the input, output, etc)
ohh ok think i got it
or it is size of cols of A must equal size of rows of B
thats why must transpose matrix
NNRegressor? which package has that
I only know MLPRegressor which is in sklearn
so input layer = 3, hidden layer = 2 and output layer = 1
I might as well ask here. Does any of you use Darts?
if you assume a nn has a width of 1 for simplicity, you can think of it kinda like input -> linreg -> activation -> linreg -> activation -> ... -> output
edit: well ig a width of 1 would make the inner layers really uninteresting, so just ignore that part
interesting insight
I was looking between TSLearn, GluonTS, and Darts for time series forecasting. I used to use manual forecasting, statsmodels, pmdarima but I needed a more united package
Manual forecasting meaning like I build like a regressor from sklearn and plot the predictions against actual.
can also look at some time series R package if you are allowed to use it
or if only allowed python then different story
there's also Nixtla which people say is faster I think
I used R in the past still, but Python it's kinda challenging. My colleagues use Python
With R they have model time which is really nice
like Nixtla is focused on forecasting while the others include more analysis utilities
wait you said forecasting not analysis ok I meant ts analysis then
It's why I joined this server because my colleagues are like pmdarima and manual forecasting all the way. And I'm thinking there got to be a better answer
It's both
I'll take a look
"manual" forecasting is still competitive though
cause you can just include some lag features and turn it into a tabular prediction task, where gradient boosting trees excel
please not Excel 😂
Yeah I seen colleagues do a manual xgboost and lightgbm, but I was hoping for like a united interface encompassing deep learning, classic forecasting, machine learning, and hierarchal all in one package . I do include lag features though my colleagues get super sticky on the specifics (will not mention)
there's also sktime (and aeon which is a fork)
I've a rant in here a few weeks back about how I'm not impressed by the performance when compared to just raw dogging it in polars, but that could be because polars is fast and not that sktime is slow
It's sad I never learned polars
I should but I'm used to pandas already
On a personal level too
deep learning, classic forecasting, machine learning
yeah nothing like that exists, it would cover too broad a scope to not be bloat hell I imagine
well ig technically a good chunk of the forecasting libs mentioned above (sktime, aeon, darts, nixtla, idk others) do include neural network methods
it's just that you can't alter the architecture as easily as say if you were to use pytorch
so were most people that use polars
I'd very strongly recommend learning it
Does polars have something like pandas.ta
It's just convenient for a lot of packages I use
you have polars_df.to_pandas() and polars.from_pandas() to switch between them, if your package only takes pandas dataframes
Like Edgar tools is the best SEC parsing package and they only parse to pandas. Idk I guess I'll try to learn it
so transformers for ts forecasting is overkill for this case?
What's your data looking like
just ask generally
for now time series remain a field where (un?)fortunately the best model is "it depends on your data"
oh ok
Yeah
there are some plugins like https://github.com/wukan1986/polars_ta and https://github.com/Yvictor/polars_ta_extension, I have never used either of them myself though
polars does have official support for plugins though, which is pretty neat
How did I never heard about this. They hype up pandas ta with 130 technical indicators wtf
Generally though for government economic data, they usually use ARIMA, SARIMAX
So it depends on the data but also the type of data
So my solution based on the chat seems to be Darts and Nixtla. I'll check both of them fully out
it's not even just transformers, sometimes deep learning just loses to a xgboost for example: https://arxiv.org/pdf/2101.02118
Me personally you can combine models
They didn't show any visualizations sad
Not sure if you discussed with anyone else yet, but the reason I asked to see an example is because it's hard to know just from this description what kind of noise you have and what "not enough" means.
Is it possible to compute the eigenvalus of a matrix symbolically using numpy?
Sympy is too slow for larger matrices.
darts is the closest.
If darts is not good enough, then manual.
it's exactly what you expect:
- (of course), random deviation from the exact value, especially when accel is high
- sensor ringing (a sine curve) after changes in acceleration, especially when stopping, tapping or hitting something
- gaps between measurement points contribute a lot to drift after stopping, almost impossible to filter out
all filters had either or multiple of the following problems:
- not enough; still drift after some time or after fast movements
- cursor moves back to where it came from (like with a vanilla RNN or forgetting integration
- cursor starts absolutely tweaking once you move any faster than a slow glide
- input felt highly unnatural, cursor was hard to move at all (often happened when you overdid it with filters that else aren't enough)
zero velocity updates (set velocity=0 when acceleration=0 and delta_rotation=0) either don't detect or detect too eagerly
hi guys, i dont know how to go about learning the math of ML? im learning the data libraries of python and i wanna start learning the core math of ML as well before i jump into deeper into ML and ML concepts so i can actually understand whats going on at a deeper level. How would i go about this? do u have any resources and guidence?
u still struggling with this?
i understand machine learning at the lowest level possible so im sure i can help
yea
im not sure how to approach actually learning ML but i'll learn pandas/numpy/matplot parallel with math for ML
and try to implement the math with python
tho im not sure where and how to learn all of those
same about ML concepts
i mean
start with a single 2 input with weight -> 1 output with weight design. understand how that works and then how to retrain it to approximate towards a goal.
once youve done this its just a matter of scaling
essentially current ml models require training pools. the more neurons the more accurately that the machine can create an outcome. the more layers the more complex the result can be. understanding how a signal machine neuron works first is essential
ok, i'll start there
thanks
ok bro good luck. if u need any help along the way lmk
got it
the increased bus width is representing the analogue signal to avoid unnecessary digital clock based/space unoptimised processes
(oh god, I was imagining something like one made out of wood 😭)
the ram contains the weights, though the architecture allows for localised ram for each neuron set
wood isnt conductive...
ill be physically building it soon once ive got the modularisation down
well, I was apparently conflating "physical" with "mechanical", mb
i mean it will be physical soon. like basically small connectable cubes that can stack onto each other to create a massive neural network
(I was thinking something not involving electricity (perhaps, with a hand crank
))
i mean... i can power it with a hand crank but why would i do that when i can plug it in...
also that would involves some crazy gear work 😭
i just cant think creatively enough to modularise it effectively
I mess around with mechanical computers and such, here is a good starting point for some ideas: https://www.youtube.com/watch?v=s1i-dnAH9Y4 I recommend starting with Lego.
A 1953 training film for a mechanical fire control computer aboard Navy Ships. Amazing how problems of mathematical computation were solved so elegantly in "permanent" mechanical form, before microprocessors became inexpensive and commonplace.
(I have done NNs)
bro this is so goofy
i done it though. ive discovered the perfect method to physically modularised neural networks
have you tried using symengine?
it's in c++ rather than python so should be more performant
Yeah about that, I saw the full documentation and I have to say Darts is awesome.
I was able to implement most of what I wanted
And then I supplement the rest with pytorch forecasting
It doesn't have SARIMAX explicitly but I can change it around to get SARIMAX
iloc takes integers while loc takes labels
wdym?
like you do df.iloc[:, 2] to get the 3rd column
or, if the 3rd column is named "fruit," you can do df.loc[:, 'fruit']
3 column and all the rows
is it fixed to being a two parameter function or what
what I dont understand is the syntax df[[something, something]
i have this vue dataframe
and loc[row, column]
what if loc only contains one param, wtf?
@jaunty helm
the only difference in loc and iloc, is that loc uses labels (str) to access stuff while iloc uses indices (int)
it's df.loc[<row stuff>, <column stuff>], and if you omit the second one it's just df.loc[<row stuff>]
if you think I type too slowly just see here https://pandas.pydata.org/docs/user_guide/indexing.html
is fine, I gotchu
where can I check function definition of loc()?
because, if I only have one parameter you know is row
because loc(row, col) right
is df.loc(5, :) equivalent to df.loc(5)
not (), []
ok
I think I get it tho, this cose in here
this code is saying df.loc[5, :] = [stuff to put in corresponding r,w ]
like
it will put in the corresponding columns
"heinz, armando" will be put in row = 5, column = 1
1255 will be put in row 5 column 2
'a' will be put in row 5 column 3
and "flex" will be put in row 5 column 4
I get it I think @jaunty helm
❤️
i appreciate it tho, you broke down the principal differences and I think I start to get it tho
are you from america?
how does df.groupby work ?
like grouped = df.groupby("columnnnn")
prepares the dataframe for an operation, such that it's applied to each group
like df.sum() gives you the sum of everything, df.groupby('a').sum() would get you the sums of each group
same
what the flip does the [] do
after the groupby
access a column
is hard
the indexing guide here is a lot more complete than what I can give you
i just need to guess which line of code makes the following operation on this df
I think it will be df.groupby("asiento")['fila'].sort()
well, certainly the groupby, selects a column that you will be using
group = column or row
you can also do loc on multiple columns
which is equivalent to this one
loc is the same thing as using df[label, label]
unless I am tripping @jaunty helm
I think groupby groups by the rows
and then we access the column with the []
something like that no?
yeah so in this case we have the row 'a' that is going to be summed entirely or something
row 'a' from the dataframe
for this df we grouped by the animal row and accesed the column age to put the mean of each age of the animal
groupby i think is like SQL's?
and used with some aggregation function
do I need some knowledge of hpc and parallel computing for machine learning?
or only when someone manages infrastructure of it?
so not neccessary
tbh for machine learning u just need a module to do it and a training set :p
are there any books which teach linear algebra and its application in computing? ngl it just seems long learning about linear algebra. I just want to know enough linear algebra for machine learning. Im going to use some A level textbooks to learn matrix basics and then use gilbert strangs book to from chpater 3 onwards but i wouldnt mind using a book that teaches linear algebra ground up and learning to apply it
bro. linear algebra is insanely easy
Cool.
for example when u calculate characteristic polynomial/equation then its similar to high school algebra
Is there any way to use wolfram mathematica inside python to compute eigenvectors symbolically?
I think you can just use sympy for that.
No. I have a matrix 11 by 11 with a lots of zeros, and if i just introduce one variable it will take too much time to compute it
have you tried symengine
I dont know that one. I will check it.
have you used a tensor for the matrix?
No.
r.i.p
Why?
gpus are really good at matrix functions c:
cuda probably has the protocols to do this efficiently
Wait, when you say tensor is not using Matrx(...) right? The question may be dumb. Ahahhah
tensors are essentially a 3d array
an 11x11 matrix could fall into the tensors realm
How?
if ofc it meets the criteria
You have tried sympy but it's too slow?
Yea. Tooooo slow.
Is Mathematica a lot faster for this?
Yea, by a lotttttt
cuda might be the optimal approach here
Mathematica can make it in seconds and using sympy may take at least some hours.
I may not have cuda.
cuda should allow this to happen in millisecond latency
If the difference is this large I doubt it's a hardware issue.
Sympy is probably just doing something horribly wrong.
no its most likely that sympy is using cpu fetch decode execute cycles instead of gpu tensor capabilities
which is the wrong approach to digital matrices
yeah tensors should be perfect for that
I can show you how the matrix is.
No I mean like 20 year old hardware can demolish that task.
yeah
This is 12 by 12 but its basically to remove one row and one column.
I want to substitute all of them except maybe 3 variables.
I can send the code.
If you can create a short MRE for the people here that would be great.
Maybe with a smaller simple matrix as an example too of what you want.
MRE?
Minimal reproducible example.
so its sparse
Looks like about 50% sparsity.
yes right
So in between, but still worth storing in sparse form. Although this is about symbolic computation, so things are a bit different in how you do things.
My guess is that sympy stores its structures in a pretty slow way compared to Mathematica, but this probably does not explain the massive difference.
symengine is like sympy but in c++, which is why I suggested it
Something quick to try in Sympy is cse (common subexpression). It can be used to optimize by having it replace all the common subexpressions with symbols so the simplifier does not chug through a ton of unneeded work.
Note that anything above 3x3 is going to start getting really bad for symbolic computation on this task, especially if the engine is not making use of it being sparse.
import sympy as sp
# Define symbols
F, G, A, B, epsilon, mu = sp.symbols('F G A B epsilon mu')
# Define the matrix
M = sp.Matrix([
[0, 0, F, 0, 0, 0, 0],
[0, 0, G, 0, 0, 0, 0],
[F, G, epsilon, 0, A, 0, B],
[0, 0, 0, 0, 0, mu, 0],
[0, 0, A, 0, 0, 0, mu],
[0, 0, 0, mu, 0, 0, 0],
[0, 0, B, 0, mu, 0, 0]
])
# Compute symbolic eigenvectors
eigen_data = M.eigenvects()
Even doing this is already very bad...
!code
So yea. I may try first to use symengyne.
I cant find the documentation of the functions of symengine.
Look in to specialized methods for "sparse matrices". They can be loads faster by skipping all the zeros.
That works if i want it symbolically?
I don't know. Like I said, look into it.
But the idea is at least 40 years old. Someone's probably made one that works for Computer Algebra Systems by now, as well as for numerical methods.
👋🏾 anyone got a project they need help with?
Hi