#data-science-and-ml

1 messages · Page 180 of 1

cold plover
#

or you take several instances into account before you take a step so is that what stochastic is doing? while regular does the former?

serene scaffold
#

gradient descent is like climbing down a hill with a blindfold on. it's "stoachastic" when you feel around a few spots with your foot before deciding which way to go.

#

@tidal bough do you like my analogy or no
it's okay if you don't

#

I barely get to think about neural networks anymore now that everything is """""agentic"""""

cold plover
#

ah so regular is just taking the trip on a guess and then finding out that there could've been a more efficient route? vs stochastic is taking one step, looking around for the next like downhill slope and going that way?

serene scaffold
#

also it occurs to me that my definition of "not stochastic" might be inverted

tidal bough
#

It's easy to show that if you take the limit as learning rate approaches zero, the average direction of the gradient of SGD will match the true gradient (and the expectation value of the SGD gradient just is the true gradient, always). But as for why it works in practice for sizable learning rates... One way to explain that is that SGD doesn't go along the direction of the true gradient, it randomly deviates from it. The thing is that, for tasks with many local optimal like backpropagation in neural networks, SGD typically works better than true gradient descent because of this. The randomness makes it better at global optimization, the same way many global search algorithms involve randomness.

#

i think I remember reading some blogpost that showed a cool result relating SGD to a more complex algorithm but I'm not sure how to find it...

cold plover
#

right so instead of assumed/ill informed planned routing, improvised routing?

Like choosing a route on google maps that appears to have less traffic but then adhoc taking another exit/road to avoid the evident traffic and arriving at the same destination?

tidal bough
#

Sure, that's maybe a decent analogy

cold plover
#

i know stat quest is a really simplified version but this still confuddles me.

#

the way he explained regular descent didn't go back to the initial guesses either, the steps built up on top of each other?

tidal bough
tidal bough
main fox
tidal bough
#

I don't think so

cold plover
# cold plover i know stat quest is a really simplified version but this still confuddles me.

so in regular descent, the derivative of the loss function was calculated with ALL data in mind. a guess was taken for slope of loss function and intercept, multiplied into the learning rate and rinse and repeat until the loss function is close to zero.

i can understand how that would be a lot of computation for a LOT of data points for the parameters.

where as stochastic takes only one data point for the parameters, find a line of best fit, and outputs the parameters for step 1. then the subsequent steps take the previous parameter otuputs, multiply by learning rate, and do the same thing which leads to a more efficient path to local optima?

#

i can sort of understand it as: each data point being allowed to pull the line of best fit toward itself for SGD until it closely fits in an arrangment where all data points are "happy", where as RGD just shotguns a guess for slope for all data points and keeps adjusting that until "happy"

#

so i can see why SGD would be faster.

tidal bough
# cold plover so in regular descent, the derivative of the loss function was calculated with A...

Sure, note that for SGD you're effectively doing much less computation per step, so you can afford many more steps.
E.g. you have a dataset of N=10**6 points. You could do one step of GD, computing the gradient on the entire dataset. Or you could do SGD on minibatches of m=1000 samples each. 1000 samples is many enough that the gradient computed this way would be a pretty good approximation of the true one, and for the same amount of compute, you'll be able to do N/m = 1000 steps, instead of one.

cold plover
#

so a compromise of optimization between accuracy and computational power

tidal bough
cold plover
#

i feel like my interpretation here makes sense, could you correct me if I am wrong?

#

sorta like throwing one thing out of balance in the favor of other until you get close as opposed to finding the best arrangment for the initial data point and then optimizing off that to get a close arrangement that works for all.

tidal bough
#

Yeah, that describes SGD, sure

#

However I should note that... 2d dataset fitting is a toy problem, and for it, SGD is, I think, objectively not at all better than GD.

#

because the loss function here is smooth and convex. GD would just go directly to the minimum, whereas SGD would wander a little. SGD has pretty much no advantage here, so trying to understand why SGD can be good by only studying this task won't go well.

cold plover
#

like it didn't just spout variables and calculus but rather explained the underlying reasoning/process.

#

but thanks for the concise explanations whilst i stumbled through my thought process!

#

ill be back with more questions...soonish.

iron basalt
# cold plover so in regular descent, the derivative of the loss function was calculated with A...

SGD wiggles around which lets it generalize better since it will jump out of steep local minima holes. Not having to go over the whole dataset matters for performance reasons. In practice batches are used in deep learning which is kind of in between the two, and this is done there because doing just one sample at a time would be slow as computers want to work on chunks of things at a time. So the batches act as chunks for better performance.

#

In non-toy, non-convex problems SGD is used for these properties.

cold plover
#

gotcha

cold plover
#

how does one determine what kind of cross validation method to use based on the amount of training data?

#

for example if it goes 50,100,200,400,500,1000 etc?

dreamy solstice
#

gm

Please, is there a paper or an article that explains how word embedding captures meaning from training? I recently finished learning linear and logistic regression and multiclass with softmax, so I'm planning on building a sentiment analysis. I'm planning on Word2vec embedding. Training numerical data is simpler because your X is the input data, but from what I've learned so far, the linear model takes the parameter h as an input whch is the avg of all the vectors of each word in a sentence.

And it trains and trains and changes the word vectors (The beginning of the confusion), how will the changing of the vectors make the model understand the words? Since the input is a mean vector and to find the h gradient, you won't use the parameters of the h you first passed.

I know I can use packages for embedding, but it's somthing i want to write from scratch, so if any paper, blog can help, cause i dint even think i under wht im trying to ask any longer.

Thank you.

cold plover
#

@tidal bough is there a ML course you would recommend?

#

my current class is an elective that is...poorly taught IMO. would like something more clarity and structure.

serene scaffold
half pulsar
#

A good way to start is to take a specific algorithm and work through it end-to-end, from the derivation of the equations to a full implementation in code. Translating it helps turn the math into something executable builds intuition much faster than passively watching lectures imo.

final kiln
#

seems like they finally cracking arc agi

fringe thicket
#

Hi

#

Guys I Just completed Python And I want To enter in data science field I don't know what to do Now
Can Anyone help me ?

serene scaffold
rare bane
#

Wait can we put links to GitHub repos for data science projects in this channel?

rare bane
#

For the more data science inclined python devs, when working with model predictions, is it like a procedural thing, or you just have to work an entire new brand of logic to get what you want

For example: if I'm working with a small dataset I usually use a linear regression model and then load the dataset, clean/wrangle the dataset , then select my features and my target, run some metric scores and visualize. And I go about with that almost everytime. Is that the standard case? I know different datasets and features to predict require nuance and different predictive models as it's not a one size fits all scenario, so I'm just asking if it's as procedural in data science like when you're making am omelette, where you know exactly what to do and the process doesn't change

sullen urchin
#

Is there anyone who is looking for a dev?

elfin stratus
#

Hey guys! I got very interested in coding and especially data science in the past year. I learnt python pretty decently and started learning other tools and libraries with kaggle.

I am ambitious but the path is unclear. I would be happy to get a little clarification about the best way to build out decent data science skills. Like a roadmap.

Thanks in advance

main fox
# rare bane For the more data science inclined python devs, when working with model predict...

There are well described patterns/workflows for standard ML work, yes

Logically, you:

Understand the ask, and determine an approach

Identify relevant data sources, if you're working with more than one source of data

Understand the data, clean and reshape for your purposes

And determine if you need an ML model

If so, establish a baseline, and note general transformations that you may need (OHE, missingness, standardization)

Test different models and feature engineering steps

See if performance and complexity are justified

Determine deployment strategy

Like you noted, these steps are not all encompassing, and every problem has it's own details

rare bane
#

Huh, so basically if the output is looking somewhat reasonable for a complex dataset, after using multiple models, then that is probably ok?

Well I appreciate the detailed answer if anything. T for thanks

main fox
#

You'll make the call if performance is acceptable

rare bane
main fox
#

Usually within industry you can tie e.g. dollar amounts to the events you're trying to predict to help you determine if the model is worth implementing

proven pier
#

This may potentially violate rule 9, but I think I'd like to look into potential options for learning AI by paying for a curriculum or some sort of teaching service. - I suppose I'd say, not asking for anybody here to do it specifically, but perhaps somebody here knows of some reputable sources for such things? I think what I need is a good curriculum to follow then I could be more confident in the directions I'm taking

serene scaffold
#

!resources data science

arctic wedgeBOT
#
Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

umbral dove
ocean hinge
#

Hello can anyone look into my issue on python-help?

serene scaffold
jaunty helm
#

it's also seldom 'procedural' as in one-and-done, after a model is trained you inspect its predictions and if it's not good enough, go to one of the previous many steps to try and improve it

copper kindle
#

I just can't get over this concept. Easy to grasp W_(target node, source node) and easy to compute Wx+b.
But I've seen some other example online that takes in account W_(source node, target node) and then does a transpose of weight matrix before dot product. Why would we need to do that ? is that a special case based on inputs dimention or something else?

final kiln
#

in the second image the weights of each "neuron" are organized in column vectors instead of lines

#

there's stuff about line vs column vectors which may be meaningful if you're in some advanced text, but I havent seen it a lot outside of research papers

https://en.wikipedia.org/wiki/Dual_space

but yea not worth digging into this if you just learning grad descent and other basics

In mathematics, any vector space

    V
  

{\displaystyle V}

has a corresponding dual vector space (or just dual space for short) consisting of all linear forms on

    V
    ,
  

{\displaystyle V,}

together with the vector space structure of pointwise addition and scala...

copper kindle
copper kindle
final kiln
final kiln
copper kindle
#

@final kiln Thanks.

final kiln
#

any time

fringe thicket
wet dome
#

What are the most common data visualisation tools? Is it power bi and tableau?
If I was to learn one what should i choose?

waxen kindle
#

They are very similar to one another

#

The free alternatives are too

rich moth
#

Has anyone played around with WorkClaw?

serene scaffold
#

!warn @hardy fractal your message was removed for listing a job, which is not allowed.

arctic wedgeBOT
#

:incoming_envelope: :ok_hand: applied warning to @hardy fractal.

hardy fractal
#

What should I do for that?

obsidian talon
#

Tableau, Power BI, Looker Studio are the top 3 most common, but there are others

#

Programming languages can make data visualizations too though

serene scaffold
rich moth
#

I build a "brain" map of my AI system. Just wanted to see the inner workings a bit, but it will be really helpful for debugging. But since its initial origin point its built and connected these nodes over roughly 2 weeks, its organizing its thoughts. but so far so good, no hairball. Its currently integrating itself into WorkClaw, its really fun to watch it just go. I dont even watch TV anymore.

#

its pretty incredible what some of these local models can do these days

rare bane
glass temple
#

does SMOTE work with high dimensional sparse matrices like TF-IDF matrices?

#

If not, then should I reduce the dimension of the TF-IDF matrix by limiting max_features, or use Truncated SVD to reduce the dimensions? What's a good rule of thumb to the max dimensions to use for SMOTE?

wooden sail
#

general linear combinations aren't sparsity-preserving

glass temple
#

I'm guessing there's no over sampling method for text data that uses just ML then?

full granite
#

I’m not sure if this is something I should be asking here, but do you think that having written my own programs to conduct materials analysis during university and graduate school could be considered one of my strengths when it comes to job hunting?

final kiln
#

yes

prime cliff
#

Hlo

unkempt apex
final kiln
wind breach
#

guys, is there that idea to have tokens that only the LLM can generate and they are not displayed to user but they are like for reasoning (not just reasoning tokens, but tokens what are not in dataset, not in human texts, not mapped to real text, only unique id tokens)?

daring matrix
#

any bioinfos

fallow coyote
#

Im thinking about doing a little programming project for my workplace where Ill make a program that can automatically detect and measure wear on a tool. My company is a tooling company and when Ive been tasked to record tool wear for a project, its tedious task taking a photo with a specialised microscope cam, positioning the tool and then trying to get a decent enough measurement. I want to first automate identifying siginificant tool wear and obtaining the measurement for it. How do i go about it?

jaunty helm
torn hill
#

Hi Guys

I have been experimenting with sentence relevancy for the past few weeks.

So I made Scout , its an experimental attention model that slightly modifies the standard Transformer attention architecture to learn directional relevance between sentences instead of tokens. Instead of asking "are these similar?", it asks "does sentence B actually help sentence A?"

Still a small model trained on ~4,500 synthetic pairs. The deeper question that am trying to asnwer is can attention mechanics encode functional utility rather than just contextual compatibility. But early results are interesting.

Do check it out and tell me what you think

https://github.com/samyak112/Scout

GitHub

Modified transformer that learns directional information gain between sentences. - samyak112/Scout

wind breach
jaunty helm
# wind breach no, I think just tokens would push model to be more token-eff

then the other person linked you something relevant I think
a quick search also found this survey on latent reasoning which may or may not be of interest

wind breach
#

okay, not only that but also force LLM to use only them for reasoning by showing to the user all tokens except these

wind breach
jaunty helm
wind breach
#

how is that different than the normal no reasoning mode, where magical stuff happens internally in the llm, with intermediate vectors not corresponding to any particular word in the vocabulary, and also humans can't really 'see' it

allows to save progress between forward passes

#

normal non-reasoning LLMs can only "think" inside a single forward pass, which is extremely limited

jaunty helm
#

you can check RWKV which is a novel architecture that's been devved for some years now (though to no larger scale or adoption), I think it has some close ideas

wind breach
jaunty helm
wind breach
#

the only state is previous tokens

#

and it is used anyway

#

actually, I wouldn't call it state, that's just part of LLM input (LLMs are auto-regressive)

jaunty helm
wind breach
#

one forward pass is only one token

wind breach
jaunty helm
#

I feel like I'm just trying and failing to guess what you're actually trying to describe
can you like say exactly what you mean and how its different from existing architectures

empty dragon
#

Ok 👍

pliant steppe
#

anyone got experience with gpus and OOM errors? im doing a project where we have teach a neural network to play from a baseline of data and then improve from selfplay (baseline is generated by a MCTS agent and then a PUCT agent trains on it and starts the selfplay phase). the problem is that when i got to training the neural net i set a batch size of 64 for the data and it keeps causing a OOM error. iv only managed to train the first model on 10k games from the MCTS agent, then i generated another 10k games using the PUCT agent but now when i go to train the 2nd neural net on all the previous data it keeps causing a OOM error, im not sure why

bronze wyvern
#

Hello, quick question, our lecturer told us to split our data from the very beginning and to always work on the train data , anyone has any idea why? For e.g what I'm used to do is preprocess everything and at the very end then only split the data.

if the test data is unprocessed and we test our algorithm, say linear progression, is that a problem?

jaunty helm
bronze wyvern
#

when using standardScaler for e.g, this normally should be applied on whole dataset, no? Like to scale everything to a specific range?

jaunty helm
#

so e.g. you may do

train_X, test_X = train_test_split(X)
scaler = StandardScaler()
train_X = scaler.fit_transform(train_X)
test_X = scaler.transform(test_X)
... # more processing

however you can not do

scaler = StandardScaler()
X = scaler.fit_transform(X)
train_X, test_X = train_test_split(X)
```because your scaler had the information from the test set to learn a better mean and std from, your model will seem better than it is on true, unseen data
#

I recommend using a sklearn.pipeline if you have a lot of steps

jaunty helm
#

also the above extends to manual processing as well
e.g. for some unsplit numerical dataset arr you can not do:

arr_standardized = (arr - arr.mean()) / arr.std()
... # split later
```because again the `.mean()` and `.std()` used information from what should be the test set
bronze wyvern
#

yep I see, I will read a bit on what you mentioned and come back but I understand the gist of it; we want the test data to be truely unseen

jaunty helm
# bronze wyvern yep I see, I will read a bit on what you mentioned and come back but I understan...

yeah
again, anything that requires stuff to be learned from data, you must do after splitting
there are other operations that don't, in which case you can do it whenever
for example, it's often you train the model on the log of house prices (then exp back the model predictions, for the actual predictions), because usually house prices are very skewed and taking the log can make it fall into a nicer distribution
since log doesn't need to know anything about the dataset, you can do y = log(y) before splitting it into train_y and test_y no problem

bronze wyvern
#

oh ok

#

Will come back, I will dig a bit in what you mentioned/read a bit to document myself and come back, thanks for the insights

bronze wyvern
#

yeah I have a better understanding now, thanks !

limber plover
#

Can I share my idea I had about Ai system I was building and how scifi is this or could this work?

#

Anyone here?

serene scaffold
limber plover
#

I just want to be courteous to the topic at hand they are having and not spam.

#

A long while I was doing research and I had this idea, though I don't know how practical this would be to implement. Parallel Forest: A Mesh Network-Enabled Model for Diverse Task Management
Overview of Model Architecture
The model utilizes a unique architecture that combines forest clusters with a mesh network to efficiently handle a diverse set of tasks. Each cluster, resembling a revolver drum, contains dense trees arranged in a matrix formation. This matrix arrangement enables parallel processing and efficient computation within each cluster, allowing for simultaneous data processing and pattern recognition.
In addition, the forests (Random) are interconnected through a mesh network, which provides redundancy and fault tolerance. This interconnected nature ensures that the model can continue functioning even if one forest goes down, as data can be rerouted and processed through alternative paths, maintaining the overall functionality of the model. This scalability allows the Random forest to expand based on the tasks given.
By leveraging the matrix formation for parallel processing and the mesh network for fault tolerance and redundancy, the model aims to achieve robustness, efficiency, and continuous operation across a wide range of tasks. this is a bit general overview....

serene scaffold
limber plover
#

I have more of this, but I did work on some code and I started out with basic binary classification problem for valid and or not valid worlds of which I was training NN for, though it did have also decision trees. When I say go down, I mean power outage or problem with a node. If I remember right I simply wanted a system using mesh network for certain ML tasks, for example one node would be down so it switches to another. Sorry this was A LONG while ago I did this.

#

I don't have an example as this was a road map for me to start on, but I never did get properly started on it. The idea basically is a multi core processing. The idea is that, one cluster would have Just math so when math is needed that cluster would be activated FOR that task, it would then revolve again for say language processing. Sort of like LLM with multi hat prompmt however, each node does ONE taks only. The master controller then manages this of which node is used acting as input and output. I have more of this but I am afraid to spam. It.

#

I unfortunately have no math and no code at the moment that I can find. However here is more info on this.

#

Decision Tree Details
The dense trees arranged in a matrix formation within each forest would appear as a grid or array of interconnected nodes, representing the individual decision trees. Each row in the matrix could correspond to a specific feature or attribute, while each column could represent a different split or decision point within the tree. The nodes within the matrix would be interconnected to facilitate parallel processing and information exchange, allowing for efficient computation and pattern recognition within each cluster. This matrix arrangement enables simultaneous data processing and collaborative learning among the dense trees, contributing to the model's overall performance and robustness.
Just for the decision tree.

#

Basically if I remeber right, LLM have hard time with math as they are mostly LLM. Suppose you want a math problem now, the language node and math node would talk and are able to get to you the solver for the problem at hand. I used matrix because its the best for math processing. At this time I was working on a chip that was called MALU (matrix aritthamtic logic unitt) I thought I campe up with it, but apperently this existed already for sometime now since the 70s or early.

#

sorry not sure what is going on and double ttt textt, my keyboard being wireless is having problems just a sec.

glacial socket
#

Hi!

So I am new to python. My teacher told us that the second chance at exam is in may/june. i suddenlh got a notification and now our exam is on friday. The exam is like this multiple choice exam. You pick what code is right. Some of them you need to fill in the blanks and some are math python stuff. I have had a rough half year and I had to love across the country and I really dont need a good grade. I just need to pass. He said everything is allowez expect talking to people and AI. Do you guys have any tips for me? It would really help me, I need all the tips I can get, even the basic ones. I am not good ar coding and our teacher has been very absent. The whole class is failing and we know almost nothing.

#

had to move across*

limber plover
#

eden you have question to help you with?

glacial socket
#

Not really a question, I am sorry! More like do you know any websites that perhaps can help me under the exam? Something like that. He said we can use internet but not AI

limber plover
#

Right to help you with what exactly? I am not not sure what will be on your exam, you have notes to tell you what you should study for?

glacial socket
#

I have python for beginners. Not anything difficult. Why the exam is hard, is because the exam is full of very long code, and we need to either fill in or to pick between code lines. Give me 1 minute and I will show you how far we have come into python

limber plover
#

Ok. Otherwise this reads as if.."my teacher told me something to do with the room and he wants it to look certain way, but I am not sure what to do about the room, we have been doing stuff in the room but he said we cannot talk to people or use ai to do something with the room....

glacial socket
#

Oh ok! Well I am sorry, never mind just forget it

limber plover
#

I am not discouraging you I am tying to ask you to to ask me or us specifics. What was your topic about?

#

Oh good, now I am the bad guy...

#

@glacial socket Still there?

main fox
#

Since you're asking in the data science channel, let us know if your exam is specifically about data science content

limber plover
#

Maybe I am just bad with people. Thanks Twiibz

#

That said anyone interested in chiming in about what I wrote with regards to my road map to making this system. For example I can see maybe doing small nodes for certain tests. Perhaps making a binary decision tree but braking it up how it is accessed. Having a master controller managing. Perhaps have a node that just stores words, and then have binary decision tree tell you said words of valid or not. IF you ask for example ABC or bob is valid.

#

@serene scaffold Any input on this topic?

#

@serene scaffold Do you have no opinon on this? Should I move this to some other place for talks I am open to any ideas or questions.

serene scaffold
limber plover
#

@serene scaffold Ah ok my apology

#

Does not look like they are active there. Maybe I will comeback this evening.

half pulsar
#

Instant banned for starting a talk in General chat

#

Redditor level mods

limber plover
#

What the hell?

#

I was having a conversation about my idea I had while back and I got banned for it, never told why until I had to ask.

half pulsar
#

Clear as day

limber plover
#

LLM text? you mean REGULAR text? what is LLM text?

half pulsar
limber plover
#

I never used GPT to think out my thoughts, Now I was pasting large block of text, BUT ASKED if could.

half pulsar
#

Unlike you I actually care, build and have had a passion for it.

limber plover
#

Holy hell, how childish.

#

I was A asking if this something possible and B have a conversation about it. What did I have to gain by using GPT for this?

half pulsar
#

Whatever. That was on you. Don't copy and paste ChatGPT shit.

limber plover
#

ITS NOT GPT MY god

#

I typed this out

half pulsar
#

I got banned in Affiliation

half pulsar
limber plover
#

YES it was a wall of text, I ASKED IF I CAN DO THIS. You said sure.

#

I could have done it line by line why would I though?

#

I was getting my thought out. What a wired reaction.

half pulsar
#

I said SURE cause I just was looking for conversation in something I'm passionate about, I was literally about to tell you to stop the LLM Spam.

#

It was already getting on my Nerves just type it out next time.

limber plover
#

Ok, lets do this, what part of it was LLM specifically.

#

The idea of multi core system for specific tasks?

half pulsar
#

How about we start off where we left off, It was about KGs

limber plover
#

Ok tell me more about KGs

half pulsar
#

No copy and paste walls of text.

#

Just regular talk, so have you built anything?

limber plover
#

All right I wont do that, though I was not trying to spam. I actualy asked if this was ok, it seemed like it was.

#

I did build something though I no longer have the code, I started out, AS I was saying using simple binary decision tree classifier if I remember right, for valid and not valid words, I was training to look at impute like this dgo and dog and tell me if this was valid or not, though I was going to try and make it into my multi core system. I abandoned this idea.

half pulsar
limber plover
#

Fine, I am no longer interested to ever be near that server. I was suggested to go to it for deeper talks about it.

#

That was all, since the person who I was having a conversation with did not have the time for it.

#

I continued spaming text to get my full thought out. I could type it out but that would be really strange. The mod who handled this could have reached out and talked to me but nope.

#

Fine by me. Never using that server.

half pulsar
limber plover
#

RIGHT, that is why I stopped as it would require rather a lot of work and processing, that said, it would be interesting to implement it in more small scale. Like using simple binary classification. Have nodes comunicate in a mesh network doing specfic tasks. I did not say it was a good idea, I was curious how practical it was, I even asked this.

#

I cannot now tell if anyone was even reading anything about by in between inputs.

half pulsar
#

If I can do it you probably can

#

Like I said if you want to build it you're gonna need to learn a lot more about "experimental" equations for AI

#

If you're not good at math it's going to be a struggle.

limber plover
#

Tell me more about your system you built

half pulsar
#

Telling anymore than that would be revealing

limber plover
#

Yes, I did not know you can make a hybrid system that is why I wanted to talk about it, what did yours solve when you built it?

#

Sorry, I having a conversation with the mod right now and they are REALLY wired. I honestly cannot tell if I am not talking to Ai.

#

The one that banned us..

half pulsar
#

I think that's as much as I'm going to share so don't expect much more cause anything else would be systems level revealing.

limber plover
#

Are the nodes hardware or did you make software? I was thinking the cluster would be servers...more or less doing certain task IF I was to scale this.

#

I mostly started with emergent behavior systems .

half pulsar
#

Okay enough is enough 🤷‍♂️

#

And if you wanted to see a visualization of it(limited though cause it'd crash)

limber plover
#

That is EXACTLY how I saw it in my head though I did view it also form top view

half pulsar
#

That's a limited view of it

#

Looks more like that

limber plover
#

honestly that is incredible

half pulsar
#

Thanks

limber plover
#

Man, I am telling you the mod that banned us is REALLY odd, I feel like they are an AI. We are talking about regarding how I knew a lot of stuff I talked about and they said we like cs50p. WHAT?

#

Its some Harvard corse but its not relevent to the topic...;

half pulsar
limber plover
#

Yeah dude I don't know really odd, anyway stay around I do want to talk more bout it. Let me ask this, was my "spam" wrong in terminology I used? Was there something I missed on what I said wrong way?

half pulsar
limber plover
#

Ah ok your own private project then?

half pulsar
#

That's as much as I'll say aloud publicly. Yeah

half pulsar
#

If I truly had something stable at scale and everything I said it was I'm sitting on Gold.

#

If you're interested in KGs then start now

#

If you just want a hobby project just take it easy

limber plover
#

All right, anyway sorry for getting you banned but that was really odd.

limber plover
bronze wyvern
#

hello, quick question, when we normalize our independent variables, say for a linear regression algorithm, do we need to normalize the dependent variable also, that is the target variable?

limber plover
#

@thin sky Can we have a conversation? I am really confused what you do not get? I am actually really curious about this. Perhaps call?

rich moth
limber plover
#

Yeah I know nothing about KGs I just got introduced into them.

#

I presented my idea and turns out the concept I have is similar to what I had in mind. I have not built anything yet. Long road there.

half pulsar
limber plover
#

OK this may sound dumb but are you guys interested in having this compressed into smaller nodes?

#

Could one use matrix for this? I am trying to remember this again.

rich moth
limber plover
#

@half pulsar Is there no way to do tthis # Compress ONE text into 2x2 matrix, then reconstruct
text = "HELLO"

Split & store in matrix (list of dicts)

matrix = [
[{"char": "H", "pos": 0}, {"char": "E", "pos": 1}],
[{"char": "L", "pos": 2}, {"char": "LO", "pos": 3}]
]

RECONSTRUCT (concatenate)

original = ''.join(cell["char"] for row in matrix for cell in row)
print(f"Original: '{original}'") # "HELLO" but on larger scale and with nodes?

#

I am thinking node in hardware terms here as a sort of another brain sort of like that one chip.

limber plover
#

Sorry, can't matrix used for storing data if yes and then concatenate out? If so can't you use that for maybe not the node but the data it has?

limber plover
#

Would that be just too much to compute?

half pulsar
#

Not only just that I wouldn't expect it to even be stable.

limber plover
#

Why?

#

I thought matrix's are designed for fast processing, specifically numerical data no?

half pulsar
#

There is a lot more than to it then just trying to store information in a huge "matrix".

#

And why would you store something useless liike Hello.

limber plover
#

The hello was an example

half pulsar
#

Get better examples, it'd help you build a better structure, you need a goal.

limber plover
#

I thought also its braking it up so that its not processing all at once but in pieces

#

Though I guess overhead would be the problem trying to concatenating out is that why it would not scale?

half pulsar
#

Yeah overhead would be the biggest problem there.

limber plover
#

Yeah, I remember trying to do something with compression and even trying to compress all the indices that have to go on to under what you compressed was a problem.

#

This is party why I left the project, I simply did not have the time to learn the exact math for it.

half pulsar
#

You need a algorithm.

limber plover
#

That is true but what algo would be good to work with that?

half pulsar
#

I'm not telling you that XD

limber plover
#

So need to make a custom one then?

half pulsar
#

Pretty much

limber plover
#

Interesting...

half pulsar
#

Think in layers here

limber plover
#

Yeah..but for matrix would it not be for x and y and z? You would basically need to make a cube?

#

If you layer it, that is what it would become yes?

half pulsar
#

These are questions you can find answers for online, there's a lot of research papers on that type of stuff.

limber plover
#

I have read them but a long while ago, I am just remembering some stuff I read a long time ago.

half pulsar
limber plover
#

Layering would be an algo you would have to use.

#

I am just thinking outload..it would be best if I was in a voice chat group about this.

#

I have no experience with this or not much of it, but I am just thinking this out visually what it would look like and what you could possibly use for it.

half pulsar
#

The problem is that you're over thinking the wrong thing, its not as simple as how do I store information, its more of how can I retain that information at scale with stability and do it efficiently, it takes many "Layers" to get to that point even then you'd hit a wall of Complexity creep, Every-time.

half pulsar
#

Nobody is good at math, You need to practice regularly.

limber plover
#

Ok I am reading this again as its been a while not humor me as I am using arithmetic logic unit for this that would be implemented in harware form..something like The core of the MAC is the Matrix Arithmetic Logic Unit (MALU). The overall functionality of the MALU is to perform the matrix operations and write the output to memory. Could one not do something like this same idea for nodes?

half pulsar
limber plover
half pulsar
#

Like give me a demo of what your project would do. In text

limber plover
#

The link I gave you is what I made a while back but using verlog I think that is the name...implementing a chip. For a 16 bit system.

#

I was wanting this as ALU would be much faster FOR numbers specifically. I then wondered if one can use this for higher levels. Mostly I wanted to see one can fit ML into 16bit system and what can it do.

#

I never did finish 16bit computer as I got distracted thinking about this...then Ai.. My mind loves to move fast..a lot

half pulsar
#

So you're building a 16 bit computer?

limber plover
#

Yeah sorry topic jump, its my history why I am so hung up on matrix.

half pulsar
#

That makes more sense now personally I don't see the value in proceeding with that.

limber plover
#

I was building it, I have a lot of ideas like these as I am expressing and I get half way through them and then leave them....

half pulsar
#

Like if you want to build AI you don't need to build a 16 bit computer

#

Talking about two different things here KGs and 16 bit computing. Its having me confused

limber plover
#

No, but I do want to build one since I would know exactly what its doing, I would then try to fit a FORM of LM into this system to see how strained it could get. Perhaps then needing more ram but curious what sort of lago you would use to compress it further down.

#

Think of it this way, you can implement KG ON a compute yes clearly?

#

How low of bits can we make this to fit it.

#

Maybe even just one node.

half pulsar
#

I don't see the benefit

limber plover
#

Think of the moon lander...

#

They had to do with very little yet they were able to get A LOT done, now what sort of algo did they have to use, Its impresseve, same idea, though we are NOT ladning on the moon.

half pulsar
#

Giving me 80s/90s vibings NOT a good thing

limber plover
#

I mean this is what nvidia is doing are able to have large processing task done on a chip.

#

You can implement your KGs I bet, much better on thier chips.

#

I am sort of thinking a long those lines, though I thought matrix would be involved

#

Anyway, I digress....

#

My main question is, HOW small can you make the nodes to do what you need them to fit on embedded systems.

#

Why not? A node does not have to be with KGs..it could be some other processing node for say classification problem yes?

half pulsar
#

Even me telling you my project does hundreds of nodes in seconds is revealing

limber plover
#

Hundresed of nodes in secs on how large of a system with how much ram do you need?

#

You don't have to answer that, but suppose you do it with some other simple problem

#

96gb is a lot of ram, though context here is how much processing are you doing with data, so it must be impressive, now assume you try to do this with even less ram. Would you think you could?

half pulsar
#

Answering that would tell you the magic behind the curtain

limber plover
#

Is that the active project then you are working on?

#

Ah see, we think a like. I am just approaching it from another way. Clearly you cannot out engineer complexity to scale eventually you do need to add more ram.

#

So I guess I am not crazy or stupid in this yeah?

#

Can you tell me what task you are trying to solve with AI?

#

Ok let me ask this, can you access your nodes procedurally?

#

Meaning only with in certain task given it access said nodes as some time when its done it dose not proceed with the data untill need more?

#

Not sure if I made that right.

#

No...hold on let me think on this;...

lunar heart
#

hi

limber plover
#

You have a node, and its doing some data or needs to access some data yes?

#

Ok wait, could you build nodes procedurally instead of by hand? if you know what I mean...

#

I feel like that is something you would want....

#

If I need more tasks done I would extend nodes but I have an algo that just builds them as needed yes?

half pulsar
#

I'm not going to describe the architecture

limber plover
#

Sure you don't have to, but you get the idea what I am saying, am I on the right track in thinking that generaly?

#

I am not looking at your project or needing some info from yours, I am just reasoning all of this out, I am not even doing any math about it lol

#

The problem is that overhead for having to build these nodes as needed, the processing would be hard, so I guess you would have to control rendering for said task. I could think using pricewise function for this where it predicts future use needs. Or something of along the lines of that sort of algo,.

half pulsar
#

Like this is as much as I'll let myself say aloud, You just gotta take it and make what you want from it

limber plover
#

This seems strange that I am actually that close to your project surely this all seems BS to you?

half pulsar
#

No you're way off-track but thats fine don't try to do what I did, it's up to you to build whats right.

limber plover
#

Oh ok good

half pulsar
#

Don't try to follow others don't poison yourself, Just like I said the answers become clear when you know the math

limber plover
#

Are you afraid that if you correct me it will leak out on how you are thinking so I might start develping along your lines?

half pulsar
#

I'm not going to tell you what ways or any other pointers, except for scale and stability. watch for complexity creep.

#

That's the only thing you should focus on after learning the math behind it

limber plover
#

What math would I need to know?

#

So linear algebra I suppose

half pulsar
limber plover
#

Let me ask this, can you make procedural nodes Suppse I do not care about processing for now.

#

Were you using some other architecture besides KGs. Meaning did you start the project with that in mind or was there some other you made before but did not work so you rebuilt it?

#

Anyway, I am actually not going to pursue this, its interesting to think about. I am just going to do some more python programming for whatever ideas I have, like procedural nodes that interests me now.

#

This is actually how I got my project here done, I was thinking this much and just implemented it, it worked but not sure how efficient it was, though I am not really good at programming.

half pulsar
#

This is something I've been working on since 2014. I don't expect people to try and pursue it.

limber plover
#

Wow a long time.

#

Congrats

#

Ayway, good luck I am out for now.

keen wind
#

is it possible to convert chemical structures and their information into vectors for machine learning

rare bane
# umbral dove https://theconversation.com/openai-has-deleted-the-word-safely-from-its-mission-...

With regards to this anthropic has followed suit and removed safely from its mission statements as well, and currently chatter around is due to the upper hierarchy of anthropic frowning at their usage of their AI in military operations. Of course they use grok and openAIs chatgpt, but it seems claude appears to be a cut above the rest. And with Pete hesgeth leaning towards labelling anthropic as a "supply chain risk", I guess it was bound to happen
https://www.linkedin.com/news/story/anthropic-shifts-stance-on-ai-safety-7047916

rich moth
lime grove
#

The military world demand this regardless of who's the President.

rich moth
#

Like a parade?

#

I get it, the USA and CHINA in a new space race, AI. Everything the admin does is political. But its not presidential the way its going down.

#

Theres no normalizing this dude.

#

anyways! im done

lime grove
#

It's just the nature of the Pentagon. I'm not saying I agree with this.

#

Consider nukes, for instance. There's no guarantee that they will not use them at some point, and any external control over their ability to use them would be unthinkable to them.
This whole topic is f****d

mossy blaze
urban heart
#

I'm running a flow for my chess model where I let codex: 1. validate lichess datasets (using python-chess) 2. upload to hf 3. start a runpod to train the model, 4. get it back 5. start and run the model with a bot account to compete against other bots on lichess. (all this done by codex running commands, scripts, and following my specs)

Anyone interesting in giving me some pointers or talking about this?

The model I'm training is a phase aware (early game, mid game, late game), LSTM next-move model, training on elite game PGNs (from lichess) capped at 4 random moves moves in the game.

smoky robin
#

guys i need help with something

#

I am building a NER for NCBI disease Corpus. my text abstract or the input sequence are variable in length. I Plan to use LSTM for this task initially and i am using TensorFlow as the framework. Problem is

How do I handle the Variable length input sequences

gritty vessel
smoky robin
gritty vessel
smoky robin
#

ok how do i do that? isnt that a feature available in pytorch?

#

another thing, i am having issues tokenizing the sequencing. i tokenized the labels but for the input i still have no idea what is the right approach

gritty vessel
#

I don't have any good resources for that but when I train something like convlstm or unet on images

#

Sometimes satellites don't capture data so it gives fillvalue or nans

#

I remember there is a way to ignore this fill values during training

#

I will share it as I find it

#

criterion = torch.nn.CrossEntropyLoss(ignore_index=255)

#

Something like this

smoky robin
#

yeah i researched it a bit apparently you pad the sequence by <PAD> and then mask it so it doesn't affect the training

gritty vessel
#

Great

smoky robin
#

there is another thing i worked on recently but it was never cleared up. lets say i train a RNN model on patient EEG session. in testing i pass on variable length input like pat 1 5 session, pat 2 20 session and so on so based on that how do you predict the rest of the sequence

fleet tendon
#

What do you think is the best way to track these types of stuff?

I tried doing something like a cup tracking but the resnet18 model I tried using to track coordinates of every cup at all frames is having trouble with the occluded samples (returning coordinates of nothing because it can't see the 2 hidden cups). Because of this, the network i trained to connect the coordinates per frame is making mistakes

Resnet18 might be lacking in resolution but i can't do heavier model as this is meant to run realtime

gritty vessel
smoky robin
gritty vessel
#

I will stop commenting as I am not qualified enough to comment on this ,I work on forecasting so in that I take like past 8images and then predict next 4images so it's little bit similar but still not enough to guide you.I will let someone else take over from here

gritty vessel
smoky robin
#

average of a EEG recording in a single session i believe

dire surge
tardy haven
#

Hii everyone I'm new here

dire surge
#

hello

granite zephyr
#

This may be a stupid question, but i want to use LinearRegression() from sklearns, and use the fit function model.fit(X=x_train, y=x_train), my teacher has written for us to use model.fit(X=x_train, y=x_train) and not model.fit(X=x_train, y=y_train) , is this a typo or am i misunderstanding something? Also is there a reason why the MSE of model.fit(X=x_train, y=x_train) is 500+?

serene scaffold
#

Do you understand x and y, and train and test?

granite zephyr
#

Yes, also forget the question about MSE, I used the y_val set.

serene scaffold
#

Great

granite zephyr
west wing
#

any one worked with equinox

serene scaffold
opaque condor
#

Has anyone ever used the hand drawn number and letter data set to generate a message using the dataset

tardy haven
#

Guyzz listen I want to create a bot for instagram gc any body knows how to make ?? So olzz help me

#

I want to impress my crush

opaque condor
tardy haven
#

Yuppp bro but I want to do something crazy for her

#

I want to make something by programming

opaque condor
#

How long have you been working with python?

tardy haven
opaque condor
#

Specifically because that gives us more of understanding the question

#

Do you want in AI or something like a regular program

rich moth
#

This is your crush we're talking about. Take a page out of Nikes book and just do it. 🙂

opaque condor
#

I'm sorry I just need more information before I can really give a response

tardy haven
tardy haven
tardy haven
opaque condor
tardy haven
#

I actually wanted to make a welcome bot for an Instagram group chat, but it’s quite difficult

#

But now I’m thinking of making one for my crush that trending blooming flower thing from reels and host it on something like Netlify

opaque condor
#

I'm sorry I wish I could help you

#

I don't really know how to use social media I wish I could help

tardy haven
#

It's okk broo

#

Bro, can you make something good for me on your schedule or way??

lyric vale
#

can anyone suggest me project in neural network for resume

limber plover
#

@lyric vale How simple or complex do you want the project? Why not make a simple binary classification and train the NN on dictionnary words for valid or not valid words. Something like dgo and dog, one is valid and one is not valid. You could then extend the list to add more words. Kinda over kill using NN but why not.

lyric vale
limber plover
#

@tardy haven You can make something better, if you are really into them you can simply say, here is thing I tried making, I was going to program this whole complex thing, but I am not really good at it, but I tried my best. I am rather sure they will appreciate the effort.

lyric vale
limber plover
#

@lyric vale Projects starts with what you know. What do you know?

lyric vale
limber plover
#

Predicting just text like 1 2 ..4 the what is missing is? Or something like context aware?

lyric vale
tardy haven
limber plover
#

@lyric vale That is fine, then since you know basic things about NN, you can just look up what a binary classification problem is. it is not any more complicated then predicting text.

lyric vale
limber plover
#

@tardy haven If it is beyond your limits then how can you possibly make it?

lyric vale
#

is it good idea or should i make smth else

limber plover
#

@tardy haven Start with what you know and then make it creatively, something amazing is very subjective to individual, it means nothing on what you are saying amazing except what you see it is.

tardy haven
tardy haven
limber plover
#

@lyric vale Well, how close is that to what you know? Try it and see. However, binary classification problem is not much different, you are classifying two choices, is 0..9 a number yes? valid, is abc a number no? then not valid, I am being abstract here but this can help you with other topics later on.

limber plover
#

@lyric vale That is just one way of using NN, you can also use it to build basic logic. Something like nand gates /AND/OR/XOR gates from that

lyric vale
limber plover
#

Now this is simple right? But try building ALU just using NN. Its a ridiculous project but interesting exercises

limber plover
#

Yes

#

Which you can build from previous gates. You can start with XOR or use NAND. Nand is mostly used as it is a bit faster.

#

You can then rearange nand in to any basic gates, from there you build your structure for like dmux and mux and so on. However, in your case you are using NN or several of them doing just that.

#

Its not going to be efficient, but that is not the point.

lyric vale
#

should i make it from scratch or use inbuild functions?

limber plover
#

Make a simple one, you can try training single NN on several chips or use several of them. You will have to have combine them eventualy using one after the other. NN based computers have been done before back in 1960

#

Depends on what you are interested in, is inbuild functions going to abstract too much from your learning?

lyric vale
#

probably i should use inbuilt functions cause if i make it from scratch then it will take too much time

limber plover
#

@lyric vale true but you would learn more. However, if you know in general what inbuild functions do, then its ok to have that abstracted for you and treat it like a blackbox. IN computer engineering, the engineer is not really interested in how exactly the transistor arranged inside it, they are only interested on what the chip is doing. You leave the rest to hardware engineer. So you can think of it that way in this small case.

lyric vale
limber plover
#

@lyric vale Its hobby for me, I don't work any place, I do like collecting knowledge though.

#

Mostly I set out to learn how to learn, but real applications I leave to someone else.

limber plover
#

I have a lot of info but very little depths

lyric vale
limber plover
#

@lyric vale I have, yes, though I never finish them. Mostly because the current question I have about something gets answered, then I don't really continue it or have to.

#

For example, I have build ALU before, but I never had it used for anything, I got the general idea on what it was doing but I lost interest in the rest of the 16 bit system I was using.

lyric vale
#

nice but you should make smth that usefull to people

limber plover
#

That is subjective. I cannot possibly know what is useful to people unless they say so. I can make something for me, that is useful and then hope someone finds it interesting.

wind breach
limber plover
#

This also helps me keep my mind steady, and not criticizes my self too much, since to me I can see the imperfections, like how efficient is it really build NN computer... but then I am too close to the subject, a layman might think it is impressive and someone might want to do something with it..next thing you know, you are selling a product you had no idea had this sort of use for it. Someone found out though.

#

Anyway, I digress try that project and see how far it goes..use inbuilt functions don't use it, who cares.

tardy haven
#

Guyzz help me what should I do to impress my crush

limber plover
#

That is a loaded question. Sounds like you don't know your crush well to impress her. For example does she like programming? If not then why do you want to use programming as a tool to impress. If you are telling her, look how smart I am on what I did, then that is a bit egocentric and you might have problems later down the road. Maybe just ask her out. And let her ask the questions on what you like then she might be impressed. Since you are not the one saying hey look I code.

#

Just know that this is AI and data science section. If you want advice ask general python group that is active.

tardy haven
#

She liked it last time, that’s the thing. I made something for her and she really liked it, and now I want to do something similar again. Can you help me?

limber plover
#

How would I know what you made? And what she liked. I have no idea what I would help you with but then ask yourself this. When you impress her, should I also step in and say oh yeah I made this as well.

#

I gave you some advice, good or bad, that is the best I can do.

tardy haven
#

According to you, give me something nice that girls would like. I can make it in a way that she will definitely like it 100%

limber plover
#

Sure, though you can just go to Python discussion and ask for help there. Show you previous work and ask, how can I improve on it.

tardy haven
#

My previous work wasn’t done very well, that’s why I’m asking for your help, sir

limber plover
#

That is fine, however, this is not the right sub unless you are asking with AI and data science. Python discussion is what this is for. There is python expert there to help so they say so ask them. But you have to be exact. What is your goal what have you made and how do you see improvement for it.

quasi pier
#

not sure if this is the right place for this question: Is anyone here familiar with Reinforcement Learning on Farama Foundation's highway-env ?
I'm having trouble getting decent results using DicreteActions with DQN.

cursive schooner
#

heyo

rich moth
#

do economic, environmental, and competitive pressures improve llm code patch quality? im about to run a controlled study with contamination auditing using the qwen3.5-35b-a3b model to test this theory.

#

i honestly think it will, but i was wondering about your guys opinions

#

ill share the visuals it produces nevertheless, for science!

barren gulch
#

is there no demand of Data Science / Data analysis in Healthcare sector? i haven't seen a single DA/DS job in healthcaer
am i cooked?

waxen kindle
waxen kindle
#

You can look for data scientist jobs outside of healthcare

barren gulch
#

yeh, but healthcare/medicine is the only thing that sparks my interest and sustains long-term attention

so perhaps i should use healthcare datasets to really learn DS and then apply to DS jobs outside of healthcare

waxen kindle
#

Yeah, or if you really wanna stay in healthcare, look at non data scientist jobs

barren gulch
#

right. i think i will do just that, use healthcare datasets to learn Data Science and after having learnt enough of DS, will apply outside of healthcare

#

skills would be transferable right? (my head says so, but i think i still need confirmation)

waxen kindle
#

Yes

barren gulch
#

alright, thanks!

half pulsar
barren gulch
#

I seee, how much is it expected to grow in next 2 year? like till 2028 😅

half pulsar
#

It'd probably be another 1 - 2 years before you'll start seeing things pop up for it

barren gulch
#

thank you!

lusty rune
#

Am I doing something wrong here?
I'm following along the python one liners book, I got to the neural network section. In his examples he gets a low finxter score for doing 0 hours of python coding in the input data, while mine gets a higher finxter score for 0 hours and a lower one for more hours. I'm getting the complete opposite behavior he is with the same dataset and inputs.

My first two responses was running the model twice at 0 hours, the next response was once at 20, the next was once at 50. The finxter score goes down the more hours of python weekly I input , but the book has the complete opposite behavior

#

But the code is the same and so is the dataset

#

I get why it thinks more hours is a less score since the lowest score is someone who says they code 35 hours a week

limber plover
#

@lusty rune Were you able to resolve it?

odd shell
#

How important is it knowing when to actually use dataframes or series? Cause the syntax is murdering my sanity. Example:

print(cars.loc[:, 'drives_right'])
print(cars['drives_right'])
print(cars[['drives_right']])
(python, pandas)

odd shell
#

Feel like there are too many ways to do something, or I'm doing something too many ways

odd shell
limber plover
#

@odd shell I would use ai to answer that question if you want researching it. You don't need to let it code for you, but answering basic questions will give you a general idea alike. Also, you could look at the sources its using it.

odd shell
#

I've limited my GPT on purpose using study methods, enforcing docs/community assistance feedback 😉

limber plover
#

@lusty rune What book are you reading?

lusty rune
odd shell
#

I've leaned too heavily previously, realising it was causing damage to my learning. Though, I agree, GPT/LLM's can be potent if used right!

lusty rune
#

All I can think is that I put in the wrong data

limber plover
#

I am not familiar with that book, I would have too look into the exercise.

lusty rune
#

But I spent 2 hours last night re checking the data set

odd shell
lusty rune
limber plover
#

Yeah just looking at it, it seems data set you used would be the problem, I don't see any problem with the code...

odd shell
#

I wonder if capital X is used due to grammar or purposefully

serene scaffold
lusty rune
odd shell
#

I don't work with scikit, but this is honestly interesting lol

serene scaffold
#

except that array shouldn't even be called X because it has the y in the last column

lusty rune
#

Also got the automate the boring stuff with python book

#

The secret life of programs is probably my favorite

odd shell
#

Of course SQL is the thickest.... ☠️

lusty rune
#

The powershell one I bought for my buddy's birthday

odd shell
#

whenever I see powershell, I think, maybe I should switch to Linux completely? 😂

lusty rune
#

This one is pretty thick too

odd shell
#

Yea, I know they're useful books, but too general 🙁

limber plover
#

@lusty rune That is actually what I was thinking to, I was about to ask when was this book published...is there a way to look up the library and its updates?

odd shell
#

I think they often come with repos?

lusty rune
limber plover
#

I finished looking at the array but I do not see a discrepancy between yours and theirs...so it has to be the library

lusty rune
lusty rune
limber plover
#

The best thing to do I suspect, but you said you know why its behaving the way it is, so you generaly understand what the book is talking about, I would not get too hung up on it.

odd shell
#

I played once with scikit on data I pulled from a videogame(eve online) 4 million rows or so, and still ended up a lot of overfitting

#

not familiar truly with the math/stats how to properly use it

#

pandas doesnt like that amount of data either lol

lusty rune
#

It's still super fun

lusty rune
limber plover
#

ML is always interesting...I which I kept my code from when I was doing classification problems

odd shell
#

I was in way over my head, but had a lot of fun

limber plover
#

I got to a point where I kept building the dictionary data set in text and then parsing it for training so that then it could tell me valid and not valid words based on examples.

#

Example is cta and cat, cta is not valid but cat is, however, it was interesting that when I did tac which was not part of the binary decision, it still said not valid. Which makes sense but I did not program it for that.

odd shell
#

This lib seems pretty interesting on say historical market data 🤔

lusty rune
#

When I get done with this section I wanna teach an AI how to play blackjack or poker

limber plover
#

Yeah you do get emergent behaviors from them.

lusty rune
#

I'm working on a little RPG game and it would be cool to teach my NPC enemies how to make the best moves based on players decisions

lusty rune
# odd shell This lib seems pretty interesting on say historical market data 🤔

The first exercise in the ML section was using linear regression to predict a little stock market sample, it's been really cool learning about the different algorithms. This stuff used to be so intimidating to me but the way the book breaks it down is easy to understand and when I don't understand something too well I do more research on it

limber plover
#

That sounds like you will need a lot of data for that as players play. I can see them getting smarter over time, but it will take a while for that.

odd shell
#

Im trying this method now on a pokemon dataset and see if I can match later* generations with only earlier generation data

limber plover
#

My favorite book I used was Grokking Algorithms..

lusty rune
lusty rune
#

I might have to check it out

odd shell
#

Alright, wonder what happens if I do this regression consectuively for gens? 😄

#

i guess meta-shfits from designers prolly makes it harder, unless they stick to their philosophy methodology?

#

or lack of data and we get goey? 😂

limber plover
#

I am more interested in the library directly and its math. I never used it, I just set out making my own at some point.

odd shell
#

yea, i get that. i need to start digging into math more

#

instead of building stuff i dont understand in the end truly

lusty rune
limber plover
#

Yeah, though I have not done this in a while so I forgot a lot about it now.

#

I do electronics mostly, so I hardly have to deal with this high level programming. Mostly low embedded system programming.

#

I am only coming back because I don't have the budget to continue it and software is a lot simpler to get into.

odd shell
#

yeah..the tools to do data analysis is so damn accessible

lusty rune
#

Whoops

#

Wrong reply

limber plover
#

The fact that python is free is amazing to me

lusty rune
odd shell
#

something.microsoft?

limber plover
#

Well, yes, for licenses.

#

If you plan on using it in commercial setting then yeah. For example there is programming language forth. Well, swift forth that is rather limiting until you pay.

odd shell
#

the problem is rather where do you store all the data from the languages 😂

limber plover
#

Same with something like "true basic"

#

If I remember right some charge you for compilers and such...language dependent

odd shell
#

@lusty rune Did a "prediction" on type, if x stats = fire or water? then added +1 on every consecutive generation to see if it improved actual confidence. and eh, yea lol. they changed how they defined types. but also missing lots of nuance. ofc. anyhow, fun stuff

#

could improve the model (or worsen) if we consider every type, or more

rancid thorn
#

Guys im trying to make an LSTM model but for some reason the loss flatlines and the outputs end up being all the same

#

(or very similar)

#

Also while training

limber plover
#

are you training a sensor on different weather types ?

rancid thorn
#

Train loss decreases slowly

rancid thorn
#

Basically I give it the last, say, 20 days, and it tells me the weather of tomorrow

rancid thorn
#

No real improvement overall

limber plover
#

Wow, that is going to be hard but interesting.

rancid thorn
#

But for some reason it doesnt work

limber plover
#

Yeah, I am not sure , I would have to look at the book you are using

rancid thorn
#

textbook application doesnt mean it comes from a textbooks

#

It means its common/classic

limber plover
#

OH I thought you were working it out from textbook sorry.

#

Are you using a library for this?

rancid thorn
#

PyTorch

#

pretty standard for AI

limber plover
#

Has PyTorch been updated recently?

rancid thorn
#

in this AI era I really doubt its left to itself

#

OpenAI uses it

#

All big AI firms do too

limber plover
#

You can look up recent updates, it might be something with this if your are sure you data is right

rancid thorn
#

No I doubt they messed up LSTMs

limber plover
#

Not sure then, I have neve used it, but I thought from systems approach it might be that.

#

@rancid thorn Have you tried asking Ai on this?

rancid thorn
#

what do you mean?

limber plover
#

Asking ai on the problem you are having you said something "loss flatlines and the outputs end up being all the same?"

rancid thorn
#

wont help

#

ai sucks at this kind of stuff

limber plover
#

Well, what I got, not sure if makes any sense to you but "the model has collapsed to a trivial solution, like predicting the mean or mode of targets across all timesteps"

#

Using science direct as its source though \

#

Maybe that is too general?

rancid thorn
#

no that doesnt really seem to be the issue

limber plover
#

Were you just testing it or you know for sure?

#

I mean another one I got was "Unnormalized inputs or targets cause exploding/vanishing gradients, forcing the model to output safe constant values" uncertain if this helps.

rancid thorn
#

Im clipping the values and the inputs are already normalized

limber plover
#

All right what about "Learning Rate Problems
Too-low LR traps the optimizer in flat loss regions; too-high causes oscillations ending in constant predictions"

#

This sources is reddit so..take that with a grain of salt

rancid thorn
#

Nope learning rate is a normal one

limber plover
#

Have you ever made LSTM from scratch?

rancid thorn
#

this is my first time

limber plover
#

No I mean the library that you are using which use LSTMs yes?

rancid thorn
#

yeah pytorch

jaunty helm
limber plover
#

Well the other suggestions I got dead neurons not sure how accurate that is but I don't even know if you can see this or test this.

#

Sorry, not really helping as I don't know much about them. I just never use libraries for this and take apart what LSTM is. Maybe the math on how it actually does it.

#

For me its more of "its fine if you don't want to know what a brick is to lay it down, but if you want to know why it keeps crumbling, you better get to know the chemicals of it"

#

Why not look into pytorch forums if they have any, maybe they had a problem like yours

rancid thorn
#

I mean understand what you're doing, necessary

#

Making it yourself from scratch, will probably end up doing worse

jaunty helm
#

also on the topic of lstm (or deep learning in general) for time series:
every other week, some new hot sophisticated dl architecture for ts will come out boasting sota performance
but also, don't sleep on "traditional and outdated" methods like arima/ets, which are still surprisingly competitive in certain scenarios

rancid thorn
#

What are "sota" and "arima"?

limber plover
#

@rancid thorn TURE but it would help you see the picture better.

tardy kayak
#

hello

jaunty helm
#

describing them in detail on discord is probably not very effective, you can search them up when you want to learn more

rancid thorn
#

oh okay i will thanks

#

This is the source data (a slice)

#

Bigger slice

#

From a purely visual standpoint, Id say theres some pattern

#

So it can work

limber plover
#

So that is the original data?

rancid thorn
#

yes

#

Not the whole data

limber plover
#

What is the predicted then?

#

Sorry, I am going to ask basic questions, as I am now reading up on what LSTM is

#

I am reading that the limit for them is "Manual Optimization: Requires tuning for best performance" How do you know you optimized it well?

rancid thorn
#

Its supposed to take the features of a series of days and predict the next day/sequence of days

rancid thorn
#

If it doesnt then thats not the issue

limber plover
#

What do you mean a bit? Why not indefinite?

#

Is there away to can plot out the predicted data vs original?

#

One of the things I see is "PyTorch provides a clean and flexible API to build and train LSTM models" Yet they also state "Version Gaps: API changes may affect older code"

rancid thorn
rancid thorn
#

Also you can check how good the model is from loss

limber plover
#

@rancid thorn I did not understand the way you phrased it. "before tuning you need to have a model that works at least a bit" what does this mean exactly? works at least a bit is like I guess it should work....seems uncertain..

rancid thorn
#

If the model has some fundamental flaw that renders it completely useless fine tuning is useless

#

You need rough tuning before fine tuning

limber plover
#

Ahh ok, and you have to do this all manually?

rancid thorn
#

Well you have to make the code

#

Make code that works

limber plover
#

I thought tuning meant specific weight adjustments?

#

You have it coded so would there not be some constants you can adjust?

warm dune
#

to 'improve' the data and model can get a better accuracy

limber plover
#

Yeah nevermind I got them confused, its been a while, yeah you would not adjust that as that is what the NN is doing ,you would adjust the learning speed and so on...

#

Also last NN I worked with was VERY simple XOR problem that I remember right now so I could do it manually

#

Anyway, I am losing interest now, since I am doing some other project. Hope someone can help you.

warm dune
#

are u trying to improve a model?

limber plover
#

I am not the one that had the problem StraReal did, you will have to scroll up to read their specific problem.

limber plover
#

Well, you did join in midway so there is that.

#

@rancid thorn Do you know if you can express LSTM like this F = x'y' + x'y + xy' = y'(x' + x) + xy'? Similar to product of sums in digital logic?

#

Sorry I just had to ask, as I am curious

rancid thorn
limber plover
#

Yeah I know based on how large it is.

rancid thorn
#

I mean theres two main variables, the short term and long term memory

#

+the input

#

Then they go through the forget gate, the input gate and the output gate

limber plover
#

I was trying to see based on what I could find, it looks like something you can express similar to logic gates

#

I know a bit more about digital electronics and if I can connect my thinking that way, maybe I can understand it more

rancid thorn
#

This is an LSTM expressed as a mathematical formula

limber plover
#

Interesting

warm dune
rancid thorn
#

Yeah

#

And exploding gradient

#

And in doing that it adds long term memory which is really cool

limber plover
#

So similar to programmable memory?

rancid thorn
#

Not sure what that is

warm dune
#

it's like the RMSProp?

rancid thorn
# warm dune how that works?

Basically long term memory gets carried from each LSTM to the next and it passes through a forget gate, which decides what % of it to remember, and an input gate, which decides what to add to the long term memory

#

It is never outright deleted

#

And short term memory is carried from one LSTM to the next but it doesnt go to the one after it too

warm dune
rancid thorn
#

Yeah I guess

#

LSTM literally means Long/Short-Term Memory

warm dune
#

rn i'm in the optimizers

#

dont see the nn

limber plover
#

Basically it programs its own memory, it cannot be really expressed with bool logic but if you make it over time then you can sort of get it. The expresson I showed was F = x'y' + x'y + xy' = y'(x' + x) + xy' You can use this to minimze gate use and reduce to using say 3 and gates instead of 10. However, LSTM is not exactly like this but does have it basically overtime expression. Seems to use sigmoid a lot.

warm dune
#

it's like a optimizer for rnn?

#

to resolve the vanishing and gradient probleml

limber plover
#

Well, I am not accurate this is just my understanding, its not pure logic like that or static, so you cannot really use it. You cannot use digital logic gate expressions from input to out for this. IT chnages over time as needed, so in digital logice its 1 or 0 for input but with this its way more complicated than that.

rancid thorn
#

But it is an RNN

limber plover
#

Ok anyway, I really need to stop thinking about this for now.

rancid thorn
#

You should watch this

#

Its really good

warm dune
#

and your rnn its for what?

rancid thorn
#

in this case time series forecasting

#

Which is basically having a series of values and predicting the next

warm dune
rancid thorn
#

Also if you dont know exactly what RNNs are watch this
https://www.youtube.com/watch?v=AsNTP8Kwu80

When you don't always have the same amount of data, like when translating different sentences from one language to another, or making stock market predictions from different companies, Recurrent Neural Networks come to the rescue. In this StatQuest, we'll show you how Recurrent Neural Networks work, one step at a time, and then we'll show you th...

▶ Play video
rancid thorn
#

But in the worst case scenario I guess it will be study

warm dune
#

i'm not at that level yet.

#

i want to start studying NN until may, cuz currently I'm focusing more on the intermediate level, such as problems and gradient types and their optimizers (LR, regularization, ADAM, RMSprop, LR scheduler)

rancid thorn
#

Learn the basics of NNs

#

Backpropagation, the chain rule, RNNs, what an NN even is

#

And then go into the details of every step

warm dune
rancid thorn
#

Oh yeah then youre good lol

warm dune
#

activations functions and more

#

but just for simples like linear models

#

i never code a nn

#

just doing model with 1 layer (linear models)

rancid thorn
#

really get it printed into memory

#

Output...

warm dune
rancid thorn
#

No no its bad in everything

#

theres no variation

warm dune
rancid thorn
#

not really

warm dune
rancid thorn
#

the date feature is the period of the year

#

at start and end of the year its 0

#

and towards the middle its 1

#

Just a cosine wave made to wrap from 0 to 1

warm dune
#

in train set

rancid thorn
#

I have these
Epoch 02 | train loss: 1.00803 | val loss: 0.96275

warm dune
#

do you check if overfitting?

warm dune
rancid thorn
#

The thing is it doesnt improve

warm dune
rancid thorn
#

no I just took this one as sample

rancid thorn
warm dune
#

can you send the first, the middle and the last?

warm dune
rancid thorn
#

No no

#

with overfitting the train loss would go down by a lot

#

It would end up looking right

#

Sure it wouldnt be useful

#

But it would end up looking right

#

But it doesnt

warm dune
#

the train loss down and the val loss increase

#

thats not a overfitting?

rancid thorn
#

over time, it slightly is, but overall its not

#

What really happening here

#

Is that its as if the model wasnt even being trained

#

Its as if it was back to the start at every epoch

warm dune
rancid thorn
#

yes

warm dune
rancid thorn
#

checked it a thousand times

#
for epoch in range(1, NUM_EPOCHS + 1):
    model.train()
    train_losses = []

    for seq, target in train_loader:
        seq = seq.permute(1, 0, 2)

        optimizer.zero_grad()
        pred = model(seq)
        loss = criterion(pred, target)
        loss.backward()
        torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.5)
        optimizer.step()
        train_losses.append(loss.item())

    # ---- validation ----
    model.eval()
    val_losses = []
    with torch.no_grad():
        for seq, target in val_loader:
            seq = seq.permute(1, 0, 2)
            pred = model(seq)
            loss = criterion(pred, target)
            val_losses.append(loss.item())

    avg_train = np.mean(train_losses)
    avg_val   = np.mean(val_losses)

    print(f'Epoch {epoch:02d} | train loss: {avg_train:.5f} | val loss: {avg_val:.5f}')


    if avg_val < best_val:
        best_val = avg_val
        torch.save(model.state_dict(), 'best_weather_lstm.pt')
        cprint(' -> checkpoint saved', 'm')```
warm dune
#

i see in somewhere the thinking to avoid that

#

i'ts like

#

the loss don't decrease, so the weights dont change
the weights dont change, so we need to check the weights numbers (check the optimizer)
why the weights dont change? (lr lower/higher, the model find a local minimum) and more

#

i think in this case the model find a local minimum, do you check that?

rancid thorn
#

doesnt work

warm dune
rancid thorn
warm dune
#

try the AdamW

warm dune
rancid thorn
#

No, if it was that it would do a little optimization before flatlining

#

here it flatlines at the start and just stops

warm dune
#

well

#

try to increase the batch size

#

or put a lr schuler

#

or idk try to increase the alpha regularization terms

rancid thorn
#

already tried changing all the hyper parameters

warm dune
#

the data have outliers?

rancid thorn
#

I really dont know what could possibly be the issue

rancid thorn
warm dune
#

like starts in 0.00001 and goes to 0.001

rancid thorn
#

no but it doesnt really matter if with both high and low values it doesnt work

rancid thorn
#

yep, its normalized

warm dune
#

" In your model (in the LSTM/RNN definition), is the batch_first parameter set to False? If it's set to True and you perform this permute, you are training the model with batch instead of time. This would explain why the validation loss never stabilizes: the model is trying to find temporal patterns in dimensions that are, in fact, different samples. "

rancid thorn
#

"This would explain why the validation loss never stabilizes" not the issue at hand

#

Also its set to False

warm dune
#

and after goes down

#

idk if it's a overfitting problem that are invisible or something

rancid thorn
limber plover
#

@rancid thorn Still on that problem huh? Have your tired asking the pytorch community? I am sure they have discord.

tawdry heart
#

@rancid thorn

#

oh nvm i cant send the invite mb

rare bane
#

Ooh do we reckon openai has a massive lifeline, now that they've signed with the US secretary of defense?

#

I know they were leaking money bad, but surely that's on the up for them from here

acoustic fjord
#

Working on Kaggle right now, didn't even know this could happen lol

high rivet
#

hi everyone, Tooba here. I am a SWE Junior, currently in 6th semester.

#

Need your help: I am currently studying "Data Science for SE" course and for its 2nd assignment, I have to effectively visualize big data, which is basically the data of 4 parameters: No2, O3, PM2.5, PM10 of 100 stations hourly data across the world for the year 2025. The problem is that the data is so huge that I am unable to visualize it meaningfully so as to convey anything properly. See the image attached. Well, I need your suggestions or maybe yt tutorials to effective visualizations of such big data....thank you : )

west wing
#

find their RFE

#

if its a linear data

#

you can also check their VIF score to find multicollinearity

rancid thorn
#

What its trying to predict

#

Cyclically repeats, always in the exact same way

#

What it did

#

So the code must be wrong

#

Because i do know for a fact that an LSTM can predict this

left tartan
rancid thorn
#

What do you mean?

left tartan
#

And, even then, if the intervals are constant, that dimension doesn't provide anything useful since it's ordered.

rancid thorn
#

theres no separate date and time dimensions

#

Theres one feature

#

date

#

Then the dataframe is ordered of course, but that doesnt indicate in any way seasonality, as if the weather snapshots were taken, say, 2 days apart, then the formula would have to change

wet dome
#

Is anyone working as a data engineer here?

left tartan
rancid thorn
#

Yeah, I'll send it again

#

!paste

arctic wedgeBOT
#
Pasting large amounts of code

So that everyone can easily read your code, you can paste it in this website:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

pliant steppe
#

Anyone here could help with a computer vision task? Im supposed to perform transfer learning on a pretrained model like YOLOv8-seg which i did, and the end result is very bad, i lowered the confidence threshold to 0.05 and it only gave me a single prediction of a small piece of a dendrite as seen in the image.
the labels it was trained on are manually done by me and my friend and we took a good amount of time to do them correctly so that isnt the problem either, im suspecting the biggest problem is the image size i set when training and triggering inference which is 1024 but the real image dimensions are:
(2188, 3072, 3) h, w, channels
(1094, 1536, 3)

Maybe because the dendrites are so thin in some areas or faint it ends up squishing it out of existence when resizing.

rancid thorn
#

On line 50 and 51 theres a commented line with the actual features id wanna use

left tartan
#

So, I reduced your problem down to 1 dimension: max temp

rancid thorn
#

Howd you make this visualization

#

can you send the code?

left tartan
#

Plotly, I will, just cleaning it.

#

This is how I like to visualize predictions

rancid thorn
left tartan
#

I think your lr was probably the biggest problem

rancid thorn
#

whatd you do to it?

left tartan
#

Changes to 1e-3, from 1e-1

#

With 1e-1: Epoch 20 | train loss: 0.98093 | val loss: 1.14488

#

At 1e-3: Epoch 20 | train loss: 0.45296 | val loss: 0.49629

rancid thorn
#

But its not 1e-1

#

its 1e-3

left tartan
#

Oh, 1e-5 is also Epoch 20 | train loss: 0.97913 | val loss: 1.11007

#

There's a bunch of hyperparameters to play around with her

#

layers, dropout, lr, etc

rancid thorn
#

Well but the loss was 1e-3 in the code I sent

left tartan
rancid thorn
#

Why? Thats worse

#

I also removed the weight decay now

left tartan
#

Anyway, all I'm saying is, reduce to a single parameter: features = ['MaxTemp']

rancid thorn
#

Can you send the code to plot it like you did?

left tartan
#

Yup, one sec

pliant steppe
rancid thorn
#

@left tartan ?

left tartan
rancid thorn
#

oh sure lol

left tartan
#

I made a few small changes to your code, mainly changing lr... but also had to set device in a few places for cpu/gpu. The main thing in my case is I changed features to feature s= ["MaxTemp"].

main grail
#

Hey friends. I'm developing a stocks trading app that learns from trading behavior. It can describe any market condition. It is something like a state machine. But I have no experience with ML. If someone is interested in having a look. I'd love some feedback.

opaque condor
#

By getting the device
That play torch can run on is there a way of limiting how much is used

warm fossil
#

Hi, is leetcode good for practicing for interviews?

waxen kindle
#

It's good to practice leetcode questions, which may or may not be asked by interviewers

serene scaffold
sterile sierra
serene scaffold
sterile sierra
#

personality DSL for agents, compatible w langchain, lets you be deterministic about ai personalities

#

compiler's in OCaml and is lightning fast

jaunty helm
sterile sierra
# jaunty helm quickly skimming through it, I'm not sure how this isn't just a glorified system...

The output is a system prompt because that's the interface LLMs expose and calling it a "prompt swapper" is like calling TypeScript a "glorified JS writer." Behind that output is an OCaml compiler with a real lexer/parser, typed AST, semantic analysis (cycle detection, conflicting modifiers, contradictory rules), and multi-target codegen (JSON, Lua, SQL, Cypher - not just prompts). The numeric values aren't for the LLM to interpret literally howeber they drive the DSL's rule system: behavioral conditionals, cross-trait interactions, evolution deltas, and compile-time validation that "high helpfulness" can't participate in.

jaunty helm
# sterile sierra The output is a system prompt because that's the interface LLMs expose and calli...

behavioral conditionals, cross-trait interactions, evolution deltas, and compile-time validation
literally what does that mean? I mean I can guess, but it sounds like you understand it more, so please go ahead?

I guess my thoughts are like, ts is a lot more complicated than js, but it provides very visible benefits (like, well I mean, types)
the dsl thing is a lot more complicated, but in the end needs me to do the same amount of work as just writing a system prompt without it anyway?

#

ig if it works for you great, though currently I'm not seeing like too much benefit

rich moth
pliant steppe
#

oh my god it workeddd

#

from artificial stupidity to intelligence 📈

rancid thorn
pliant steppe
jaunty helm
#

<@&831776746206265384> looks like ad ^?

zenith nova
#

!cleanban 906481045044625428 ads

arctic wedgeBOT
#

:incoming_envelope: :ok_hand: applied ban to @pliant temple permanently.

warm dune
rancid thorn
#

nah

past bramble
#

Where can I find data for code of multiple programming langs, preferably labelled in large amounts?

serene scaffold
past bramble
past bramble
worldly dawn
past bramble
worldly dawn
final kiln
#

y did they have to choose json for tool definition in LLMs and for structured output

#

is supa token intensive

#

even if the LLM is cheap it just reduces performance cuz of all the cluttering in the input prompt

spring field
#

why does it have to be structured at all? does it make a difference to the LLM?

#

maybe it does make a difference if you want to parse the output

#

(and subsequently deterministically change said output to feed it back to the LLM)

arctic silo
#

I'm building a QA agent and I'm handling the context so how can I split the codeabse and index it so be used later on or there an mcp server that handle all of that ??

#

in general how to handle the context it contains hundred line of codes?

jaunty helm
#

there has been code-oriented embedding models coming out lately that you could try to RAG with

jaunty helm
#

that said tho I went looking and landed here
I might try yaml instead 🤔

final kiln
#

im impressed that yaml is actually better than json

final kiln
#

wonder if they have the code for their bench

#

interesting stuff

jaunty helm
final kiln
jaunty helm
#

ig just prompt for yaml output? probably a lot less reliable

#

local might work ig
llamacpp grammar keeps winning
-# never used the others so wouldn't know about them

final kiln
#

long term Id probably just use json cuz they are actively training the models for it

jaunty helm
#

honestly I wouldn't be surprised if newer models have/will have tokenizers specifically optimizing for json token length

fair aspen
#

hey guys

#

what's the best way to run llms locally with python?

#

I've tried many different things but none of them worked (vllm, transformers)

serene scaffold
fair aspen
#

When I say it didn't work, I mean that I kept getting error after error and got burned out

serene scaffold
#

You have to show the code and the whole error message for people to be able to help you.

serene scaffold
#

(but if you just try to run them with ollama, and you don't know why you were getting errors before, you'll probably get errors with ollama.)

grand minnow
fair aspen
#

yay

#

I think I'm having an issue with ROCm

#

print(torch.cuda.is_available()) returns false

#

I have an RX 9060 XT by the way

grand minnow
fair aspen
grand minnow
grand minnow
final kiln
#

is the whole "LLMs are a reflection of ourselves" typa thing

#

at least me personally, I'd rank yaml > XML > json as for ease of understanding

fair aspen
#

thanks

jaunty helm