#data-science-and-ml | Python | Page 145

strong cove Sep 5, 2024, 7:18 AM

#

Ye it sounds hard

lapis sequoia Sep 5, 2024, 8:21 AM

#

it’s like 20 terabytes all together

verbal oar Sep 5, 2024, 8:35 AM

#

I have master's so what phd math I need, according to prevoius message/s?

#

and do people come from applied math who implemented these scikitlearn things? I mean these more advanced not these which you learn at university

#

so these books like math for machine learning or math for deep learning?

odd stratus Sep 5, 2024, 8:39 AM

#

lapis sequoia it’s like 20 terabytes all together

lmao, yeah id get an external hardrive and then use PILGRIM and OS to get the files one at a time when needed during runtime
doing preprocessing on the data might also help but it would take a while for 9 million images

lapis sequoia Sep 5, 2024, 8:41 AM

#

odd stratus lmao, yeah id get an external hardrive and then use PILGRIM and OS to get the fi...

So it would be a bad idea to store them on s3? why?

quaint mulch Sep 5, 2024, 8:47 AM

#

keep the kernel size 3x3, very rarely you need anything else
don't use stride or dilation (unless you are doing wavenet)
do just add 1 padding (if I remember correct).
this way, you keep the dimension the same between layers, you can add them res-net style.

I hate doing this math too

odd stratus Sep 5, 2024, 8:47 AM

#

lapis sequoia So it would be a bad idea to store them on s3? why?

its way easier and faster and permanent
i have no idea what the cost of s3 cloud storage is but i do know its a monthly cost and a hard drive is only an up front cost

verbal oar Sep 5, 2024, 8:48 AM

#

I mean sth like lars,omp etc

quaint mulch Sep 5, 2024, 8:50 AM

#

verbal oar I mean sth like lars,omp etc

Orthogonal Matching Pursuit (OMP) and Least Angle Regression (LARS)?
I'm not sure how they are particularly relevant?

verbal oar Sep 5, 2024, 8:51 AM

#

yes these abbr

#

first I dont know what it is but second I see variant of regression

odd stratus Sep 5, 2024, 8:51 AM

#

lapis sequoia So it would be a bad idea to store them on s3? why?

quick calculation i did shows that it costs about~
$460 for a month of standard S3 20 Terabytes
and a 20 Terabyte hardrive costs about $800-900

so if youre going to be using it for long term storage its better to buy the hardrive i guess

full furnace Sep 5, 2024, 8:51 AM

#

quaint mulch Orthogonal Matching Pursuit (OMP) and Least Angle Regression (LARS)? I'm not sur...

Ur Indonesian?

quaint mulch Sep 5, 2024, 8:52 AM

#

Yes

verbal oar Sep 5, 2024, 8:53 AM

#

is there pdf of user guide or at least single html of it I mean scikitlearn I wanto to skim this

full furnace Sep 5, 2024, 8:54 AM

#

quaint mulch Yes

Saya mau nanya seputar interview

past bramble Sep 5, 2024, 9:07 AM

#

quaint mulch keep the kernel size 3x3, very rarely you need anything else don't use stride or...

I'll take your advice, for now after long time I ended up with this:

Im doing GANs, so I used 5x5 kernel, and 2 strides in transpose to upscale them each time. I kept the padding same..

quaint mulch Sep 5, 2024, 9:08 AM

#

past bramble I'll take your advice, for now after long time I ended up with this: Im doing G...

btw, I'm not familiar with GAN, so I could be wrong. Generally speaking, unless there is a very specific reason, why not just use the existing architectures?

past bramble Sep 5, 2024, 9:21 AM

#

quaint mulch btw, I'm not familiar with GAN, so I could be wrong. Generally speaking, unless ...

what architectures? If you mean the layers and structure, I want to learn how they work by building things myself

#

man I dont know if this will work, even kaggle's GPU P100 ran into memory error. Reduced by batch size for images from 32 -> 16 -> 8 now. Even this is taking too long.
~3 minutes for each batch

#

I hope the result will be good

#

I accidentally added cmap='gray' to see results every epoch 💀
It's already been through 8 epochs for 25 minutes, I can't change it now

#

i was thinking why it's all black and white

quaint mulch Sep 5, 2024, 9:25 AM

#

past bramble what architectures? If you mean the layers and structure, I want to learn how th...

That's true, but for me personally, I would to prefer by getting started with

finding a few famous architecture,
going through the source code to make sure I absolutely understand every single line and WHY
tweaking them and see what works / fail, usually to answer the why question, Why they do it this way, not some other way? Let's try the other way and see. Sometimes you figure out why they do it that way, sometimes you just became an inventor.

Going from scratch are very useful, but also painful.

past bramble Sep 5, 2024, 9:29 AM

#

quaint mulch That's true, but for me personally, I would to prefer by getting started with 1....

that's a great way to learn as well, to each their own. I won't stick to my method for too long if the results aren't as good. I'll listen to yours if I need so

What have you learned so far?

quaint mulch Sep 5, 2024, 9:30 AM

#

I put nearly everything I learned so far online https://www.arianprabowo.com/research-and-publications https://scholar.google.com/citations?user=ozZvUN4AAAAJ

Arian Prabowo - Research and Publications

BTS: Building Timeseries Dataset:
Empowering Large-Scale Building Analytics

Arian Prabowo

‪University of New South Wales‬ - ‪‪Cited by 225‬‬ - ‪Spatiotemporal‬ - ‪forecasting‬ - ‪GNN‬ - ‪contrastive learning‬ - ‪geometric deep learning.‬

verbal oar Sep 5, 2024, 9:31 AM

#

its just approx 400 pages related to supervised unsupervised learning so its doable to read

#

rest are examples, api reference total 2.5k pages

lapis sequoia Sep 5, 2024, 9:32 AM

#

odd stratus quick calculation i did shows that it costs about~ $460 for a month of standard ...

What about storing it in Glacier

verbal oar Sep 5, 2024, 9:33 AM

#

oh I see you are in geometric deep learning so to do for example deep render I need scene graph so GNN?

quaint mulch Sep 5, 2024, 9:34 AM

#

verbal oar oh I see you are in geometric deep learning so to do for example deep render I n...

I'm not sure if this question is addressed to me. And if it is, I still don't understand the question.

verbal oar Sep 5, 2024, 9:35 AM

#

yes, for example if someone do deep rendering then need scene graph for it?

#

I mean when just one object so cnn is enough

#

https://github.com/Lydorn/DeepRenderEngine/tree/master

GitHub

GitHub - Lydorn/DeepRenderEngine: A Deep Learning approach to 3D re...

A Deep Learning approach to 3D rendering. Contribute to Lydorn/DeepRenderEngine development by creating an account on GitHub.

lapis sequoia Sep 5, 2024, 9:37 AM

#

lapis sequoia What about storing it in Glacier

Worried that If I store it on a hard drive it will get corrupted

verbal oar Sep 5, 2024, 9:37 AM

#

this is interesting

#

but for scene he uses just cnn

#

and he says in readme to make triangles need rnn (so sequences)

odd stratus Sep 5, 2024, 9:43 AM

#

lapis sequoia What about storing it in Glacier

that would take forever to get each file at runtime no?

lapis sequoia Sep 5, 2024, 9:45 AM

#

odd stratus that would take forever to get each file at runtime no?

Yeah this is my concern

#

Because the files have to be redownloaded right?

quaint mulch Sep 5, 2024, 9:46 AM

#

I'm not sure how to answer your questions, but I have a few comments.
Firstly, when I say geometric deep learning, I usually refer to non-euclidean geometry.
Secondly, I am not familiary with neural rendering. I have read some papers, I think they have really interesting, but I have never used it, so I can't make any practical suggestions.
Finally, it seems that the best approach is using radiance field instead of CNN https://paperswithcode.com/task/neural-rendering

Papers with Code - Neural Rendering

Given a representation of a 3D scene of some kind (point cloud, mesh, voxels, etc.), the task is to create an algorithm that can produce photorealistic renderings of this scene from an arbitrary viewpoint. Sometimes, the task is accompanied by image/scene appearance manipulation.

somber tulip Sep 5, 2024, 9:46 AM

#

Hey, I want to evaluate the quality of my documents corpus. Quality means that it should provide information, be coherent etc… my corpus could be in any language. For the moment I tokenized my text and compute shanon entropy but I want to mesure in a better way

#

If people someone could help me I would be very grateful

odd stratus Sep 5, 2024, 9:51 AM

#

lapis sequoia Because the files have to be redownloaded right?

yeah, that would be an issue
if you want speed you need to have direct access tot hem
so neither of the deep storage models will work for you
you might be able to use the infrequent access model though, but i think youd be using standard if youre going to be using the images a lot for training
and if thats the case, a secondary storage connected to the computer is a lot easier to work with and cheaper over time as it only inurs an up front cost
but its up to you what suits your needs

lapis sequoia Sep 5, 2024, 9:53 AM

#

odd stratus yeah, that would be an issue if you want speed you need to have direct access to...

How do I ensure data doesn’t get corrupted

odd stratus Sep 5, 2024, 9:58 AM

#

lapis sequoia How do I ensure data doesn’t get corrupted

lead box /j

#

not sure, i havent worked with that amount of data before

#

but if its jsut training data, i think adding a small detector before loading each file would work well enough

#

because if a single file amongst 9 million gets corrupted, as long as you can stop it getting into the network, then you should be fine

lapis sequoia Sep 5, 2024, 10:00 AM

#

odd stratus because if a single file amongst 9 million gets corrupted, as long as you can st...

To detect if the file is corrupted?

odd stratus Sep 5, 2024, 10:01 AM

#

lapis sequoia To detect if the file is corrupted?

during the preprocessing stage of loading the file for use in training etc.
when loading and processing it, if it was corrupted it would cause a runtime error
so place some tests to check and stop those types of files, and then continue with a different file

lapis sequoia Sep 5, 2024, 10:08 AM

#

odd stratus during the preprocessing stage of loading the file for use in training etc. when...

So here’s what I want to do:
I have a bunch of images of cans, I want to segment just the can and embed that image for similarity search later so if someone uploads a can it’ll find the exact brand etc.
I was thinking yolo to draw the bounding box around the can (some images don’t have cans at all), then SAM to segment it.
Does this approach make sense? Or is there a better way

dusky pagoda Sep 5, 2024, 10:16 AM

#

that relu function looks weird, usually you would implement it as np.max(x, 0)

odd stratus Sep 5, 2024, 10:18 AM

#

dusky pagoda that relu function looks weird, usually you would implement it as `np.max(x, 0)`

pithink arent they effectively the same thing? or is it doing some vectorisation shenanigans im missing?

dusky pagoda Sep 5, 2024, 10:19 AM

#

x % 1 is doing x mod 1 (remainder when x is divided by 1)

#

which looks like this

dusky pagoda Sep 5, 2024, 10:21 AM

#

dusky pagoda that relu function looks weird, usually you would implement it as `np.max(x, 0)`

you can also write it as np.where(x > 0, x, 0) but that's a bit slower afaik

serene grail Sep 5, 2024, 10:22 AM

#

dusky pagoda which looks like this

That's so weird, I had to think about that for a little bit

odd stratus Sep 5, 2024, 10:23 AM

#

dusky pagoda `x % 1` is doing x mod 1 (remainder when x is divided by 1)

i forgot i was messing with the mod function during the day lol

#

def sigmoid(x):
    return 1 / (1 + np.exp(-x))
def relu(x):
    '''
    if x>0:
        return x
    return 0
    '''
    return np.where(x > 0, x, 0)
    #'''
def leaky_relu(x):
    '''
    if x>0:
        return x
    return 0
    '''
    return np.where(x > 0, x, x*0.5)
    #'''
def activationfunction(x):
    f = 0
    if f==0:
        return(sigmoid(x))
    elif f==1:
        return(relu(x))
    elif f ==2:
        return(leaky_relu(x))
def sigmoid_derivative(x):
    return x * (1 - x)
def relu_derivative(x):
    '''
    if x>0:
        return 1
    return 0
    '''
    return np.where(x > 0, 1, 0)
    #'''
def leaky_relu_derivative(x):
    '''
    if x>0:
        return 1
    return 0.5
    '''
    return np.where(x > 0, 1, 0.5)
    #'''

this is what i was using before hand

dusky pagoda Sep 5, 2024, 10:25 AM

#

Ok, that makes a bit more sense

odd stratus Sep 5, 2024, 10:25 AM

#

and then it just loops constanly outputting [ 0, 0] with a loss value of 0.5

dusky pagoda Sep 5, 2024, 10:26 AM

#

can you check the distribution of all the weights during training?

odd stratus Sep 5, 2024, 10:27 AM

#

not currently

dusky pagoda Sep 5, 2024, 10:27 AM

#

maybe a boxplot of them using matplotlib

#

or just calculate and print the min/max/mean/stddev

odd stratus Sep 5, 2024, 10:30 AM

#

https://paste.pythondiscord.com/EXIQ

heres the initialisation weights and biases

dusky pagoda Sep 5, 2024, 10:31 AM

#

hmm, what are all those 1's at the end?

odd stratus Sep 5, 2024, 10:31 AM

#

oh wait i got it to print the data during runtime and everything immediately gets set to NaN for some reason

odd stratus Sep 5, 2024, 10:31 AM

#

dusky pagoda hmm, what are all those 1's at the end?

the biases

dusky pagoda Sep 5, 2024, 10:32 AM

#

oh, is that standard practice?

odd stratus Sep 5, 2024, 10:32 AM

#

idk, its jsut what i do, works fine for sigmoid

dusky pagoda Sep 5, 2024, 10:33 AM

#

We can check back with it once the NaNs are gone I guess

odd stratus Sep 5, 2024, 10:34 AM

#

i set the biases to zero
cause i had them as zero before
and the NaNs are gone

#

oh wait i had it running sigmoid nevermind

dusky pagoda Sep 5, 2024, 10:36 AM

#

(L106) I think it's because it's dividing by zero here ```py
derivativeA = -(target / activations[-1]) + (1 - target) / (1 - activations[-1])

#

since it's really common for activation to be 0

#

I'm not sure how one would fix that though

odd stratus Sep 5, 2024, 10:43 AM

#

pithink well thats kinda silly

dusky pagoda Sep 5, 2024, 10:43 AM

#

Let me refresh my memory on backprop real quick

full furnace Sep 5, 2024, 10:44 AM

#

dusky pagoda Let me refresh my memory on backprop real quick

Make a Ann from scratch

dusky pagoda Sep 5, 2024, 11:07 AM

#

# AA = previous layer, A = current layer
# W = weights, B = biases
# dX_dY = del X / del Y
def backprop_layer(W, B, Z, A, dC_dA):
    # Z = WA + B
    dZ_dW = AA; dZ_dAA = W; dZ_dB = 1
    # A = activation(Z)
    dA_dZ = activation_derivative(Z)

    dC_dW = dC_dA * dA_dZ * dZ_dW
    dC_dAA = dC_dA * dA_dZ * dZ_dAA
    dC_dB = dC_dA * dA_dZ * dZ_dB
``` I think this was the gist of it?

#

@odd stratus how did you come up with the formula in your code?

odd stratus Sep 5, 2024, 11:28 AM

#

dusky pagoda <@847392618564026368> how did you come up with the formula in your code?

mish mashing a bunch of stuff from youtube cause the math was bonkers to try understand the first try lmao

odd stratus Sep 5, 2024, 11:33 AM

#

dusky pagoda <@847392618564026368> how did you come up with the formula in your code?

also wdym formulat?
is that a typo or smthn

dusky pagoda Sep 5, 2024, 11:33 AM

#

oh yeah that is a typo for formula

#

my bad

dusky pagoda Sep 5, 2024, 11:35 AM

#

odd stratus mish mashing a bunch of stuff from youtube cause the math was bonkers to try und...

I would approach it by implementing backprop for one layer then taking it backwards till the first layer

dusky pagoda Sep 5, 2024, 11:38 AM

#

dusky pagoda ```py # AA = previous layer, A = current layer # W = weights, B = biases # dX_dY...

I think this is pretty close, except for some matrix multiplications here and there

odd stratus Sep 5, 2024, 11:46 AM

#

the main problem was trying to make it so that it can scale to have any layer sizes and depth like i wanted

dusky pagoda Sep 5, 2024, 11:53 AM

#

odd stratus the main problem was trying to make it so that it can scale to have any layer si...

once you implement it for one layer, you can call it in a loop to make it general

verbal oar Sep 5, 2024, 12:01 PM

#

hmm this is just chain rule

dusky pagoda Sep 5, 2024, 12:02 PM

#

Yes, backprop is essentially backwards chain rule: https://www.3blue1brown.com/lessons/backpropagation-calculus

3Blue1Brown - Backpropagation calculus

The math of backpropagation, the algorithm by which neural networks learn.

verbal oar Sep 5, 2024, 12:03 PM

#

to go from D to A you go to C,B like in graph

#

where D is end A is start

#

so DC, CB, BA is DA

#

there is reference in grokking machine learning about these multiplying of partials etc

#

Appendix B Math behind gradient descent

verbal oar Sep 5, 2024, 1:00 PM

#

yes this is just calculating partials and substituting and multiplying

past bramble Sep 5, 2024, 2:14 PM

#

i don't like kaggle notebooks

#

my 3 hours of gpu "memory error"

#

i had saved checkpoints but after reloading they weren't there

tiny bluff Sep 5, 2024, 3:04 PM

#

hi

tired lodge Sep 5, 2024, 5:23 PM

#

how would i train an AI to speak like a friend of mine? he gracefully supplied me with some of his writings (hes a literature nerd) and i thought it would be funny to train an AI that could imitate his works

unkempt apex Sep 5, 2024, 5:27 PM

#

tired lodge how would i train an AI to speak like a friend of mine? he gracefully supplied m...

train from scratch??
( need to learn more then )

OR

use pre-trained models!

tired lodge Sep 5, 2024, 5:32 PM

#

unkempt apex train from scratch?? ( need to learn more then ) OR use pre-trained models!

pre-trained sounds like a good idea. i have like 20,000 words to train it on

unkempt apex Sep 5, 2024, 5:33 PM

#

tired lodge pre-trained sounds like a good idea. i have like 20,000 words to train it on

20000 words??
I guess it should be context related for you right?

#

then only try a simple text model and train for your context

tired lodge Sep 5, 2024, 5:33 PM

#

unkempt apex 20000 words?? I guess it should be context related for you right?

what does that mean

unkempt apex Sep 5, 2024, 5:35 PM

#

tired lodge what does that mean

like for example, suppose I am training a model which will act as my resume chatbot, so like if you ask it about my self, my skilss, it will give me that info

#

this is consider as "context" to make personalised

tired lodge Sep 5, 2024, 5:37 PM

#

unkempt apex this is consider as "context" to make personalised

ah ok i get it now

#

how and where do i find one of those?

unkempt apex Sep 5, 2024, 5:37 PM

#

ahh, search that

#

or if you get more confused share here, so that others can also help you

#

about that particular model

rich moth Sep 5, 2024, 5:55 PM

#

unkempt apex ahh, search that

whats up @unkempt apex ? Long time no see. What have you been working on these da ys?

unkempt apex Sep 5, 2024, 5:56 PM

#

rich moth whats up <@842272827393441854> ? Long time no see. What have you been working ...

done, road extraction from satellite images

#

#

you said, we will do something together?, why you were not online these days?

#

@rich moth ???

tribal meteor Sep 5, 2024, 6:27 PM

#

Learning AI in University, anyone have a good youtube channel for learning fundamentals?

#

Currently learning efficient tree / graph searches. Using pruning and cost eval functions.

#

Working on stuff like game theory, min-max, alpha pruning, ect. So like basic basics

left tartan Sep 5, 2024, 6:41 PM

#

tribal meteor Working on stuff like game theory, min-max, alpha pruning, ect. So like basic ba...

basic basics? Check out CS50 for AI

tribal meteor Sep 5, 2024, 6:42 PM

#

left tartan basic basics? Check out CS50 for AI

Like a Senior in college who is taking his first ai courses. Taken lots of theory and algorithms classes, but never really worked / developed in AI. Ty, Will Check it out

left tartan Sep 5, 2024, 6:43 PM

#

tribal meteor Like a Senior in college who is taking his first ai courses. Taken lots of theor...

https://www.youtube.com/playlist?list=PLZHQObOWTQDNU6R1_67000Dx_ZCJB-3pi is also a nice primer

deep sparrow Sep 5, 2024, 7:47 PM

#

what knowledge is needed to understand this

gaunt wren Sep 5, 2024, 8:27 PM

#

Is 85-15 class balance in a binary classification problem bad enough for logreg to predict all 0s?

#

if so, how would i solve this?

lapis sequoia Sep 5, 2024, 8:30 PM

#

gaunt wren if so, how would i solve this?

Ubalanced data only affects the model intercept. If you can guess, or god tells you, you can apply the correction to the intercept

#

Also look at the ROC to help w/ thresholding

gaunt wren Sep 5, 2024, 8:33 PM

#

so, I should just pick out a balanced sample and use that for training

#

or at least that'd be the easiest way

lapis sequoia Sep 5, 2024, 8:35 PM

#

There's no need. You could use an ensemble method if you want

gaunt wren Sep 5, 2024, 8:36 PM

#

such as RFC?

lapis sequoia Sep 5, 2024, 8:38 PM

#

xgboost and if you want to explore it more, you can look at using different weights for each class

#

that assumes your goal is the highest prediction accuracy, the model will pretty much be a black box

gaunt wren Sep 5, 2024, 8:41 PM

#

Im just trying to explore a few different algorithms

lapis sequoia Sep 5, 2024, 8:42 PM

#

gaunt wren Im just trying to explore a few different algorithms

xgboost is very commonly used in industry

gaunt wren Sep 5, 2024, 8:42 PM

#

and trying to understand why some dont perform that well, such as log reg. My first assumption was the class imbalance

lapis sequoia Sep 5, 2024, 8:43 PM

#

gaunt wren and trying to understand why some dont perform that well, such as log reg. My fi...

there could be other reasons such as not having a linear correlator with the features

#

you could look at something like decision trees as well

gaunt wren Sep 5, 2024, 8:44 PM

#

would tf be worth trying as well?

lapis sequoia Sep 5, 2024, 8:44 PM

#

tensorflow doesn't mean anything

rich moth Sep 5, 2024, 9:01 PM

#

unkempt apex <@204385862081970178> ???

We took a big vacation and I just focused on some other projects around the house. Oh, I got Baldur's gate 3 with some work buddies, that took over a month of my life. Ya, I did! Always looking to work on something, I recently started tinkering again with that capture the flag game using pygame and ML to train ai agents using q-learning and some other stuff. Also still messing around with the AI model that can learn and generate images using captions

unkempt apex Sep 5, 2024, 9:02 PM

#

rich moth We took a big vacation and I just focused on some other projects around the hous...

did you worked on that game? on what part?

rich moth Sep 5, 2024, 9:04 PM

#

unkempt apex did you worked on that game? on what part?

Ya, I made a lot of changes still debugging it though. I caught up training this model havennt had a channce to mess with since monday.

rich moth Sep 5, 2024, 9:07 PM

#

unkempt apex did you worked on that game? on what part?

The players are suppose to learn from the environment now also interact with obstacles and colab with teammates

storm valve Sep 6, 2024, 2:21 AM

#

if anyone is familiar with transformer.pipeline, is there a way to natively map a pipeline over multiple inputs?

#

using a threadpool works quite well, but i'm wondering if there isn't already a built in way


    with ThreadPoolExecutor() as executor:
        results = executor.map(model_pipeline, list_of_strings)```

#

from transformers import pipeline

model_pipeline = pipeline(
    "text-classification", model="model"
)

with ThreadPoolExecutor() as executor:
    results = executor.map(model_pipeline, list_of_strings)
``` better than this i mean

#

oh, i can just pass the list to the pipeline it looks like

scenic parcel Sep 6, 2024, 3:56 AM

#

anyone use darts for time series forecasting

verbal venture Sep 6, 2024, 4:11 AM

#

is Q * K * V the final answer to Q?

#

like it's the best possible answer to Q?

tawdry monolith Sep 6, 2024, 5:27 AM

#

Is it normal to forgot parameter and functions?

quaint rivet Sep 6, 2024, 5:44 AM

#

has anyone worked with labelbox? I'm trying to export my annotated image. I don't want in export in json format. I want mask image

scarlet anchor Sep 6, 2024, 5:46 AM

#

Hey, how can i usea set of multiple CSV Files into my training dataset for feeding into my LSTM network?
Or in other words, I want to use multiple CSV Files as training data for LSTM. How can i do it?

I do not want to concatenate all the CSV Files

rich moth Sep 6, 2024, 6:22 AM

#

scarlet anchor Hey, how can i usea set of multiple CSV Files into my training dataset for feedi...

Are you using hugging face for the CSV files or are they already in a directory?

rich moth Sep 6, 2024, 6:25 AM

#

unkempt apex did you worked on that game? on what part?

When you got time to take a look at it we can build a github page or something together.

scarlet anchor Sep 6, 2024, 6:25 AM

#

rich moth Are you using hugging face for the CSV files or are they already in a directory?

they are already in a directory

quaint rivet Sep 6, 2024, 6:27 AM

#

which tools should i use create mutli class segemenation dataset?

rich moth Sep 6, 2024, 6:27 AM

#

scarlet anchor they are already in a directory

You can create something with torch.utils.data . Check out Dataset and DataLoader.

scarlet anchor Sep 6, 2024, 6:28 AM

#

rich moth You can create something with torch.utils.data . Check out Dataset and DataLoad...

Thanks

#

prolly create a custom data loader?

rich moth Sep 6, 2024, 6:29 AM

#

quaint rivet which tools should i use create mutli class segemenation dataset?

What are the specifics for your project in terms of multi-class segmentation?

rich moth Sep 6, 2024, 6:29 AM

#

scarlet anchor Thanks

That's what I'd would look into.

quaint rivet Sep 6, 2024, 6:30 AM

#

rich moth What are the specifics for your project in terms of multi-class segmentation?

my project is image segmentation and i'm using unet for it. I have more than one features in image. So, i'm looking for some tools to create dataset

#

i have tried labelbox,apeer etc

#

but none of giving me desired result

scarlet anchor Sep 6, 2024, 6:31 AM

#

rich moth That's what I'd would look into.

thank you 🙂

rich moth Sep 6, 2024, 6:32 AM

#

quaint rivet my project is image segmentation and i'm using unet for it. I have more than one...

Torch, numpy, labellmg and check out opencv

#

Try labellmg

rich moth Sep 6, 2024, 6:33 AM

#

quaint rivet my project is image segmentation and i'm using unet for it. I have more than one...

not really sure

quaint rivet Sep 6, 2024, 6:33 AM

#

rich moth Try labellmg

ig i have tried labelimg

rich moth Sep 6, 2024, 6:33 AM

#

quaint rivet ig i have tried labelimg

hmm... vgg image annotator?

quaint rivet Sep 6, 2024, 6:35 AM

#

rich moth hmm... vgg image annotator?

yeah i have tried. I think i have to go through long process. If that's case. VGG annotator will give me image coco json format after that i have to convert it in mask.

odd stratus Sep 6, 2024, 6:44 AM

#

dusky pagoda ```py # AA = previous layer, A = current layer # W = weights, B = biases # dX_dY...

im still getting a bit lost here tbh
i kinda suck at backpropogation
i understood the forward propogation lmao

ionic valley Sep 6, 2024, 6:47 AM

#

is Leetcode still relevant for DS/ML/AI or is that mostly asked for SDE roles? I’d like to know if I’m wasting my time grinding LC

rich moth Sep 6, 2024, 6:49 AM

#

quaint rivet yeah i have tried. I think i have to go through long process. If that's case. VG...

can you use another tool to convert the mask format into something that works for training a unet model?

#

pycoocotools? not sure : \

ionic valley Sep 6, 2024, 6:50 AM

#

ionic valley is Leetcode still relevant for DS/ML/AI or is that mostly asked for SDE roles? I...

mostly just asking for internships

quaint rivet Sep 6, 2024, 6:50 AM

#

rich moth can you use another tool to convert the mask format into something that works fo...

which tools u would recommend me?

quaint rivet Sep 6, 2024, 6:50 AM

#

rich moth pycoocotools? not sure : \

ok

scarlet anchor Sep 6, 2024, 6:55 AM

#

ionic valley is Leetcode still relevant for DS/ML/AI or is that mostly asked for SDE roles? I...

They are essential for getting the job. LC is absolutely useless after getting into the job

rich moth Sep 6, 2024, 6:58 AM

#

unkempt apex done, road extraction from satellite images

Im interested in hearing more about what you've been working on.

#

Has anyone seen @Lisan Al Gaib

tawdry gyro Sep 6, 2024, 11:16 AM

#

Does anyone knows what is that? It imports my libraries but I am scared to not crash when I have astronomy Olympiad with computers in a week.

verbal oar Sep 6, 2024, 11:40 AM

#

read message

split olive Sep 6, 2024, 11:44 AM

#

We'll call C the quadratic cost function; it's also sometimes known as the mean squared error or just MSE.

I'm confused. Both of them are MSE but different?

#

nvm i got it

quaint rivet Sep 6, 2024, 11:57 AM

#

rich moth can you use another tool to convert the mask format into something that works fo...

finally i'm able to achieve it. I don't know how much it's correct. Let's see. Thanks for your help

ionic valley Sep 6, 2024, 12:51 PM

#

scarlet anchor They are essential for getting the job. LC is absolutely useless after getting i...

thanks

unkempt apex Sep 6, 2024, 1:34 PM

#

rich moth Has anyone seen @Lisan Al Gaib

hee left!

scarlet anchor Sep 6, 2024, 2:21 PM

#

anyone knows a good library or downloadable model that I can use in python for converting speech to text?

serene scaffold Sep 6, 2024, 2:22 PM

#

scarlet anchor anyone knows a good library or downloadable model that I can use in python for c...

whisper is probably the best one, but you need a GPU to use it.

scarlet anchor Sep 6, 2024, 2:24 PM

#

serene scaffold whisper is probably the best one, but you need a GPU to use it.

yea and also that I don't wanna leverage open ai 😭

#

thanks

agile cobalt Sep 6, 2024, 2:24 PM

#

scarlet anchor yea and also that I don't wanna leverage open ai 😭

it is open source and completely local, I wouldn't even consider it leveraging open ai?

#

(also not sure if I'd consider openai significantly worse than amazon, meta, google etc. - you should leverage open source as much as you can regardless of it source imo)

scarlet anchor Sep 6, 2024, 2:26 PM

#

okk

scarlet anchor Sep 6, 2024, 2:32 PM

#

serene scaffold whisper is probably the best one, but you need a GPU to use it.

Can whisper work offline?

serene scaffold Sep 6, 2024, 2:32 PM

#

scarlet anchor Can whisper work offline?

if it's on your computer, then yes

scarlet anchor Sep 6, 2024, 2:33 PM

#

agile cobalt (also not sure if I'd consider openai significantly worse than amazon, meta, goo...

true

agile cobalt Sep 6, 2024, 2:34 PM

#

scarlet anchor Can whisper work offline?

You could use it via an API, in which case you don't need of a GPU nor have to download model weights or run anything resource intensive yourself, or you can download and run it locally.

If you download and run it yourself, you do not rely on any online services (after downloading everything) at all

scarlet anchor Sep 6, 2024, 2:36 PM

#

Thanks @agile cobalt @serene scaffold

tiny bluff Sep 6, 2024, 2:40 PM

#

hi, do you have a roadmap for machine learning

#

?

tepid tartan Sep 6, 2024, 4:09 PM

#

deep sparrow what knowledge is needed to understand this

???

tiny bluff Sep 6, 2024, 4:13 PM

#

tepid tartan ???

actually it is for beginners

#

you need nothing for understand this

tepid tartan Sep 6, 2024, 4:18 PM

#

I'm actually a beginner

tiny bluff Sep 6, 2024, 4:18 PM

#

you can watch this video without anything

#

this video teaches ever common details for you

tepid tartan Sep 6, 2024, 4:19 PM

#

Recommended me something

tiny bluff Sep 6, 2024, 4:20 PM

#

i learn python basics before 2-3 years and i would like to reverse the python topics and learn machine learning like a proffesional

#

and i search and find a roadmap for ml

#

and i follow steps which are in the ml roadmap i find

#

i find roadmap at this channel

tepid tartan Sep 6, 2024, 4:25 PM

#

@tiny bluff I'm trying the basic understanding with stats and SQL first before touching python

tiny bluff Sep 6, 2024, 4:26 PM

#

it is okey

#

it is your choice

sour zodiac Sep 6, 2024, 4:29 PM

#

is there any1 who is familiar with qlearning that could help me in how to pick my alpha, gamma, epsilon and epsilon decay? Im not sure how to determine what values they should be

granite nymph Sep 6, 2024, 4:40 PM

#

Hi guys, what topics are typically required for ML interns to be confident with it

agile cobalt Sep 6, 2024, 4:43 PM

#

look up positions you would apply to and see what they're asking.

you'll probably want at least some statistics, linear algebra and basic numpy syntax/usage though

tepid tartan Sep 6, 2024, 4:46 PM

#

tiny bluff i find roadmap at this channel

What else that mosh does?

tiny bluff Sep 6, 2024, 5:09 PM

#

i dont know actually i deal with only machine learning however you can search

real whale Sep 6, 2024, 6:13 PM

#

Hello

This is a very basic question but I am still in the earlier stages of wrapping my head around the relevant details.

I'm a soon to be second year AI and Datasci student engaged in the RSNA 2024 Lumbar Spine Degenerative Classification purely for the learning curves.

https://www.kaggle.com/competitions/rsna-2024-lumbar-spine-degenerative-classification

A peer of mine, perhaps correctly, says that we have to split the images into training, test and validation classifications. He wants to do this using code that randomly selects images and puts them into any one of the 3 categories.

However the competition already presents testing and training datasets with, I'm sure I remember correctly but couldn't find the documentation that details it, a final unseen set of images that it performs the classification on so as to determine the effectiveness of the model.
Also nowhere in the EfficientNet sample can I see anything that does that classification.

https://www.kaggle.com/code/charlesexiaviour/rsna-efficientnet-starter-notebook

I think I am right here in that in terms of testing and validation the images are already classified and it's only through a dictionary that some of the images need the conditions and plains added to them.

Thanks for any and all help, any clarification will help a great deal.

RSNA 2024 Lumbar Spine Degenerative Classification

Classify lumbar spine degenerative conditions

deep sparrow Sep 6, 2024, 8:06 PM

#

is anyone up to challenge to code some sort of algorithm that analyses students requirements (14 students for now) and creates schedule (monday - friday, time 13:00 - 21:00 with 15 minutes break.) i can send you chart with the information from the students (with false names, only time will be correct nothing else)

odd meteor Sep 6, 2024, 9:16 PM

#

granite nymph Hi guys, what topics are typically required for ML interns to be confident with ...

Adding to what Etrotta said...

I remembered gathering the job description of about 9 companies I wanted to intern for, then I used a spreadsheet to track the common skills mentioned by those companies.

This gave me a clear idea on what my area of weakness was and what I needed to further improve on.

agile anvil Sep 6, 2024, 10:02 PM

#

Is there any way to parse this which degrades gracefully under morphisms? https://www.partnersincareoahu.org/vacancy-grid-2024

PARTNERS IN CARE

VACANCY GRID 2024 — PARTNERS IN CARE

agile anvil Sep 6, 2024, 10:28 PM

#

What if "but what about the poor AIs" is merely a sophisticated metaphor for "but what about the middle class"? https://www.marktechpost.com/2024/08/21/megaagent-a-practical-ai-framework-designed-for-autonomous-cooperation-in-large-scale-llm-agent-systems

MarkTechPost

MegaAgent: A Practical AI Framework Designed for Autonomous Coopera...

Large Language Models (LLMs) have advanced rapidly, becoming powerful tools for complex planning and cognitive tasks. This progress has spurred the development of LLM-powered multi-agent systems (LLM-MA systems), which aim to simulate and solve real-world problems through coordinated agent cooperation. These systems can be applied to various sce...

untold fable Sep 7, 2024, 4:03 AM

#

What's the difference between skit - learn and other machine learning library

agile cobalt Sep 7, 2024, 4:12 AM

#

scikit-learn helps you to train, evaluate and run inference using a bunch of 'traditional' ML models such as linear regression, decision trees, and random forests

pytorch / tensorflow / keras are focused specifically on Neural Networks, though they support a lot of different architectures for them

#

there are a few dozens of others somewhat popular libraries you'll see, and hundreds of niche libraries

e.g. numpy can be used for nearly any operation involving multi dimensional arrays (vectors / matrixes / so on), jax is similar to numpy but includes automatic differentiation, transformers & diffusers are focused specifically on running inference for popular models, and there's a lot of libraries that are just wrappers on top of others

#

they also have varying levels of support for runnings things in the CPU vs GPU, but I'm not gonna go into detail about that

late lichen Sep 7, 2024, 4:24 AM

#

i want to improve the old code i made its a simplified NEAT (on my bio) and i have no idea how to do it someone please assist me

tepid tartan Sep 7, 2024, 5:52 AM

#

Find a roadmap with actual videos and lessons, including projects. @tiny bluff @spare forum

jaunty helm Sep 7, 2024, 6:52 AM

#

roadmap.sh

tepid tartan Sep 7, 2024, 6:57 AM

#

jaunty helm roadmap.sh

Is that better roadmap?

spare forum Sep 7, 2024, 8:21 AM

#

Just don't be afraid to start tbh there is not an absolute roadmap ressources etc... Every time I've spent time searching roadmaps and shi nothing ended up done, everytime I applied freestyle learning I did projects etc... And learned the most

muted plume Sep 7, 2024, 9:14 AM

#

anyone have good sources to learn order precedence?

#

#

we got given this but i have no clue what this is trying to say

#

i assume down the list = order

#

but is there a reason bitwise not is higher up then the others?

dreamy isle Sep 7, 2024, 9:39 AM

#

muted plume we got given this but i have no clue what this is trying to say

precedence is highest topmost, lower precedence as it goes downwards

#

each column tells you what that operation applies on

muted plume Sep 7, 2024, 9:40 AM

#

so bitwise and, happens before things like logic operaters?

spring field Sep 7, 2024, 11:17 AM

#

tepid tartan Is that better roadmap?

no, roadmaps in general are pretty bad

#

the best way to learn is by doing projects

spare forum Sep 7, 2024, 11:33 AM

#

Tbh everytime ppl search for roadmaps for weeks and end up doing very little

verbal oar Sep 7, 2024, 1:07 PM

#

what variational means, I relate it with probability and some prior is it good thinking?

versed bough Sep 7, 2024, 1:42 PM

#

deep sparrow is anyone up to challenge to code some sort of algorithm that analyses students ...

Sounds like someone wants their homework done for them

mystic ruin Sep 7, 2024, 1:48 PM

#

So, basically I trying to install intel-extension-for-pytorch, But I'm encountering huge errors, Full log: https://paste.pythondiscord.com/JBBQ.
Any solutions?

mild dirge Sep 7, 2024, 2:19 PM

#

mystic ruin So, basically I trying to install `intel-extension-for-pytorch`, But I'm encount...

You do not have a C++ compiler it seems

#

Did you install G++ or some other compiler? @mystic ruin

mystic ruin Sep 7, 2024, 2:22 PM

#

mild dirge Did you install G++ or some other compiler? <@1026388699203772477>

I think I don't have

mild dirge Sep 7, 2024, 2:23 PM

#

do that 😛

verbal oar Sep 7, 2024, 3:00 PM

#

or just look in some glossary?

strange oriole Sep 7, 2024, 3:49 PM

#

hi

proven inlet Sep 7, 2024, 4:37 PM

#

How can i make gpt2 model to generate questions from answers? I have list of text messages and random conversations, I'm trying to convert them to Q-A type

#

Type your answer: The capital of france is paris.
Generated Question: Given the following statement, generate a relevant question: 'The capital of france is paris.'.

"If you use the word paris, you may get a similar answer. The word is a synonym for 'posterior, adverbial, pungent, repugnant, distressing,
objectionable'. But if you use the word adverbial, adverbial, pungent, repugnant, distressing, objectionable, you will get the same

#

prompt = f"Given the following statement, generate a relevant question: '{input_text}'."

#

what am i doing wrong??

#

Expected output: What is the capital of france?

serene scaffold Sep 7, 2024, 4:54 PM

#

@proven inlet gpt2 isn't instruction-following like ChatGPT is

proven inlet Sep 7, 2024, 4:54 PM

#

serene scaffold <@1018096765225938985> gpt2 isn't instruction-following like ChatGPT is

But doesn't chatgpt use gpt in it?

serene scaffold Sep 7, 2024, 4:55 PM

#

It just keeps generating text that's probable to follow whatever you pass to it

serene scaffold Sep 7, 2024, 4:55 PM

#

proven inlet But doesn't chatgpt use gpt in it?

ChatGPT is a gpt model that's tuned to be interactive and instruction following

proven inlet Sep 7, 2024, 4:55 PM

#

oh

#

How can i tune a gpt model to chatbot with texts but not Q-A types?

#

chatgpt used text mostly to train afaik

scarlet anchor Sep 7, 2024, 4:57 PM

#

For a time series prediction which model would be more ideal? other than LSTM

serene scaffold Sep 7, 2024, 4:57 PM

#

proven inlet chatgpt used text mostly to train afaik

Gpt is trained entirely on text. There is nothing that has any meaning to gpt except text.

serene scaffold Sep 7, 2024, 4:58 PM

#

proven inlet How can i tune a gpt model to chatbot with texts but not Q-A types?

You'd probably have an easier time with a "small" language model like mixtral 7b

proven inlet Sep 7, 2024, 4:58 PM

#

serene scaffold Gpt is trained entirely on text. There is nothing that has any meaning to gpt ex...

Yes gpt is a LLM

serene scaffold Sep 7, 2024, 4:58 PM

#

proven inlet Yes gpt is a LLM

I know that.

proven inlet Sep 7, 2024, 4:59 PM

#

serene scaffold You'd probably have an easier time with a "small" language model like mixtral 7b

im actually trying to generate chatbot so small language model would not be enough i guess

serene scaffold Sep 7, 2024, 4:59 PM

#

serene scaffold You'd probably have an easier time with a "small" language model like mixtral 7b

Mixtral is better at instruction following than non-chat gpt models

#

It's still a "large" language model. But the L in LLM is meaningless now.

proven inlet Sep 7, 2024, 5:00 PM

#

can i finetune gpt2 to become a basic chatbot?

serene scaffold Sep 7, 2024, 5:01 PM

#

You don't have enough training data or time for that

proven inlet Sep 7, 2024, 5:01 PM

#

is 5k list of messages enough for that

serene scaffold Sep 7, 2024, 5:01 PM

#

Not even close.

proven inlet Sep 7, 2024, 5:02 PM

#

Oh.

serene scaffold Sep 7, 2024, 5:02 PM

#

The amount of training data and compute time required to create and tune these models is astronomical

#

That's why only large companies like meta are putting out LLMs. Everyone else is innovating by finding creative ways to prompt them.

proven inlet Sep 7, 2024, 5:03 PM

#

serene scaffold The amount of training data and compute time required to create and tune these m...

just of curiosity, could i train gpt2 with pure text but not Q-A type? With 1B diffirent texts eg

serene scaffold Sep 7, 2024, 5:03 PM

#

How many words?

proven inlet Sep 7, 2024, 5:03 PM

#

over 5B

serene scaffold Sep 7, 2024, 5:03 PM

#

And what would you be training it to do?

proven inlet Sep 7, 2024, 5:04 PM

#

Chatbot

serene scaffold Sep 7, 2024, 5:04 PM

#

So you'd be fine tuning it to produce text that follows a certain structure. Namely dialogue structure

#

Which is what ChatGPT is

#

You might be able to do it with that many words.

proven inlet Sep 7, 2024, 5:04 PM

#

But they don't have to be Q-A format or do they?

#

Like can i use training data for wikipedia and books

#

But not dialogues

serene scaffold Sep 7, 2024, 5:06 PM

#

If you train it on Wikipedia, it will generate content that's structured like a Wikipedia article

#

And it probably won't behave naturally if you ask it a question in a conversational way

proven inlet Sep 7, 2024, 5:07 PM

#

serene scaffold If you train it on Wikipedia, it will generate content that's structured like a ...

But it also won't be repeating me right? When i ask for what is the capital of France for example

#

will it continue my sentence?

#

if not, what makes it to not continue the sentence

serene scaffold Sep 7, 2024, 5:08 PM

#

If you prompt gpt2 with "the capital of France is", it will probably finish the sentence correctly.

proven inlet Sep 7, 2024, 5:08 PM

#

Yes but chatbots dont do that

#

im wondering how

serene scaffold Sep 7, 2024, 5:09 PM

#

You have to tune it on text that is structured as the kinds of interactions that you want to have with it

#

But you probably don't have enough data or compute time for that

#

So you should probably use an existing language model that is interactive, like mixtral

proven inlet Sep 7, 2024, 5:10 PM

#

Okay thanks, I'll use mixtral

river cape Sep 7, 2024, 5:29 PM

#

Guys I want to know whether aws provides any free services which can be used in ml?

spare forum Sep 7, 2024, 5:35 PM

#

Free trial with limited access, not like free forever

#

(AWS sagemaker)

#

gcp provide free credits for new accounts which is 300€ equivalent

river cape Sep 7, 2024, 5:37 PM

#

spare forum Free trial with limited access, not like free forever

How long is that access for?

spare forum Sep 7, 2024, 5:37 PM

#

I believe 1 year

river cape Sep 7, 2024, 5:37 PM

#

spare forum I believe 1 year

After that they charge?

spare forum Sep 7, 2024, 5:37 PM

#

Yes

river cape Sep 7, 2024, 5:37 PM

#

spare forum Yes

Have you tried azure?

spare forum Sep 7, 2024, 5:38 PM

#

Nope mainly aws, gcp and databricks (just for learning)

river cape Sep 7, 2024, 5:40 PM

#

spare forum Nope mainly aws, gcp and databricks (just for learning)

So for a year I can use it using one email and create another account with another email to get another year for free?

spare forum Sep 7, 2024, 5:45 PM

#

You still put credit card and shi so pbby not so easy, and the use is very bounded, pretty much it's only okay for side projects and learning

past bramble Sep 7, 2024, 6:07 PM

#

may I use tensorflow on windows on python 12

spring field Sep 7, 2024, 6:45 PM

#

you have my permission
also Python 12? firEyes

anyway, apparently on Windows the latest TF versions only work through WSL because sth sth they dropped Win support? not entirely sure, but sth along those lines
basically yes, but only through WSL

past bramble Sep 7, 2024, 6:46 PM

#

spring field you have my permission also Python 12? <:firEyes:785674652755689492> anyway, ap...

yeah it works only on python 11 not python 12 I'm not sure why

spring field Sep 7, 2024, 6:47 PM

#

I personally am only on Python 3

past bramble Sep 7, 2024, 7:03 PM

#

oh i skipped to the future

#

let's go back, python 3.11 and python 3.12

river cape Sep 7, 2024, 7:24 PM

#

Btw guys

#

Do i need to install tensorrt

#

I already have cuda and cudnn installed

verbal venture Sep 7, 2024, 8:54 PM

#

@wooden sail @iron basalt just want to confirm my understanding here is correct. The attention model does Q * K to update "words that represent each other". The model has no actual understanding of this. What it's doing is changing the weights so the Q * K (attention between each words) becomes better over time. This is simply a matter of running dot product on all words in the corpus numerous times to find a relationship between them. This relationship can somehow be captured by dot product attention, because that represents cosine similarity, but ultimately the reason the model can converge to this representation is because backprop will adjust the weights of the model to better create Q and K vectors. When the model makes a mistake, it will adjust the weights, do Q * K again, and the newest iteration of Q * K will be a slightly better "relationship" capture between words

clever sparrow Sep 7, 2024, 9:16 PM

#

past bramble may I use tensorflow on windows on python 12

you cant use cuda with python 3.12 on windows without wsl

#

use 3.11 if you dont want to deal with wsl

rich moth Sep 7, 2024, 11:39 PM

#

This is from my first epoch on the multi-modal learning system I've been working on, where I’m combining a VQ-VAE model for image reconstruction with feature aggregation using CLIP for text-image alignment, and BLIP for generating descriptive captions. So far the results seems promising

past bramble Sep 8, 2024, 3:41 AM

#

clever sparrow you cant use cuda with python 3.12 on windows without wsl

what is wsl?

rich moth Sep 8, 2024, 4:06 AM

#

windows subsystem linux

#

#

thats wsl2 running ubuntu

past bramble Sep 8, 2024, 4:21 AM

#

linux inside windows?

rich moth Sep 8, 2024, 4:34 AM

#

yup, its the bees knees

#

You on windows 11 ?

past bramble Sep 8, 2024, 4:49 AM

#

yup

rich moth Sep 8, 2024, 4:49 AM

#

Open the microsoft store and search for WSL

tepid tartan Sep 8, 2024, 4:59 AM

#

spare forum Tbh everytime ppl search for roadmaps for weeks and end up doing very little

I want to start somewhere before doing any projects. Need some knowledge

rich moth Sep 8, 2024, 5:06 AM

#

past bramble yup

its pretty east to install these days. let me know if you have any questios

past bramble Sep 8, 2024, 5:34 AM

#

rich moth its pretty east to install these days. let me know if you have any questios

is it heavyweight or requires a lot of set up? I'm planning to change my pc so if it is as I said I'll do when I get the new pc

unkempt apex Sep 8, 2024, 5:34 AM

#

rich moth This is from my first epoch on the multi-modal learning system I've been working...

Plunder is back!

rich moth Sep 8, 2024, 5:36 AM

#

past bramble is it heavyweight or requires a lot of set up? I'm planning to change my pc so i...

Not at all. You enable a few things like hyper-v I believe but after you enable of those options in windows you can install a bunch of different distro types

unkempt apex Sep 8, 2024, 5:36 AM

#

past bramble is it heavyweight or requires a lot of set up? I'm planning to change my pc so i...

why not to directly use Linux?

rich moth Sep 8, 2024, 5:37 AM

#

unkempt apex Plunder is back!

Whats up buddy! Just watching some tv, got my model training right now. Whare you you doing?

unkempt apex Sep 8, 2024, 5:37 AM

#

rich moth Whats up buddy! Just watching some tv, got my model training right now. Whare ...

( it's morning here ) just hop onto PC, now will learn about BERT, any suggestions?

rich moth Sep 8, 2024, 5:38 AM

#

past bramble is it heavyweight or requires a lot of set up? I'm planning to change my pc so i...

I think you can just even install WSL from the MS store and it enable it for you .

#

After that you can search for the distro and verison you want

rich moth Sep 8, 2024, 5:40 AM

#

unkempt apex ( it's morning here ) just hop onto PC, now will learn about BERT, any suggestio...

I messed around with a few BERT models .(https://huggingface.co/docs/transformers/model_doc/bert)

BERT

rich moth Sep 8, 2024, 6:00 AM

#

unkempt apex ( it's morning here ) just hop onto PC, now will learn about BERT, any suggestio...

What are you trying to do with it ?

past bramble Sep 8, 2024, 6:01 AM

#

rich moth I think you can just even install WSL from the MS store and it enable it for you...

sounds more easier, let me search

#

there's a lot of "Ubuntu" results, which one do I use

rich moth Sep 8, 2024, 6:02 AM

#

past bramble sounds more easier, let me search

surprising, huh?

past bramble Sep 8, 2024, 6:03 AM

#

wait it says "Ubuntu 22.04.3 LTS" is already installed

rich moth Sep 8, 2024, 6:04 AM

#

open a terminal and type wsl

past bramble Sep 8, 2024, 6:04 AM

#

guess it's already installed ```bash

wsl
To run a command as administrator (user "root"), use "sudo <command>".
See "man sudo_root" for details.

#

wonder how

unkempt apex Sep 8, 2024, 6:06 AM

#

rich moth What are you trying to do with it ?

just to learn!

wooden sail Sep 8, 2024, 6:06 AM

#

depending on how you installed wsl, it comes with ubuntu by default

rich moth Sep 8, 2024, 6:07 AM

#

cool beans! ya Im not sure how it got installed, but lemme know if you got any questions. It works great. Its nice having the option to do both in one place.

#

Oh that reminds me I was going to setup a plex server on my laptop.

past bramble Sep 8, 2024, 6:13 AM

#

I'm sure I was running into errors when I used tensorflow on python 3.12, which is why I installed 3.11

#

weird I tried running tensorflow on 3.12 venv now it isn't raising any errors now

#

this was the error I had some days ago:

#python-discussion message

past bramble Sep 8, 2024, 6:21 AM

#

rich moth cool beans! ya Im not sure how it got installed, but lemme know if you got any ...

thanks a lot helping me out!

#

i wanna show my new GAN I created (based on scenary images)

rich moth Sep 8, 2024, 6:24 AM

#

looks great.

unkempt apex Sep 8, 2024, 6:54 AM

#

past bramble i wanna show my new GAN I created (based on scenary images)

on which dataset u train this?

past bramble Sep 8, 2024, 7:02 AM

#

unkempt apex on which dataset u train this?

it's a competition dataset from kaggle, I don't remember exact name, it has pics of scenaries

deep sparrow Sep 8, 2024, 7:38 AM

#

versed bough Sounds like someone wants their homework done for them

nah its for non school things

untold cliff Sep 8, 2024, 10:47 AM

#

I was trying to make a c++ implementation of the BM25 information retrieval algorithm and make a wrapper to it using cython, and was comparing my results against those from this library https://github.com/dorianbrown/rank_bm25
Interestingly, for one of the variants, the BM25L variant, the results I got were different and after quite a bit of time of debugging, it turned out that if I copy the source code of the library and then run the tests I get the same results. I get different results only one I use it as a pip package and I was very curious about the reason for such behavior.

GitHub

GitHub - dorianbrown/rank_bm25: A Collection of BM25 Algorithms in ...

A Collection of BM25 Algorithms in Python. Contribute to dorianbrown/rank_bm25 development by creating an account on GitHub.

#

I turns out that, after inspecting the code of the package after pip installing it against the source code on github, that there was a small difference in the formula used. I don't know how pip packages are made so it is still a mystery to me how such an error happened, but yeah this seems to be the reason, unless someone here can shed more light about it.

faint quail Sep 8, 2024, 12:33 PM

#

I totally know what that means

wanton quiver Sep 8, 2024, 1:06 PM

#

hey @hot obsidian can tell me about what thing i have to learn in data science or have you source where i can learn it

jaunty helm Sep 8, 2024, 1:26 PM

#

wanton quiver hey <@1043617372432506941> can tell me about what thing i have to learn in data...

data science is very broad
there are resources in pinned

rigid timber Sep 8, 2024, 2:16 PM

#

are there any free inference options?

jaunty helm Sep 8, 2024, 2:22 PM

#

rigid timber are there any free inference options?

as in LLMs? run local; some on openrouter are also free if you want to try that

there are very 'small' models like gemma-2b, phi, minitron-4b, etc. that don't need that good of a GPU (the 3 mentioned above can all be comfortably ran by a 4gb vram card with quantization)
CPU inference is also an option if you're desperate, then you're not limited by the GPU, but CPU clock speed & ram & ram speed

past bramble Sep 8, 2024, 2:27 PM

#

are there any libraries to get text embeddings?

jaunty helm Sep 8, 2024, 2:30 PM

#

past bramble are there any libraries to get text embeddings?

I'd assume libraries that focus on inference would allow you to do that
so check ollama, transformers(huggingface) ig

past bramble Sep 8, 2024, 2:33 PM

#

jaunty helm I'd assume libraries that focus on inference would allow you to do that so check...

I can't find any docs for transformers module, do you know where to find them?

jaunty helm Sep 8, 2024, 2:34 PM

#

past bramble I can't find any docs for `transformers` module, do you know where to find them?

https://huggingface.co/docs/transformers/index

past bramble Sep 8, 2024, 2:35 PM

#

thanks!

pine escarp Sep 8, 2024, 2:41 PM

#

jaunty helm https://huggingface.co/docs/transformers/index

The transformers module is specifically for hugging face?

jaunty helm Sep 8, 2024, 2:43 PM

#

pine escarp The transformers module is specifically for hugging face?

it's maintained by huggingface (& community) and has easy integration with it

pine escarp Sep 8, 2024, 2:44 PM

#

jaunty helm it's maintained by huggingface (& community) and has easy integration with it

I see.

past bramble Sep 8, 2024, 2:45 PM

#

damn 800 floats for a single text

#

quite a big vector

#

I was planning to try making a small text model using embeddings and conversations data

jaunty helm Sep 8, 2024, 2:48 PM

#

past bramble I was planning to try making a small text model using embeddings and conversatio...

I mean each base model should have differing embedding sizes

jaunty helm Sep 8, 2024, 2:50 PM

#

past bramble I was planning to try making a small text model using embeddings and conversatio...

found this leaderboard https://huggingface.co/spaces/mteb/leaderboard
might be helpful to you

past bramble Sep 8, 2024, 2:53 PM

#

ohh cool

rigid timber Sep 8, 2024, 3:21 PM

#

jaunty helm as in LLMs? run local; some on openrouter are also free if you want to try that ...

both LLMs and image detection models, I tried to run them locally on my laptop but it’s just not good enough. Tried hugging face inference endpoints but it kept declining my card so Im looking for a free alternative

jaunty helm Sep 8, 2024, 3:25 PM

#

rigid timber both LLMs and image detection models, I tried to run them locally on my laptop b...

I mean you're not gonna get "good free" models so

#

why lend you compute for free when they can ask for a subscription / pay per token

rigid timber Sep 8, 2024, 3:28 PM

#

jaunty helm I mean you're not gonna get "good free" models so

precisely, how do you go about running some model with the help of flask. Like a simple input output type of web application for lets say an image detection model

jaunty helm Sep 8, 2024, 3:32 PM

#

rigid timber precisely, how do you go about running some model with the help of flask. Like a...

dunno, you'll have to ask someone more knowledgeable for specifics
but I don't imagine it to be too different from everything else

rigid timber Sep 8, 2024, 3:59 PM

#

jaunty helm dunno, you'll have to ask someone more knowledgeable for specifics but I don't i...

Thanks alot, i'll look for tutorials while Im at it

lilac lichen Sep 8, 2024, 4:51 PM

#

is there any recommendations to get a team to work with on any pet project and way to run projects not on PC?

scarlet anchor Sep 8, 2024, 5:08 PM

#

Where is federated learning actually used?

agile cobalt Sep 8, 2024, 5:13 PM

#

There are some projects like https://github.com/bigscience-workshop/petals, but idk how widely used they are in practice though

past meteor Sep 8, 2024, 5:16 PM

#

scarlet anchor Where is federated learning actually used?

I see it spoken of a lot in things like EHR records

#

I also saw a use case of a streaming service using it for their recommender system

scarlet anchor Sep 8, 2024, 5:20 PM

#

@agile cobalt I wanted something an application where hardware is used

agile cobalt Sep 8, 2024, 5:24 PM

#

scarlet anchor <@256442550683041793> I wanted something an application where hardware is used

what do you mean?

scarlet anchor Sep 8, 2024, 5:33 PM

#

agile cobalt what do you mean?

like for instance applying federated learning on an edge device for instance

#

like this

agile cobalt Sep 8, 2024, 5:36 PM

#

the amount of processing power micro controllers have is really low compared to GPUs... you'd need of thousands of them in order to match one GPU used in data centers, and the latency & amount of data you'd have to transfer between them makes it pretty inpractical

#

even running inference on micro controllers is already hard

#

you might be able to continuously fine-tune a small model in a micro controller, but I wouldn't expect to see anyone using them for federated training

serene grail Sep 8, 2024, 5:38 PM

#

scarlet anchor like this

To be fair, with those specs like 4GB RAM that doesn't really look like a microcontroller, that's a SBC, like a Raspberry Pi, for example

scarlet anchor Sep 8, 2024, 6:06 PM

#

serene grail To be fair, with those specs like 4GB RAM that doesn't really look like a microc...

its a microprocessor

rich moth Sep 8, 2024, 6:25 PM

#

This is my best run yet just on the first epoch. The colors and shapes actually look decent and a steady loss from all the components. This is my best verison so far.

worthy oasis Sep 8, 2024, 9:31 PM

#

someone please help me with some tutorial o good book to initiate on DataScience

rich moth Sep 8, 2024, 10:30 PM

#

scarlet anchor like for instance applying federated learning on an edge device for instance

this platform is perfect for what I’ve been working on. Ive developed a text-image multimodal model that’s just 60MB, its ideal for embedding and staying lightweight. It integrates CLIP for text-image alignment, BLIp for text generation, and Sentencetransformers for embeddings

graceful niche Sep 8, 2024, 11:00 PM

#

did anyone use WALDO? I'm having trouble finding the model files ( like they do not exist )
this WALDO btw https://github.com/stephansturges/WALDO

GitHub

GitHub - stephansturges/WALDO: Whereabouts Ascertainment for Low-ly...

Whereabouts Ascertainment for Low-lying Detectable Objects. The SOTA in FOSS AI for drones! - stephansturges/WALDO

faint quail Sep 9, 2024, 12:24 AM

#

spent months building a object detector neural network library from scratch to finally achieve this holy

sage sparrow Sep 9, 2024, 3:01 AM

#

Hi, what are the main issues people usually face with data scientists? From the client's side of things

#

I thought I'd do some research since I don't have enough data/experience about it myself

agile cobalt Sep 9, 2024, 3:02 AM

#

"client's side of things"?

sage sparrow Sep 9, 2024, 3:04 AM

#

The ones hiring/in need of the data scientists' services

agile cobalt Sep 9, 2024, 3:05 AM

#

wherever you'll look you'll find pretty biased views in multiple ways, but maybe try looking at some freelancing offers & some Kaggle compeitions

sage sparrow Sep 9, 2024, 3:11 AM

#

agile cobalt wherever you'll look you'll find pretty biased views in multiple ways, but maybe...

I see. Are you a data scientist?

scarlet anchor Sep 9, 2024, 4:16 AM

#

rich moth this platform is perfect for what I’ve been working on. Ive developed a text-ima...

Great to hear

storm valve Sep 9, 2024, 4:17 AM

#

any idea on where i can get a corpus of python-related words? for now i've resolved to extracting things from the source code directly like imports, function names, assignments but i would like more general stuff

unkempt apex Sep 9, 2024, 5:20 AM

#

faint quail spent months building a object detector neural network library from scratch to f...

opensource ??

past bramble Sep 9, 2024, 6:10 AM

#

for a starter I'm thinking of using single numbers to represent each word instead of vectors (text embeddings)

are there any existing algorithms to convert words to a number? I want to make my own encoder/decoder to go back and forth easily

#

first thing that hit me was using indices and ascii of each character, math operations on it to come up with unique numbers for each word

#

then it hit me there might be cases where it's not unique as well

past meteor Sep 9, 2024, 6:43 AM

#

storm valve any idea on where i can get a corpus of python-related words? for now i've resol...

Define python related?

storm valve Sep 9, 2024, 6:44 AM

#

past meteor Define python related?

well stuff that occurs in python or in the docs i guess

past meteor Sep 9, 2024, 6:45 AM

#

You could use AST to parse the stdlib and grab whatever you want?

#

But I think your question is: does such a corpus already exist

storm valve Sep 9, 2024, 6:45 AM

#

past meteor You could use AST to parse the stdlib and grab whatever you want?

i've already done something quite similar, but it doesn't quite grab a lot

storm valve Sep 9, 2024, 6:45 AM

#

past meteor But I think your question is: does such a corpus already exist

correct

#

my google fu fails there

past meteor Sep 9, 2024, 6:46 AM

#

My answer is, not that I know of. Maybe someone else can pitch in 😄

storm valve Sep 9, 2024, 6:47 AM

#

i've gone so far as processing the source code of programs i'm reading and building small corpuses of of them but still not quite enough sadly

past meteor Sep 9, 2024, 6:47 AM

#

What are you trying to do?

storm valve Sep 9, 2024, 6:48 AM

#

removing gibberish from LLM output

#

correct output contains a lot of python terms, so i also use the python corpus to filter out what's not gibberish

strong cove Sep 9, 2024, 7:19 AM

#

faint quail spent months building a object detector neural network library from scratch to f...

Well done

rich moth Sep 9, 2024, 7:38 AM

#

So this is from 10 epochs. Everything seems to be improving gradually. Its learning, but its slow going. I might need to play with the learning rates a bit more but i think Its gonna take a long time to train

past bramble Sep 9, 2024, 8:17 AM

#

rich moth So this is from 10 epochs. Everything seems to be improving gradually. Its lea...

impressive, it looks like it will reconstruct the same image for same prompts, is that expected?

verbal oar Sep 9, 2024, 9:01 AM

#

is vae from scratch hard to do?
I saw for example building in keras but its rather simple and its was not from scratch

#

now I'm reading an introduction to variational autoencoders from Kingma, Welling

odd stratus Sep 9, 2024, 9:05 AM

#

anyone have a large plain text file for LLM ?

past bramble Sep 9, 2024, 10:19 AM

#

odd stratus anyone have a large plain text file for LLM ?

containing? I have looked for a conversations dataset on kaggle, it's a csv btw

odd stratus Sep 9, 2024, 10:19 AM

#

past bramble containing? I have looked for a conversations dataset on kaggle, it's a csv btw

just a large amount of text is all

past bramble Sep 9, 2024, 10:32 AM

#

odd stratus just a large amount of text is all

web scrape bunch of Wikipedia and save it on text file

odd stratus Sep 9, 2024, 11:50 AM

#

i just copy pasted the lord of the rings lmaoo

verbal oar Sep 9, 2024, 12:43 PM

#

project gutenberg maybe, alice in wonderland etc dont sure

quaint mulch Sep 9, 2024, 1:30 PM

#

odd stratus anyone have a large plain text file for LLM ?

You can start with a list from wikipedia https://en.wikipedia.org/wiki/List_of_datasets_for_machine-learning_research#Internet

List of datasets for machine-learning research

These datasets are used in machine learning (ML) research and have been cited in peer-reviewed academic journals. Datasets are an integral part of the field of machine learning. Major advances in this field can result from advances in learning algorithms (such as deep learning), computer hardware, and, less-intuitively, the availability of high-...

#

This is publicly available https://pile.eleuther.ai/

The Pile

The Pile is a 825 GiB diverse, open source language modelling data set that consists of 22 smaller, high-quality datasets combined together.

#

is 825 GB large enough?

odd stratus Sep 9, 2024, 1:32 PM

#

quaint mulch You can start with a list from wikipedia https://en.wikipedia.org/wiki/List_of_d...

oh very cool

odd stratus Sep 9, 2024, 1:33 PM

#

quaint mulch is 825 GB large enough?

toooo large lmao, ive only got 20GB of storage on my computer lmao

quaint mulch Sep 9, 2024, 1:33 PM

#

odd stratus toooo large lmao, ive only got 20GB of storage on my computer lmao

Well, I guess you want a small one then? hahaha

mystic ruin Sep 9, 2024, 1:37 PM

#

I am trying to setup pytorch for my A770 GPU, I followed the docs, got this error when importing pytorch:

PS C:\kanemoto\vscode\llm> python .\main.py
Traceback (most recent call last):
  File "C:\kanemoto\vscode\llm\main.py", line 1, in <module>
    import torch
  File "C:\Users\kanemoto\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.11_qbz5n2kfra8p0\LocalCache\local-packages\Python311\site-packages\torch\__init__.py", line 139, in <module>
    raise err
OSError: [WinError 126] The specified module could not be found. Error loading "C:\Users\kanemoto\AppData\Local\Packages\PythonSoftwareFoundation.Python.3.11_qbz5n2kfra8p0\LocalCache\local-packages\Python311\site-packages\torch\lib\backend_with_compiler.dll" or one of its dependencies.```
The `backend_with_compiler.dll` exists in its path.
I have the latest Microsoft Visual C++ Redistributable installed.

Any idea?

odd stratus Sep 9, 2024, 1:46 PM

#

quaint mulch Well, I guess you want a small one then? hahaha

yeah, just large enough so that it can get a bunch of speech, but not too large that its gonna take a while or break the computer

past bramble Sep 9, 2024, 2:21 PM

#

quaint mulch is 825 GB large enough?

nah doesnt satisfy my hunger

#

@odd stratus u building an LLM?

odd stratus Sep 9, 2024, 2:22 PM

#

past bramble <@847392618564026368> u building an LLM?

yeah, im gonna try to

quaint mulch Sep 9, 2024, 2:27 PM

#

past bramble nah doesnt satisfy my hunger

How about this https://data.commoncrawl.org/crawl-data/CC-MAIN-2024-33/index.html ? Is it big enough?

quaint mulch Sep 9, 2024, 2:28 PM

#

mystic ruin I am trying to setup pytorch for my A770 GPU, I followed the [docs](https://inte...

how did you install pytorch? #packaging-and-distribution might also help

mystic ruin Sep 9, 2024, 2:31 PM

#

quaint mulch how did you install pytorch? <#1216841603080257616> might also help

I used this command to install the libraries:

python -m pip install torch==2.1.0.post3 torchvision==0.16.0.post3 torchaudio==2.1.0.post3 intel-extension-for-pytorch==2.1.40+xpu --extra-index-url https://pytorch-extension.intel.com/release-whl/stable/xpu/cn/```

#

(from the docs)

quaint mulch Sep 9, 2024, 2:35 PM

#

did you pass the sanity check?

indigo wing Sep 9, 2024, 3:07 PM

#

hey guys should I buy collab pro and cloud storage for training? Is it worth it?

scarlet anchor Sep 9, 2024, 3:13 PM

#

indigo wing hey guys should I buy collab pro and cloud storage for training? Is it worth it?

Depends on ur requirements

#

why do u need colab pro, in the first place?

past bramble Sep 9, 2024, 3:13 PM

#

quaint mulch How about this https://data.commoncrawl.org/crawl-data/CC-MAIN-2024-33/index.htm...

aha 69TB nice it's perfect

past bramble Sep 9, 2024, 3:14 PM

#

odd stratus yeah, im gonna try to

me too, are you gonna convert text to embeddings?

upbeat prism Sep 9, 2024, 3:22 PM

#

Hi, so I want to classify if a number between 1 and 100 is even or odd. Now I want to achieve that with the most simple MLP.

class SimpleClassifier(nn.Module):
    def __init__(self):
        super(SimpleClassifier, self).__init__()
        # One input node, two hidden nodes, one output node
        self.hidden = nn.Linear(1, 2)  # From input to two hidden nodes
        self.output = nn.Linear(2, 1)  # From two hidden nodes to output

    def forward(self, x):
        # Forward pass: input -> hidden layer (ReLU activation) -> output (Sigmoid activation)
        x = torch.relu(self.hidden(x))  # Apply ReLU to the hidden layer
        x = torch.sigmoid(self.output(x))  # Sigmoid to get the output between 0 and 1
        return x

I don't get much better than 50% accuracy i.e. guessing. :D

Here's my training loop:

def train_model(model, criterion, optimizer, dataloader, epochs=100):
    for epoch in range(epochs):
        epoch_loss = 0.0
        for inputs, labels in dataloader:
            # Zero the parameter gradients
            optimizer.zero_grad()

            # Forward pass
            outputs = model(inputs)

            # Compute loss
            loss = criterion(outputs, labels)

            # Add L1 regularization
            l1_loss = 0
            l1_weight = 0.001
            loss
            for param in model.parameters():
                l1_loss += torch.sum(torch.abs(param))
            loss += l1_weight * l1_loss
            # loss = criterion(outputs, labels)  # Unsqueeze labels to match output shape

            # Backward pass and optimize
            loss.backward()
            optimizer.step()

            # Accumulate loss
            epoch_loss += loss.item()

What could I improve? I really wanna keep the MLP this simple

#

hmm maybe it's just not possible mathematically? I basically have two linear functions, I wouldn't know how I could do it by hand

past meteor Sep 9, 2024, 3:27 PM

#

upbeat prism Hi, so I want to classify if a number between 1 and 100 is even or odd. Now I wa...

You need an activation function

upbeat prism Sep 9, 2024, 3:27 PM

#

past meteor You need an activation function

there's relu

#

maybe I just write my own using modulu and basically hardcode it ^^

past meteor Sep 9, 2024, 3:28 PM

#

Aha, I didn't see that. Then you likely need to increase the number of parameters

upbeat prism Sep 9, 2024, 3:28 PM

#

I just need some model that has very distinct grads

past meteor Sep 9, 2024, 3:28 PM

#

But not so much so as to memorize which number is even and odd

upbeat prism Sep 9, 2024, 3:30 PM

#

yeah of course but I wanted to make a minimal example for something. I wanted to find a classification that for a given input has very distinct gradients

verbal oar Sep 9, 2024, 3:36 PM

#

yes I think or one of its dependencies is issue, pytorch reply

past meteor Sep 9, 2024, 3:39 PM

#

past meteor Aha, I didn't see that. Then you likely need to increase the number of parameter...

I'm curious about how well neural networks can extrapolate anyway

#

Lots of chance it will not work with larger numbers

upbeat prism Sep 9, 2024, 3:42 PM

#

I couldn't even think of how to do it manually but anyway found something else that might work

unkempt apex Sep 9, 2024, 5:23 PM

#

past bramble me too, are you gonna convert text to embeddings?

it's compulsory right ?? to convert text to embeddings?

rich moth Sep 9, 2024, 6:03 PM

#

past bramble impressive, it looks like it will reconstruct the same image for same prompts, ...

ya, its expected. when you give it the same input, it should reliably produce similar outputs.

rich moth Sep 9, 2024, 6:08 PM

#

verbal oar is vae from scratch hard to do? I saw for example building in keras but its rath...

it took a couple months, but I added a manifold autoencoder and attention aggregation as well as clip and blip to help with the text-image alignment and caption generation and enough trial and error to kill a horse

past bramble Sep 9, 2024, 6:08 PM

#

unkempt apex it's compulsory right ?? to convert text to embeddings?

who made it compulsary? we can use our own ways if we wish to, not that I know any other ways yet

past bramble Sep 9, 2024, 6:09 PM

#

rich moth ya, its expected. when you give it the same input, it should reliably produce si...

nice

unkempt apex Sep 9, 2024, 6:09 PM

#

past bramble who made it compulsary? we can use our own ways if we wish to, not that I know a...

compulsory means, effective way to pass tokens!
btw what are other ways also?

rich moth Sep 9, 2024, 6:13 PM

#

unkempt apex compulsory means, effective way to pass tokens! btw what are other ways also?

besides embeddings?

unkempt apex Sep 9, 2024, 6:15 PM

#

rich moth besides embeddings?

yeah

rich moth Sep 9, 2024, 6:32 PM

#

rich moth besides embeddings?

nothing I can think of as effective, not really. But what if you stacked embeddings of tokens from a sentence or sequence to form a larger image-like structure

verbal oar Sep 9, 2024, 6:41 PM

#

ah so you mixed things

rich moth Sep 9, 2024, 6:42 PM

#

verbal oar ah so you mixed things

exactly

verbal oar Sep 9, 2024, 6:43 PM

#

I'm asking because when I see calculus of variations (variatonal) inspiration, and wonder if it is difficult in code as in math formulation

#

there is much of derivation

#

for example I saw in wikipedia derivation of q or p dont remember

rich moth Sep 9, 2024, 6:52 PM

#

verbal oar I'm asking because when I see calculus of variations (variatonal) inspiration, a...

i bypassed a lot of complexities using a vector quantizer to represent the latent space

deep abyss Sep 9, 2024, 6:57 PM

#

I am having some troubles with tensorflow. I am loading tf_flowers dataset using tensorflow_datasets. The moment I run the jupyter cell and load it, 1.9 GB of 4 GB of my dedicated VRAM gets used which was all free before, the total size of the dataset is just around 233 MB, Also, when I try to train some models with single dense layer only and 128 neurons, I get ResourceExhaustedError saying Out Of Memory while only 2.1 GB of my dedicated VRAM is used and 1.9 GB is still left. How do I deal with this without restarting the kernel each time?

arctic silo Sep 9, 2024, 8:44 PM

#

Hi talents I installed annaconda 2024 version and I'm using jupyter notebook Its too slow any one has this problem

#

I used the old version and its not slow as this

left tartan Sep 9, 2024, 9:03 PM

#

arctic silo Hi talents I installed annaconda 2024 version and I'm using jupyter notebook Its...

Too many variables to comment, but it's unlikely just changing anaconda version had any measurable impact. What are you doing with your notebook?

arctic silo Sep 9, 2024, 9:03 PM

#

some data anlysis

#

pandas ,numpy and this kind of module

left tartan Sep 9, 2024, 9:12 PM

#

arctic silo pandas ,numpy and this kind of module

That's pretty vague

arctic silo Sep 9, 2024, 9:13 PM

#

why ? what do you mean ?

pine escarp Sep 9, 2024, 9:40 PM

#

arctic silo why ? what do you mean ?

He's asking you to be specific.

low void Sep 9, 2024, 11:32 PM

#

I started on kaggle Few days ago what do I need to know before starting the titanic competition
I just finished the introduction to programming course by Alexis Cook

serene scaffold Sep 9, 2024, 11:44 PM

#

low void I started on kaggle Few days ago what do I need to know before starting the tita...

do you know the story of the titanic, and how life boat seats were allocated?
domain knowledge really shines here.

low void Sep 9, 2024, 11:46 PM

#

serene scaffold do you know the story of the titanic, and how life boat seats were allocated? do...

I know the story and I would consider going back to refresh my memory on it

I may even try seeing the movie again

After that what's the next step?

serene scaffold Sep 9, 2024, 11:50 PM

#

low void I know the story and I would consider going back to refresh my memory on it I m...

the movie won't help.

low void Sep 9, 2024, 11:51 PM

#

serene scaffold the movie won't help.

OK then, YouTube resources would do right?

serene scaffold Sep 9, 2024, 11:52 PM

#

low void OK then, YouTube resources would do right?

you only need to understand how it was decided who would get on the lifeboats.

and you should be able to manipulate tabular data with pandas to highlight those determining factors.

#

I don't know that course. you might do the kaggle pandas tutorial.

low void Sep 9, 2024, 11:55 PM

#

serene scaffold you only need to understand how it was decided who would get on the lifeboats. ...

Will the knowledge on the introduction to programming do?

serene scaffold Sep 9, 2024, 11:56 PM

#

serene scaffold I don't know that course. you might do the kaggle pandas tutorial.

.

low void Sep 9, 2024, 11:58 PM

#

serene scaffold .

OK thanks the help, I'll keep you updated on the development

untold fable Sep 10, 2024, 2:43 AM

#

Where to learn ai

#

In yt

odd stratus Sep 10, 2024, 3:56 AM

#

past bramble me too, are you gonna convert text to embeddings?

what are embeddings?
im just gonna try go letter by letter

rich moth Sep 10, 2024, 4:18 AM

#

unkempt apex yeah

You got me thinking of a different type of technique. Instead of passing standard embeddings, im stacking them to create an image like representation. I made a CNN that reshapes the embeddings into a 2d grid and applies connvoultions to extract patterns and intergrates it with the image data. I intergrated it in my project ive been working on and its training now

small wedge Sep 10, 2024, 4:25 AM

#

rich moth You got me thinking of a different type of technique. Instead of passing stand...

Thats cool but seems a bit counter intuitive to me, since the intuition behind convolution is that it gives you information about the neighbors of an "anchor datum". In other words, it would give you information relating to the position of the embeddings on the grid and the neighbors surrounding your anchor, which doesnt really make sense for embeddings in the same way it would for pixels. But I'll be interested to see if the results you get are good nonetheless.

Is there some specific reason you built it like this like it's used in a paper or are you just throwing stuff at the wall for research?

jaunty helm Sep 10, 2024, 4:30 AM

#

the attention mechanism should (hopefully) be taking care of the relationships between the words already

rich moth Sep 10, 2024, 4:34 AM

#

small wedge Thats cool but seems a bit counter intuitive to me, since the intuition behind c...

good point, but the reason im trying this is im hoping the CNN can learn to capture the higher level patterns and relationships between the token embeddings even if its not strickly spatial. i dont know if it will pan out, but i figured it worth a shot.

small wedge Sep 10, 2024, 4:34 AM

#

Gotcha

desert oar Sep 10, 2024, 4:38 AM

#

rich moth good point, but the reason im trying this is im hoping the CNN can learn to capt...

you might want to look into how the "standard" embeddings are constructed

odd stratus Sep 10, 2024, 5:24 AM

#

so im new to a.i. what sort of layers and systems should i be implementing and using?

unkempt apex Sep 10, 2024, 5:31 AM

#

rich moth You got me thinking of a different type of technique. Instead of passing stand...

share the results! ( just always u do ), it will be interesting to see that then

past bramble Sep 10, 2024, 5:37 AM

#

unkempt apex compulsory means, effective way to pass tokens! btw what are other ways also?

not really, figuring it out

past bramble Sep 10, 2024, 5:37 AM

#

odd stratus what are embeddings? im just gonna try go letter by letter

💀 well no

#

I hope you know what tokens are in LLMs

#

each token gets converted into vectors of n dimensions

#

basically an array of n dimensions containing floats

#

two tokens with same meaning will have similar vectors, such as boy and male

#

when you perform math operations you will quite often get the same result
example:
distance = King - man

now we can use it this way:

woman + distance
which is equal to Queen

unkempt apex Sep 10, 2024, 5:51 AM

#

odd stratus what are embeddings? im just gonna try go letter by letter

watch 3b1b video of Deep learning, then you will understand it deeply!

#

This is original text
priknik horn red electric air horn compressor interior dual tone trumpet loud compatible with sx

and this is tokenized from BERT normal tokenizer

 '##k',
 '##nik',
 'horn',
 'red',
 'electric',
 'air',
 'horn',
 'compressor',
 'interior',
 'dual',
 'tone',
 'trumpet',
 'loud',
 'compatible',
 'with',
 's',
 '##x']```

#

is it good?, but why '##' is being added to letters

odd stratus Sep 10, 2024, 6:13 AM

#

past bramble 💀 well no

oh yeah, i remembered seeing something like that but i had no idea how it works

past bramble Sep 10, 2024, 6:18 AM

#

odd stratus oh yeah, i remembered seeing something like that but i had no idea how it works

Those vectors are created by some ways idk, but you can use the same text embeddings from an already existing open source model

odd stratus Sep 10, 2024, 6:21 AM

#

past bramble Those vectors are created by some ways idk, but you can use the same text embedd...

pithink watching through the 3b1b videos
i know what i want to do and how it works, i just dont know how to do it or what i would need to do to start doing it

#

does the a.i. learn the vectors itself through training, or are the vectors premade upon loading into the perceptron?

past bramble Sep 10, 2024, 6:22 AM

#

I was thinking of using it and then I saw the size of one vector for one of the models was "800", it's huge to me

past bramble Sep 10, 2024, 6:22 AM

#

odd stratus does the a.i. learn the vectors itself through training, or are the vectors prem...

the vectors are premade based on existing data

odd stratus Sep 10, 2024, 6:23 AM

#

past bramble the vectors are premade based on existing data

pithink interestingg
i have no idea how i would use or gain the vectors though lmao

past bramble Sep 10, 2024, 6:23 AM

#

odd stratus <:pithink:652247559909277706> interestingg i have no idea how i would use or gai...

using existing vectors is what we can do, I still think they are large

#

@odd stratus when you said letter by letter, are you passing in ascii values? How are you going to pass them?

odd stratus Sep 10, 2024, 6:26 AM

#

past bramble <@847392618564026368> when you said letter by letter, are you passing in ascii v...

ascii values squashed to the scale of 0-1 or smthn
and the output can be a 128 vector output or smthn the a.i. can chose from

past bramble Sep 10, 2024, 6:29 AM

#

odd stratus ascii values squashed to the scale of 0-1 or smthn and the output can be a 128 v...

Im sure it'll struggle creating words. Passing words instead might be better.

I was thinking of coming up with some algorithm that converts words to numbers and back, still thinking

past bramble Sep 10, 2024, 6:48 AM

#

I have a really bad idea

#

I make a list of words, everytime I come accross a new word I append it

#

and the indices will be the values I pass in to train the model and get the output

deep abyss Sep 10, 2024, 8:26 AM

#

deep abyss I am having some troubles with `tensorflow`. I am loading `tf_flowers` dataset u...

Is there any solution to this...?

unkempt apex Sep 10, 2024, 8:45 AM

#

deep abyss Is there any solution to this...?

so restarting session works?

#

but then slowly slowly as you move forward ( run more code ) , it gives you this error right?

deep abyss Sep 10, 2024, 8:46 AM

#

unkempt apex but then slowly slowly as you move forward ( run more code ) , it gives you this...

Yes

unkempt apex Sep 10, 2024, 8:46 AM

#

deep abyss Yes

elaborate more what are u doing in that code!

#

I mean, how u are loading dataset and all

#

are u using Dataloader class?

deep abyss Sep 10, 2024, 8:49 AM

#

unkempt apex I mean, how u are loading dataset and all

I am just using tfds.load for tf_flowers dataset with batch_size 8.

#

It is. giving a Dataset object.

unkempt apex Sep 10, 2024, 8:49 AM

#

and then?

#

just go line by line , what are u doing

deep abyss Sep 10, 2024, 8:52 AM

#

Doing some normalisation on image, and training a sequential model with a flatten layer 64 neuron dense layer and softmax output (used Adam optimizer).

unkempt apex Sep 10, 2024, 8:54 AM

#

only these?

deep abyss Sep 10, 2024, 8:56 AM

#

Here is the code to load dataset:

BATCH_SIZE = 16 # Later changed to 8 but could not solve the problem
IMG_WIDTH = 128
IMG_HEIGHT = 128

builder = tfds.builder("tf_flowers")
builder.download_and_prepare(download_dir=r"D:\tensorflow_datasets")
train_ds, test_ds = builder.as_dataset(
    split=["train[:80%]", "train[80%:]"],
    shuffle_files=True,
    batch_size=BATCH_SIZE
)
class_names = builder.info.features["label"].names
print(class_names)
def preprocess_images(image_batch):
    # Resizing the images
    image_batch["image"] = tf.image.resize(image_batch["image"], (IMG_HEIGHT, IMG_WIDTH))
    # Scaling the images
    image_batch["image"] = tf.image.convert_image_dtype(image_batch["image"], tf.float32)
    # Format expected by `fit` method
    return (image_batch["image"], image_batch["label"])


prepared_train_ds = train_ds.map(preprocess_images, num_parallel_calls=tf.data.AUTOTUNE)
prepared_test_ds = test_ds.map(preprocess_images, num_parallel_calls=tf.data.AUTOTUNE)

Model code:

model2 = tf.keras.Sequential([
    tf.keras.layers.Flatten(input_shape=(IMG_HEIGHT, IMG_WIDTH, 3)),
    tf.keras.layers.Dense(16, activation="relu"),
    tf.keras.layers.Dense(len(class_names), activation="softmax")
])

model2.compile(
    optimizer="adam",
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=False),
    metrics=["accuracy"]
)

#

I later changeed the dense layer neurons from 64 to 16 to resolve the error, but I couldn't.

unkempt apex Sep 10, 2024, 8:59 AM

#

share the full traceback also!

#

!paste

arctic wedgeBOT Sep 10, 2024, 8:59 AM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

deep abyss Sep 10, 2024, 9:14 AM

#

unkempt apex share the full traceback also!

I am currently not able to reproduce the error, but from a previous training, here is the error:

ResourceExhaustedError: {{function_node __wrapped__Mul_device_/job:localhost/replica:0/task:0/device:GPU:0}} failed to allocate memory [Op:Mul]

I copied it from my GPT prompt where I first asked about this problem. I am unable to provide the full traceback.

unkempt apex Sep 10, 2024, 9:16 AM

#

deep abyss I am currently not able to reproduce the error, but from a previous training, he...

to able to reproduce error?? what?? , then share current error

deep abyss Sep 10, 2024, 9:16 AM

#

unkempt apex to able to reproduce error?? what?? , then share current error

Yeah, the situation is very random...

unkempt apex Sep 10, 2024, 9:16 AM

#

https://stackoverflow.com/questions/69641708/tensorflow-python-framework-errors-impl-resourceexhaustederror-failed-to-alloca

Stack Overflow

tensorflow.python.framework.errors_impl.ResourceExhaustedError: fai...

Hi I am a beginner in DL and tensorflow,
I created a CNN (you can see the model below)
model = tf.keras.Sequential()

model.add(tf.keras.layers.Conv2D(filters=64, kernel_size=7, activation="relu&

#

have u tried all this?

verbal oar Sep 10, 2024, 9:17 AM

#

ResourceExhaustedError docs?

unkempt apex Sep 10, 2024, 9:17 AM

#

verbal oar ResourceExhaustedError docs?

it says reduce batch size

#

https://www.tensorflow.org/api_docs/python/tf/errors/ResourceExhaustedError

TensorFlow

tf.errors.ResourceExhaustedError | TensorFlow v2.16.1

Raised when some resource has been exhausted while running operation.

verbal oar Sep 10, 2024, 9:17 AM

#

so out of memory, as I supposed

unkempt apex Sep 10, 2024, 9:18 AM

#

verbal oar so out of memory, as I supposed

but he says he have 4gb vram

deep abyss Sep 10, 2024, 9:18 AM

#

unkempt apex it says reduce batch size

Well, I tried that, but I turns of that tensorflow might be saving models in GPU memory until kernel is shutdown/restart...

#

So, reducing batch_size didn't worked for me.

unkempt apex Sep 10, 2024, 9:19 AM

#

deep abyss Well, I tried that, but I turns of that tensorflow might be saving models in GPU...

because it is helpful for ourselves

verbal oar Sep 10, 2024, 9:19 AM

#

reduce dimension size of model weights

#

hmm but batch size is not too big 16

unkempt apex Sep 10, 2024, 9:20 AM

#

deep abyss So, reducing batch_size didn't worked for me.

wait wait, try the same notebook on kaggle or collab

jaunty helm Sep 10, 2024, 9:20 AM

#

not familiar with tf, but maybe

prepared_train_ds = train_ds.map(preprocess_images, num_parallel_calls=tf.data.AUTOTUNE)
```this part's doing copies and so your gpu can't hold all of the data?

unkempt apex Sep 10, 2024, 9:21 AM

#

wtfff

deep abyss Sep 10, 2024, 9:21 AM

#

I also tried: tf.keras.backend.clear_session() but didn't release the memory.

jaunty helm Sep 10, 2024, 9:21 AM

#

jaunty helm not familiar with tf, but maybe ```py prepared_train_ds = train_ds.map(preproces...

cause python would have to hold both train_ds and prepared_train_ds (and the test ones)

verbal oar Sep 10, 2024, 9:22 AM

#

For example, this error might be raised if a per-user quota is exhausted, or perhaps the entire file system is out of space. If running into ResourceExhaustedError due to out of memory (OOM), try to use smaller batch size or reduce dimension size of model weights.

deep abyss Sep 10, 2024, 9:22 AM

#

jaunty helm not familiar with tf, but maybe ```py prepared_train_ds = train_ds.map(preproces...

Well, on my physical disk the whole dataset size is around 233 MB, but it uses 1.9 GB of my GPU memory when I load it.

jaunty helm Sep 10, 2024, 9:22 AM

#

assuming it's copying, if you did like

train_ds = train_ds.map(preprocess_images, num_parallel_calls=tf.data.AUTOTUNE)
```the unprocessed data could be collected and reduce mem

jaunty helm Sep 10, 2024, 9:23 AM

#

deep abyss Well, on my physical disk the whole dataset size is around 233 MB, but it uses 1...

dunno

#

maybe the data is compressed so when you load it it takes more memory than it might seem

unkempt apex Sep 10, 2024, 9:26 AM

#

yeah

#

@deep abyss have u checked dataset manually?

#

it's all images right

deep abyss Sep 10, 2024, 9:26 AM

#

unkempt apex it's all images right

Yes

unkempt apex Sep 10, 2024, 9:26 AM

#

deep abyss Yes

and all of that is just 278 mb

deep abyss Sep 10, 2024, 9:27 AM

#

unkempt apex and all of that is just 278 mb

https://www.tensorflow.org/datasets/catalog/tf_flowers

TensorFlow

tf_flowers | TensorFlow Datasets

unkempt apex Sep 10, 2024, 9:28 AM

#

no it's only 221 mb

#

another option as I said, try to run the same code on colab now

#

with the GPU they provide

verbal oar Sep 10, 2024, 9:29 AM

#

profiling would be helpful I think

deep abyss Sep 10, 2024, 9:31 AM

#

unkempt apex another option as I said, try to run the same code on colab now

Okay I am trying...

unkempt apex Sep 10, 2024, 9:31 AM

#

deep abyss Okay I am trying...

if error not occurs, change upgrade your GPU then 😂

verbal oar Sep 10, 2024, 9:32 AM

#

https://www.tensorflow.org/guide/profiler

TensorFlow

Optimize TensorFlow performance using the Profiler | TensorFlow C...

#

some memory profile specifically

unkempt apex Sep 10, 2024, 9:35 AM

#

use pytorch always 🫂

#

I never used tf actually

verbal oar Sep 10, 2024, 9:37 AM

#

I must try pytorch

deep abyss Sep 10, 2024, 9:38 AM

#

While loading the dataset in Colab it takes no GPU memory, the usage remains constant to 0.1 GB out of 16 GB but in my system it instantly consumes 1.9 GB of dedicated GPU VRAM (I have RTX 3050 with 4 GB dedicated VRAM). Why that might be...?

verbal oar Sep 10, 2024, 9:38 AM

#

is porting from torch(lua) relatively easy to pytorch?

#

because I see some deep render in torch and want do it in pytorch

odd stratus Sep 10, 2024, 9:39 AM

#

past bramble Im sure it'll struggle creating words. Passing words instead might be better. I...

ordinals?

past bramble Sep 10, 2024, 9:39 AM

#

trying to improve my image generation model, looks good from epoch 8 :)

odd stratus Sep 10, 2024, 9:39 AM

#

im using ordinals for the letter inputs and outputs

verbal oar Sep 10, 2024, 9:39 AM

#

looks better and better I think

deep abyss Sep 10, 2024, 9:40 AM

#

deep abyss While loading the dataset in Colab it takes no GPU memory, the usage remains con...

And I am using tensorflow 2.10 as that iis the only supported version in Windows.

verbal oar Sep 10, 2024, 9:41 AM

#

what might be to know some bottlenecks etc use profiler

past bramble Sep 10, 2024, 9:42 AM

#

odd stratus ordinals?

what's that?

verbal oar Sep 10, 2024, 9:42 AM

#

I assume ordinal numers but not sure

past bramble Sep 10, 2024, 9:43 AM

#

ordinal numbers and encoding text doesn't relate

verbal oar Sep 10, 2024, 9:44 AM

#

hmm but I saw somewhere this term ordinals, forgot where

#

maybe OrdinalEncoder

#

https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.OrdinalEncoder.html

scikit-learn

OrdinalEncoder

Gallery examples: Release Highlights for scikit-learn 1.3 Release Highlights for scikit-learn 1.2 Categorical Feature Support in Gradient Boosting Combine predictors using stacking Poisson regressi...

#

looks like make sense

#

to preserve inherent ordering

past bramble Sep 10, 2024, 9:48 AM

#

hm

unkempt apex Sep 10, 2024, 9:48 AM

#

past bramble trying to improve my image generation model, looks good from epoch 8 :)

using GAN?

unkempt apex Sep 10, 2024, 9:49 AM

#

deep abyss While loading the dataset in Colab it takes no GPU memory, the usage remains con...

is it giving any error on collab?

deep abyss Sep 10, 2024, 9:50 AM

#

unkempt apex is it giving any error on collab?

No, not till now....

past bramble Sep 10, 2024, 9:50 AM

#

unkempt apex using GAN?

I suppose

unkempt apex Sep 10, 2024, 9:52 AM

#

which model? are u using , I have tried U-Net!

past bramble Sep 10, 2024, 9:53 AM

#

wdym? GAN doesn't explain it?

unkempt apex Sep 10, 2024, 9:54 AM

#

u using GAN now right?

past bramble Sep 10, 2024, 9:54 AM

#

yeah

unkempt apex Sep 10, 2024, 9:54 AM

#

how's your structure of Generator then?

past bramble Sep 10, 2024, 9:54 AM

#

bunch of CNNs

unkempt apex Sep 10, 2024, 9:54 AM

#

generally people make similar to CNN

#

yeah that's what, but we can also make similiar like U-Net

past bramble Sep 10, 2024, 9:55 AM

#

that's a new one

unkempt apex Sep 10, 2024, 9:55 AM

#

cGAN !

past bramble Sep 10, 2024, 9:56 AM

#

conditonal GAN? I guess I made my number model that way

unkempt apex Sep 10, 2024, 9:56 AM

#

yup

past bramble Sep 10, 2024, 9:57 AM

#

odd stratus Sep 10, 2024, 10:00 AM

#

past bramble what's that?

!e

print(ord("E"))
print(chr(69))

arctic wedgeBOT Sep 10, 2024, 10:00 AM

#

odd stratus !e ```py print(ord("E")) print(chr(69)) ```

:white_check_mark: Your 3.12 eval job has completed with return code 0.

001 | 69
002 | E

past bramble Sep 10, 2024, 10:03 AM

#

odd stratus !e ```py print(ord("E")) print(chr(69)) ```

well that's ASCII

#

I don't think it'll be effective

#

dunno I haven't tried

#

btw a question
_____________

#

I need outputs from a neural network from a set of numbers, which each represent a word. How can I make it that the network only outputs from the set I have defined?

Example:
I have the set: ```py
[0.1, 0.2, 0.3, 0.4, 0.5]


The output:```py
[0.3,  0.1, 0.4]
```or  ```py
[0.3, 0.1, 0.4, 0, 0]  # padding on the right

The output size isn't fixed, since a conversation response can be of any size.

How can I go about making such an output layer?

desert oar Sep 10, 2024, 10:46 AM

#

past bramble I need outputs from a neural network from a set of numbers, which each represent...

Where did those numbers come from?

Look up "one hot encoding": each word is encoded as a vector of all 0s, with 1 in the position corresponding to the word. So your input sequence is a sequence of vectors, not of numbers.

jaunty helm Sep 10, 2024, 10:51 AM

#

desert oar Where did those numbers come from? Look up "one hot encoding": each word is enc...

I think they mean the output should only output those in the set

past bramble Sep 10, 2024, 10:52 AM

#

desert oar Where did those numbers come from? Look up "one hot encoding": each word is enc...

I will be encoding the words into numbers, I want to input numbers for learning and experimenting purpose.
Thanks for the idea on one hot encoding, I didn't think I could use that here. For now I still want to try on numbers first before vectors

odd stratus Sep 10, 2024, 11:04 AM

#

past bramble I don't think it'll be effective

its getting a 90% accuracy after 30000 epochs lmaoo

unkempt apex Sep 10, 2024, 11:19 AM

#

wait, 30k epochs seriously? 😂

odd stratus Sep 10, 2024, 11:21 AM

#

unkempt apex wait, 30k epochs seriously? 😂

pithink i have no idea if thats good or not but it only takes about 30 minutes

#

im basing the concepts off of the image generating a.i's where the a.i. only needs to predict one letter at a time to create a full "image" but the image being the output text
and my input data is the entire movie scene script for The Fellowship of The Ring lmaooo

odd stratus Sep 10, 2024, 11:41 AM

#

ive restarted a few times and trained my model a bit in each to see how it works
it seems to follow two trends

it repeatedly outputs a single letter after initialising but around epoch 3000 it starts choosing different letters
it either
a. starts getting everything correct
b. or it starts averaging results and getting incorrect output
could just be random initialising data
but it does get really accurate results when its initialised data is lucky lmao 50/50

past bramble Sep 10, 2024, 12:05 PM

#

odd stratus im basing the concepts off of the image generating a.i's where the a.i. only nee...

lol have you tried it out?

#

can you show it's responses

#

30k in 30 minutes is fast ngl

odd stratus Sep 10, 2024, 12:06 PM

#

testing different stuff to get it to do a full generated output

verbal oar Sep 10, 2024, 12:08 PM

#

do you use git or freestyling with code?

#

or just jupyter notebook or locally

odd stratus Sep 10, 2024, 12:08 PM

#

past bramble 30k in 30 minutes is fast ngl

these are the layer sizes [101,1000,300,500,500,258] output layer is 258

verbal oar Sep 10, 2024, 12:10 PM

#

for example I think I should not do sth like "added vae from scratch" but rather more modular messages?

#

like added encoder
added decoder

#

or not use git would be faster

#

when I'm not using git I code faster

#

I know how to use it but dont know when, at what messages to have

past bramble Sep 10, 2024, 12:13 PM

#

odd stratus these are the layer sizes [101,1000,300,500,500,258] output layer is 258

for a sec I thought read 101,1000,300,500,500,258 as a single 20 digit number 🤣

past bramble Sep 10, 2024, 12:13 PM

#

verbal oar I know how to use it but dont know when, at what messages to have

jus say whatever you did

regal light Sep 10, 2024, 12:14 PM

#

how do we utilize tensorflow gpu on pycharm
i tried every possible way, but I can't find the right solution

verbal oar Sep 10, 2024, 12:15 PM

#

writing git messages is like naming variables 😂

odd stratus Sep 10, 2024, 12:16 PM

#

past bramble for a sec I thought read 101,1000,300,500,500,258 as a single 20 digit number 🤣

lmaooooo

desert oar Sep 10, 2024, 1:55 PM

#

jaunty helm I think they mean the output should only output those in the set

Correct, but there's no way to do that. One hot encoding is how you do that. On the other hand, if the output is a real number within some range, there are things you can do to constrain the range of the output. But you can't put arbitrary constraints on the output beyond that. If you try, you run into some fundamental trickiness of the real numbers, among other problems

desert oar Sep 10, 2024, 1:57 PM

#

past bramble I will be encoding the words into numbers, I want to input numbers for learning ...

Unfortunately, one hot encoding is precisely how you encode a fixed set of numbers in a model. You are mapping words to integers, and then mapping those integers to elements in a vector. There are other ways to do it that are mostly used in research fields like psychology, but for the purposes of machine learning they are equivalent, so one hot encoding is preferred because it's the simplest and easiest to interpret

#

It looks like you're trying to use numbers other than integers, maybe decimal numbers within some range? Consider that 0.1, 0.2, ... 1.0 are identical to 1, 2, ..., 10 -- you just divide everything by 10

#

So without loss of generality, you can always transform a finite set of numbers to natural numbers counting up from 1 or 0 as desired

#

It turns out that this is true even for the rational numbers. digging into that is the content of a course in real analysis

#

I hate to tell you not to experiment with something, but at least hopefully you understand now why people do what they do (and don't do what you're trying to do)

buoyant vine Sep 10, 2024, 2:05 PM

#

Sorry to derail Salt's excellent explanation, but a bit of a question around training LLMs or at least, looking for guidance around what approach to take:

I'm currently looking to try build a model the predicts the next set of relevant tokens upto N tokens for Y variants, where N and Y are small (think maybe 10 at most) where it is trying to predict the most relevant tokens based on a input training dataset that varies in size.

I guess it technically falls under generative AI but it has some caveats:

The aim is not to produce accurate grammar or longer sentences, just tokens.
The system does not want to do KNN or other semantic search type of logic to get the most relevant tokens, i.e. RAG is out of the question.

I haven't tried it yet but I wondered if you could take some basic encoder-decoder model and fine-tune it to the new dataset forcing it to generating the tokens related to that dataset only. But not sure if that is the right or most efficient way to do so.

simple tapir Sep 10, 2024, 2:10 PM

#

What do you guys suggest for mlops? ZenML, MLFlow or something else?

buoyant vine Sep 10, 2024, 2:10 PM

#

simple tapir What do you guys suggest for mlops? ZenML, MLFlow or something else?

Had a good experience with both MLFlow and Neptune (SAAS)

simple tapir Sep 10, 2024, 2:11 PM

#

What do you think about ZenML?

buoyant vine Sep 10, 2024, 2:15 PM

#

Haven't tried it so can't really say

small wedge Sep 10, 2024, 3:06 PM

#

buoyant vine Sorry to derail Salt's excellent explanation, but a bit of a question around tra...

Could you just generate embeddings for your new dataset and insert them into the pretrained model?

#

here's a relevant paper on this technique, although they were testing different languages https://openreview.net/pdf?id=MsjB2ohCJO1

past bramble Sep 10, 2024, 3:17 PM

#

desert oar Unfortunately, one hot encoding is precisely how you encode a fixed set of numbe...

I have reconsidered with the way you have explained it. I know how to use one hot encoding to provide input, what about the output? I'm not aware of activation functions or any solution for recieving output in this way

small wedge Sep 10, 2024, 3:22 PM

#

past bramble I have reconsidered with the way you have explained it. I know how to use one ho...

what are you trying to do?

simple nimbus Sep 10, 2024, 3:23 PM

#

hey, given a sentence, is there any way to figure out which chapter (textbook) or topic (pre determined) is it from? from research online I was told to use BERT but is there any simpler way? looks like I have to Train BRET with quite some data to begin with

past bramble Sep 10, 2024, 3:24 PM

#

small wedge what are you trying to do?

make a small LLM type of model. do you need more context?

small wedge Sep 10, 2024, 3:24 PM

#

the way LLM's choose a word is by having a softmax across their entire vocabulary

#

the token with the highest probability is chosen

serene scaffold Sep 10, 2024, 3:26 PM

#

simple nimbus hey, given a sentence, is there any way to figure out which chapter (textbook) o...

BERT is just another language model, as is GPT.
But I would avoid using language models for this, if you can get away with it. You could instead figure out what the "keywords" are for each chapter, and make a decision based on which of those keywords appear in the sentence.

#

or, if you have the whole textbook available, you can just... find the sentence in the textbook.

simple nimbus Sep 10, 2024, 3:27 PM

#

sometimes I need to identify what the sentences is about

serene scaffold Sep 10, 2024, 3:28 PM

#

simple nimbus sometimes I need to identify what the sentences is about

Consider this sentence:

I'd never want to go anywhere without my wonderful towel.
Is this sentence about "I" or "towel"?

simple nimbus Sep 10, 2024, 3:29 PM

#

Its about "I"

serene scaffold Sep 10, 2024, 3:29 PM

#

simple nimbus Its about "I"

so, you just need to identify the grammatical subject of each sentence?

simple nimbus Sep 10, 2024, 3:31 PM

#

but for sentence analysis that I am doing, towel was more appropriate answer

#

sorry for confusion

past bramble Sep 10, 2024, 3:32 PM

#

small wedge the token with the highest probability is chosen

What about the output size? how do they create text such that their content doesn't exceed the max limit and it's constructed accordingly by stopping with punctuations. picking tokens with highest probability until you reach a stop punctuation before hitting the max length?

small wedge Sep 10, 2024, 3:33 PM

#

the model outputs 1 token at a time

#

you request from it as many tokens as you want (input -> output1 -> input + output1 -> output2 -> ...)

#

you could use punctuation as a way to stop if you want shrug it doesn't really matter

simple nimbus Sep 10, 2024, 3:36 PM

#

for example given


---

Consider the following pairs:

1. Port of Rotterdam: First major port in Europe registered as a company
2. Port of Shanghai: Largest privately owned port in the world
3. Port of Singapore: Largest container port in the world

How many of the above pairs are correctly matched?

(a) Only one pair  
(b) Only two pairs  
(c) All three pairs  

---

I have to determine if this question is from geography or history

past bramble Sep 10, 2024, 3:39 PM

#

small wedge you request from it as many tokens as you want (input -> output1 -> input + outp...

are you saying it guesses the next word?

small wedge Sep 10, 2024, 3:39 PM

#

that's how LLMs work yes

past bramble Sep 10, 2024, 3:39 PM

#

small wedge you could use punctuation as a way to stop if you want <:shrug:33226818151723827...

you can't have a. punctuation just. anywhere

past bramble Sep 10, 2024, 3:41 PM

#

small wedge that's how LLMs work yes

so I train it with a dataset where the input is a text and the output is the word it's supposed to guess?
that's weird I have to figure out how to do that when the dataset I have is conversation pairs

small wedge Sep 10, 2024, 3:43 PM

#

past bramble so I train it with a dataset where the input is a text and the output is the wor...

in essence yes but it gets a bit more complicated with transformer architecture. Are you planning on using that or are you just gonna make a simple one with LSTM or something?

small wedge Sep 10, 2024, 3:43 PM

#

past bramble so I train it with a dataset where the input is a text and the output is the wor...

can you give an example of an input/label pair in your dataset?

past bramble Sep 10, 2024, 3:49 PM

#

small wedge can you give an example of an input/label pair in your dataset?

I had found this on kaggle

past bramble Sep 10, 2024, 3:50 PM

#

small wedge in essence yes but it gets a bit more complicated with transformer architecture....

It's an experiment, I won't be using it which is why I am trying to be different from the way normal LLMs work

small wedge Sep 10, 2024, 3:51 PM

#

modern llms use transformers and multihead attention all that

#

but you can make something like this with simple RNN stuff like LSTM or GRU

#

yeah so if you wanted to have a chat bot that can generate novel conversations that don't exist in it's dataset you'd probably wanna go the softmax route and feed it stuff like "I'm fine, how about yourself? " -> "I'm fine, how about yourself? I'm" -> "I'm fine, how about yourself? I'm " etc.

#

the big issue you'll probably run into here if you've never played with this kinda NLP before is probably stop words

#

your dataset is not massive, and there might be a lot of words that appear very often like "i'm" "i've" even spaces that the model can easily find local minima for when just spamming the same word over and over as an output. There are 2 minds to dealing with this which is basically to remove common stopwords altogether from the dataset to avoid having the model break during training (this unfortunately leads to the model not being able to accurately generate those stopwords without further fine tuning) or just leaving the stop words in and praying to any gods that will listen that it doesn't break.

desert oar Sep 10, 2024, 4:12 PM

#

past bramble I have reconsidered with the way you have explained it. I know how to use one ho...

you do in fact use one-hot encoding for outputs as well. that's the standard technique for classification in all cases, not just for text (where you are "classifying" each output token with a word). the difference is that you don't get strict 1 and 0 values -- you get a score in each vector element, and conventionally we treat the highest-scoring element as 1 and all the others as 0. ideally you would use the softmax function to ensure that the scores are all between 0 and 1, and they all add up to 1, which helps ensure that the output is sane, aids interpretation, and allows you to use loss functions that treat the output as a multinomial probabilty model, which is exactly what we have here

#

i suggest taking a look at the classic word2vec model: it's a good entry point into a lot of these concepts and still forms the conceptual basis for a lot of what we do in ML with text even 10+ years after the model came out

#

(most of the ideas in word2vec are based on older ideas in ML and statistics but at that point you're going very deep into the fundamentals, which is a good thing, but probably unsatisfying if you want to just play around and build some toy projects)

desert oar Sep 10, 2024, 4:15 PM

#

buoyant vine Sorry to derail Salt's excellent explanation, but a bit of a question around tra...

aren't all the big LLMs are trained on next-token prediction anyway?

#

as far as i understand, that's precisely what "GPT" is/was: a decoder-only model with a huge number of parameters trained on a huge amount of data turns out to be great at generating text

desert oar Sep 10, 2024, 4:18 PM

#

serene scaffold BERT is just another language model, as is GPT. But I would avoid using language...

i kind of disagree. i think a language model is a reasonable approach to obtain a good-quality document embedding. BERT in particular has a small context window so you might need to do something like compute word vectors by sliding the context window across each chapter. then you can just do KNN or train a classifier on the document vectors

#

source: we used BERT vectors at work for text classification shortly after the model came out, and it improved our results compared to other vector embeddings

#

and we didn't fine-tune, we just used the off-the-shelf model weights

#

but as a learning exercise, yeah i think using pre-trained vectors starves you of an opportunity to explore and experiment and practice with building your own things

past bramble Sep 10, 2024, 4:24 PM

#

desert oar you do in fact use one-hot encoding for outputs as well. that's the standard tec...

oh Im getting some ideas now, thanks a lot!

odd stratus Sep 10, 2024, 4:28 PM

#

when im training my a.i.
it isnt outputting quality answers
im training it to predict the next letter in a sequence
however instead of outputting the next predicted letter
its output vector is just an average of the training data

e.g. if 25% of the output was the letter e and 10% was the letter a
its output isnt accurate and instead constantly outputs e as e is the most correct average

how do i prevent it?

buoyant vine Sep 10, 2024, 4:28 PM

#

desert oar aren't all the big LLMs are trained on next-token prediction anyway?

Yep, but the goal of the most of the existing models want to predict human text as such, i.e. it has certain things like gramar correctness and formating sentences, which we don't really want.

The goal is it needs to be fast and lightweights, so it can't do things like RAG or things which end up involving running both the model and then KNN ontop of that.

#

the primary objective is keyword & phrase supplimenting to keyword search queries, but most systems like word2vec or GloVe, etc... are trained on general (normally english) text, making it liable to predicting words that don't exist in the corpus

strong notch Sep 10, 2024, 4:31 PM

#

Does anyone here work with AI in healthcare, or is anyone interested?

small wedge Sep 10, 2024, 4:32 PM

#

odd stratus when im training my a.i. it isnt outputting quality answers im training it to pr...

can you show your code?

buoyant vine Sep 10, 2024, 4:32 PM

#

small wedge here's a relevant paper on this technique, although they were testing different ...

Looks interesting, I will have a peak at this, ty!

odd stratus Sep 10, 2024, 4:33 PM

#

small wedge can you show your code?

https://paste.pythondiscord.com/GY6A

desert oar Sep 10, 2024, 4:34 PM

#

buoyant vine the primary objective is keyword & phrase supplimenting to keyword search querie...

why not train your own word vectors? it's super fast and easy with fasttext

buoyant vine Sep 10, 2024, 4:35 PM

#

Hmm possibly, how well does that work with predicting phrases of text though?

desert oar Sep 10, 2024, 4:35 PM

#

not enough source data?

#

oh, not well because it's cbow and skipgram neither of which is what you want i think

buoyant vine Sep 10, 2024, 4:35 PM

#

possibly, the source data itself is a black box, because it depends ultimiately on who is using the engine

#

different users will have bigger or smaller indexes

desert oar Sep 10, 2024, 4:35 PM

#

how much text do you have? maybe you can use nanogpt

#

that is: use the basic transformer architecture for its original purpose of sequence modeling, forget all the LLM stuff

#

i haven't seen this embedding replacement technique that waterfall posted though, so maybe that's promising

#

it definitely sounds like it might help you, from the abstract

buoyant vine Sep 10, 2024, 4:37 PM

#

Yeah need to dig into it, effectively the biggest issue here is amount of compute required. The goal is this is a suplimental system which can periodly train on the user's search corpus and then that gets used to help supliment search queries

#

giving you an illusion of hybrid or semantic search

#

but without the ANN/KNN related activities

#

In theory you could use word2vec and Glove on some pre-compiled (small) index, but I'm not sure how well they work when trying to form or predict phrases of 2 or 3 words

serene grail Sep 10, 2024, 4:46 PM

#

buoyant vine Sorry to derail Salt's excellent explanation, but a bit of a question around tra...

Does this imply that RAG normally uses KNN? I don't know anything about RAG besides "you want an LLM to produce accurate outputs about fish, so you use RAG with fish articles and hope/assume that your source is accurate"
That's my understanding so far

buoyant vine Sep 10, 2024, 4:53 PM

#

normally RAG has some sort of database that provides context to the LLM

#

which is normally in some form of vector search

#

doesn't have to be, but it is very common

small wedge Sep 10, 2024, 4:58 PM

#

buoyant vine but without the ANN/KNN related activities

have you looked into any sparse encoding search techniques? or would that still be too computationally costly?

serene grail Sep 10, 2024, 4:58 PM

#

And KNN is a form of vector search?

buoyant vine Sep 10, 2024, 4:58 PM

#

it is still realistically very computationally expensive

buoyant vine Sep 10, 2024, 4:59 PM

#

serene grail And KNN is a form of vector search?

Yes

serene grail Sep 10, 2024, 5:00 PM

#

Thank you!

buoyant vine Sep 10, 2024, 5:03 PM

#

small wedge have you looked into any sparse encoding search techniques? or would that still ...

The issue is also the fact that it slows down time to search and ingesting times.

Currently in the landscape trying to do hybrid search with something like sparse encoding or just ANN/KNN you end up using 10-100x more compute than a regular keyword based system would, and often endup scanning a lot more data in the process.

The flip side is often people don't actually want the full semantic behaviour, they just want some similar keywords or terms of phrases to be included in the results when search for something like "high heels" for example. Adding vector search often ends up meaning you need a GPU instance to quickly embed all your data and respond to queries quickly, and then also see a much sharper increase of costs when you dataset grows and your time to search goes down because building the indexes takes longer.

small wedge Sep 10, 2024, 5:06 PM

#

yeahh

solid tangle Sep 10, 2024, 5:08 PM

#

hello

#

need a guide on how to create a neural network from scratch

small wedge Sep 10, 2024, 5:09 PM

#

do you have any ML experience or knowledge prior to this?

solid tangle Sep 10, 2024, 5:09 PM

#

nah

elder pilot Sep 10, 2024, 5:10 PM

#

Hi guys can I get an AI roadmap recommendation

solid tangle Sep 10, 2024, 5:10 PM

#

small wedge do you have any ML experience or knowledge prior to this?

if im being honest i just need the steps lol

#

i rlly dont need to create it i just want to write about it

#

but i sorta want to understand it

small wedge Sep 10, 2024, 5:10 PM

#

https://towardsdatascience.com/mnist-handwritten-digits-classification-from-scratch-using-python-numpy-b08e401c4dab

solid tangle Sep 10, 2024, 5:11 PM

#

ok lemme give it a read ty

small wedge Sep 10, 2024, 5:12 PM

#

elder pilot Hi guys can I get an AI roadmap recommendation

https://www.3blue1brown.com/topics/neural-networks watch the first 4 videos here at least then pick a course you like

Here are 2 that are popular
https://see.stanford.edu/Course/CS229
https://developers.google.com/machine-learning/crash-course/

solid tangle Sep 10, 2024, 5:12 PM

#

ok tyy

rich moth Sep 10, 2024, 8:30 PM

#

Just finished an evaluation step on my model. I had to make a bunch of changes to get it working still got some tweaking todo probably. Ill let it run for a bit then we can see some results.

rich moth Sep 10, 2024, 8:56 PM

#

Honestly, for the first reconstruction this is one of the best ive seen.

shadow viper Sep 10, 2024, 9:46 PM

#

good day everyone, i'm not familiar with GPUs so i want to ask since i want to make use of google colab to train a model thats based on vision transformer from scrarch.
using the google colab T4 GPU or the google colab TPU v2-8
which one would you advice to train the vision transformer?

serene scaffold Sep 10, 2024, 9:47 PM

#

shadow viper good day everyone, i'm not familiar with GPUs so i want to ask since i want to m...

If you don't know why you want to use a tensor processing unit (TPU), just use the GPU.

shadow viper Sep 10, 2024, 9:49 PM

#

serene scaffold If you don't know why you want to use a tensor processing unit (TPU), just use t...

thanks man

faint quail Sep 10, 2024, 10:11 PM

#

Dopamine

shadow viper Sep 10, 2024, 10:57 PM

#

serene scaffold If you don't know why you want to use a tensor processing unit (TPU), just use t...

OMGGG.... I'm currently training with the GPU T4 and I'm not even gonna lie, its so awesome.
i use my laptop CPU(16 gb ram, core i7 and 3.0ghz) to train it normally before but i will stop every other tasks just because I'm scared my system doesn't blow up or crash. but now, omg, its as if I'm doing nothing. i cant even hear my laptop fan make any sound, i can literally type freely without any lag. and its fasttttt!!!!!!!!!!!!!!!!!!!!!!!

i'm so saving up for a real time GPU

unkempt wigeon Sep 10, 2024, 11:14 PM

#

What should I use for a kernel for a converted image matrix my apologies

serene scaffold Sep 10, 2024, 11:18 PM

#

shadow viper OMGGG.... I'm currently training with the GPU T4 and I'm not even gonna lie, its...

the computation is happening on a google server rack somewhere, so you shouldn't notice a resource spike on your laptop.
if you buy a computer with a GPU, and you do machine learning on that GPU, you probably will hear the fans go up, and you might not be able to do other things on your computer while it's training.

shadow viper Sep 10, 2024, 11:21 PM

#

serene scaffold the computation is happening on a google server rack somewhere, so you shouldn't...

yh, thats another fact.
quick question, my laptop has a GPU but im unable to train tensorflow on it so i use my CPU instead. now here comes the question. say i get an external GPU, like the big Nvidia RTX and the likes, will i have any issue with the training?

serene scaffold Sep 10, 2024, 11:23 PM

#

shadow viper yh, thats another fact. quick question, my laptop has a GPU but im unable to tra...

You just need a GPU that supports CUDA, which is pretty much exactly NVIDIA GPUs. But the deep learning that people have been doing for the last two-or-so years can't effectively be done on gaming-tier GPUs.

#

your money is probably better spent renting cloud compute.

shadow viper Sep 10, 2024, 11:27 PM

#

serene scaffold your money is probably better spent renting cloud compute.

hey, i just finished epoch 1, and in less than 5 mins(when i left here after my question and now that im back typing this), im done with epoch two. omg, this is so beautiful. i'm so happy about this, im literally sad right now i might cry, because this project has made me gone through hell

serene scaffold Sep 10, 2024, 11:29 PM

#

shadow viper hey, i just finished epoch 1, and in less than 5 mins(when i left here after my ...

sounds like you're experiencing a lot right now.

shadow viper Sep 10, 2024, 11:31 PM

#

serene scaffold sounds like you're experiencing a lot right now.

its just so beautiful man. been trying to balance school with this project. but with this, its a game changer.

shadow viper Sep 10, 2024, 11:34 PM

#

serene scaffold If you don't know why you want to use a tensor processing unit (TPU), just use t...

thanks for this reply Stelercus, truly helpful, i'm grateful

serene scaffold Sep 10, 2024, 11:44 PM

#

shadow viper thanks for this reply Stelercus, truly helpful, i'm grateful

I don't think I was especially helpful, but I hope you'll remember this moment the next time you feel like a challenge is insurmountable.

shadow viper Sep 10, 2024, 11:45 PM

#

serene scaffold I don't think I was especially helpful, but I hope you'll remember this moment t...

lemon_sentimental trust me, I will

serene scaffold Sep 10, 2024, 11:47 PM

#

shadow viper <:lemon_sentimental:754441881743786104><:lemon_sentimental:754441881743786104> ...

is that supposed to be a tear of joy?
you can use lemon_sentimental

shadow viper Sep 10, 2024, 11:49 PM

#

serene scaffold is that supposed to be a tear of joy? you can use <:lemon_sentimental:7544418817...

you're always of help. Thank you

upper patio Sep 10, 2024, 11:50 PM

#

Any opinions on groq ? Im trying to use it in my saas but not quite sure if that would be the best

slate raven Sep 11, 2024, 7:54 AM

#

Computervision: I cannot open 2 camera's at the same time.
Everything worked fine on my windows 11 laptop, then I transferred all my code to my linux / ubuntu. When I only open one camera with cv2.Videocapture(0) it works fine. All my different cameras work fine with index 0. But when I plug in 2 cameras and try Videocapture(0) and videocapture(1) at the same time i get that error message:

[ WARN:0@0.008] global cap_v4l.cpp:999 open VIDEOIO(V4L2:/dev/video1): can't open camera by index [ERROR:0@0.408] global obsensor_uvc_stream_channel.cpp:158 getStreamChannelGroup Camera index out of range Error: Failed to capture image.

I also tried index 2, 3 and 4, and it gives me the same error, while there are 3 cameras plugged in my laptop
Btw google and chatgpt weren't of any help.

Thank you in advance for your help :)

faint quail Sep 11, 2024, 8:08 AM

#

odd stratus when im training my a.i. it isnt outputting quality answers im training it to pr...

that likely means you're model isnt complex enough to fit your problem, try a larger model, also try other optimizations like Adam, or RMSProp

quaint rivet Sep 11, 2024, 11:44 AM

#

i have written an unet model for image segementation. When i run my model. I'm getting loss as nan. I don't know why i'm getting it nan

#

i even have checked my input value

severe inlet Sep 11, 2024, 12:08 PM

#

im working on a data science project on colab with some friends. one of our datasets is a 9gb csv file. is there anyway to import/load it into colab to work on it as a dataframe? or how should i go about working with this massive file?

jaunty helm Sep 11, 2024, 12:30 PM

#

severe inlet im working on a data science project on colab with some friends. one of our data...

maybe read it in chunks

severe inlet Sep 11, 2024, 1:24 PM

#

sorry how do i read it in in chunks if i need to have it uploaded somewhere first..?

wooden sail Sep 11, 2024, 1:56 PM

#

you can load the file into your google drive and mount the drive in colab

#

though for a file that size, you may or may not need a paid tier of either google drive or colab

#

if the data is obtained from some website/API, you'd have to process it as you obtain it

severe inlet Sep 11, 2024, 2:41 PM

#

wooden sail if the data is obtained from some website/API, you'd have to process it as you o...

what do u mean by process it as i obtain ?

#

clean it as it imports?

wooden sail Sep 11, 2024, 2:44 PM

#

yeah, process it in chunks

untold fable Sep 11, 2024, 3:16 PM

#

Hey guys do have any ideas

#

How to use machine learning for iot project

#

Or real time projects

small wedge Sep 11, 2024, 4:02 PM

#

That's a cool one

#

I saw an old project that predicted poses through walls using wifi signal data

#

https://youtu.be/kBFMsY5ZP0o?si=XXohU0PTKsUP1lw_

past bramble Sep 11, 2024, 4:18 PM

#

I wanna make one now

#

other than VAE and GAN, what do we have for image generation?

small wedge Sep 11, 2024, 5:00 PM

#

past bramble other than VAE and GAN, what do we have for image generation?

https://www.researchgate.net/publication/359177684_Image_Generation_A_Review there are a handful of techniques that don't involve an autoencoder or a GAN like variational u-nets, but you can see like 95% of the image generation research follows some variation of an AE or GAN

unkempt apex Sep 11, 2024, 5:22 PM

#

quaint rivet i have written an unet model for image segementation. When i run my model. I'm g...

explain more with code sir!

unkempt apex Sep 11, 2024, 5:23 PM

#

untold fable How to use machine learning for iot project

There are bunch of projects online

quaint rivet Sep 11, 2024, 5:24 PM

#

unkempt apex explain more with code sir!

I think i have issues with my data. When i used some kaggle dataset. It was working. But when i used my dataset. It wasn't

unkempt apex Sep 11, 2024, 5:25 PM

#

see, then maybe print first 5 rows from your dataset

quaint rivet Sep 11, 2024, 5:25 PM

#

Strange thing is that I'm getting loss value as nan

quaint rivet Sep 11, 2024, 5:25 PM

#

unkempt apex see, then maybe print first 5 rows from your dataset

Yeah mask value contains only 0 and 255

unkempt apex Sep 11, 2024, 5:25 PM

#

quaint rivet Strange thing is that I'm getting loss value as nan

then as simple, it's how you are calculating and on what thing you are calculating

quaint rivet Sep 11, 2024, 5:26 PM

#

I still not able to figure. Where this is causing nan

unkempt apex Sep 11, 2024, 5:27 PM

#

then share some info, so others can also take a look at that

quaint rivet Sep 11, 2024, 5:27 PM

#

Ok

quaint rivet Sep 11, 2024, 5:27 PM

#

unkempt apex then share some info, so others can also take a look at that

What sorts of things i can share with u?

unkempt apex Sep 11, 2024, 5:27 PM

#

I said already, first 5 rows from dataset, and maybe code on how you are calculating loss

quaint rivet Sep 11, 2024, 5:29 PM

#

unkempt apex I said already, first 5 rows from dataset, and maybe code on how you are calcula...

Ok

past bramble Sep 11, 2024, 5:31 PM

#

small wedge <https://www.researchgate.net/publication/359177684_Image_Generation_A_Review> t...

I wanted to make my scenary image generator more realistic. I used GAN so I was looking for better ways. If that's the best way I wonder if changing my neural network structure would help. My second try involved adding more layers, results were even worse

quaint rivet Sep 11, 2024, 5:34 PM

#

unkempt apex I said already, first 5 rows from dataset, and maybe code on how you are calcula...

original image array```
0 116.743820 129.932129 140.204529 123.849365 110.228264 104.687317
1 118.085236 128.580093 133.103256 111.298531 115.019913 110.951637
2 99.976089 112.731461 117.565979 125.454437 116.122879 115.366837
3 117.441841 130.380569 128.740417 114.199303 128.042313 137.160263
4 140.550476 141.988953 121.252663 107.138397 132.045837 136.520050

#

mask image```
0 0 0 0 0 0 0 0 0 0 0 ... 0 0 0
1 0 0 0 0 0 0 0 0 0 0 ... 0 0 0
2 0 0 0 0 0 0 0 0 0 0 ... 0 0 0
3 0 0 0 0 0 0 0 0 0 0 ... 0 0 0
4 0 0 0 0 0 0 0 0 0 0 ... 0 0 0

#

root_train_dir = "D:\\feature-extraction\\assets\\train"
root_test_dir = "D:\\feature-extraction\\assets\\test"

train_x = glob.glob(root_train_dir+"\\images\\" + "*.npy")
train_y = glob.glob(root_train_dir+"\\masks\\" + "*.npy")

test_x = glob.glob(root_test_dir+"\\images\\" + "*.npy")
test_y = glob.glob(root_test_dir+"\\masks\\" + "*.npy")


def load_data(x, y):
    X = np.array([np.load(i) for i in x])
    Y = np.array([np.load(j) for j in y])


    return X, Y

callbacks=[
    EarlyStopping(monitor='val_loss', patience=5, restore_best_weights=True),
    
]

X_train , Y_train = load_data(train_x, train_y)
X_test , Y_test = load_data(test_x, test_y)
print(X_train.shape, Y_train.shape)


history = model.fit(X_train, Y_train, validation_data=(X_test, Y_test),epochs = 10, batch_size=8, callbacks=callbacks)

unkempt apex Sep 11, 2024, 5:35 PM

#

your dataset are images?

quaint rivet Sep 11, 2024, 5:35 PM

#

yeah

#

ofc i have unet model

#

That's why it's hard to find error

unkempt wigeon Sep 11, 2024, 6:43 PM

#

untold fable How to use machine learning for iot project

What were you thinking my apologies

verbal oar Sep 11, 2024, 6:44 PM

#

hmm looks like you have somewhere division by zero?

#

divide by zero will result in NaN

wooden sail Sep 11, 2024, 7:19 PM

#

the hyperparams of the model/fitting method might also be set incorrectly, causing the model parameters to blow up

rich moth Sep 11, 2024, 7:43 PM

#

quaint rivet yeah

maybe use np.isnan to print X and Y to see if it contains any NaN values

unkempt wigeon Sep 11, 2024, 8:20 PM

#

I have a question do I need to make a lot of pathways for a neuron to teach a neural network sorry because I got pillow installed in my network so I can put any image and turn it into an array my apologies

rich moth Sep 11, 2024, 11:08 PM

#

unkempt wigeon I have a question do I need to make a lot of pathways for a neuron to teach a ne...

check out convolutionnal layers

unkempt wigeon Sep 11, 2024, 11:11 PM

#

rich moth check out convolutionnal layers

It's in the same file as the python cold I'm trying the final destination Plus the image I'm sorry

rich moth Sep 11, 2024, 11:16 PM

#

Anyone ever play around with OpenAI Gym? I want to test my AI logic in unique enviorments. I made a CTF game using pygame but I wanted to try something different.

unkempt wigeon Sep 11, 2024, 11:18 PM

#

rich moth check out convolutionnal layers

It is that referring to a YouTuber sorry

unkempt wigeon Sep 11, 2024, 11:19 PM

#

rich moth Anyone ever play around with OpenAI Gym? I want to test my AI logic in unique e...

No but you can possibly make your own environment using pie game just find some sprites it's not the same but you could add some grabbing logic to objects to make it be able to move blocks to walk entrances from enemies etc sorry

#

@rich moth I'm sorry

#

#===(imports)===#
from PIL import Image
import numpy as np
from matplotlib.image import imread
#==============#


image_array = imread('C:\Users\Willo\Desktop\ais\eye0.png')
array =np.array(image_array) 
X = array

print(X.shape)

verbal venture Sep 11, 2024, 11:45 PM

#

what's the difference between RAG and AI search? And is this RAG? if not what should I google to learn how to make this?

rich moth Sep 11, 2024, 11:47 PM

#

unkempt wigeon No but you can possibly make your own environment using pie game just find some ...

I made a capture the flag game using pygame for it, I just wanted to try out some other things with it

unkempt wigeon Sep 11, 2024, 11:47 PM

#

What specifically a 3D because I believe there's a function that you could use to make 2D games although I don't know too much about training I'm just trying to make a neural network at the beginning although I didn't make a training simulation for Paul I'm sorry

rich moth Sep 11, 2024, 11:50 PM

#

verbal venture what's the difference between RAG and AI search? And is this RAG? if not what sh...

You can embedded all that data into say something like elasticsearch index. Then you can build an AI pipeline around it so you can query the data.

verbal venture Sep 11, 2024, 11:51 PM

#

rich moth You can embedded all that data into say something like elasticsearch index. The...

what's the difference between that and RAG vector store emebeddings

unkempt wigeon Sep 11, 2024, 11:55 PM

#

What do I need for the convolution to get it so that the image can be turned into an array I'm sorry

carmine cairn Sep 11, 2024, 11:58 PM

#

Hey, I would like to get address, number and web site data from my saved places in google maps the saved as a .csv file. How can I do that without Google API? (Ex places list: https://maps.app.goo.gl/bsxbhgW9zvXzSa8n9)

unkempt wigeon Sep 12, 2024, 12:02 AM

#

unkempt wigeon ```py #===(imports)===# from PIL import Image import numpy as np from matplotlib...

What do you think I may need to do because I did everything to open up the image and turn it into an array which then I can add weights to plus the bias my apologies

rich moth Sep 12, 2024, 12:03 AM

#

verbal venture what's the difference between that and RAG vector store emebeddings

its just a little more advanced , but really any vector DB. but if you want to build one I would do it with haystack and elasticsearch.

verbal venture Sep 12, 2024, 12:03 AM

#

rich moth its just a little more advanced , but really any vector DB. but if you want to...

Okay that’s sota? And what’s haystack?

#

That’s the vector db?

rich moth Sep 12, 2024, 12:04 AM

#

haystack is for building the rag pipeline

#

elasticsearch is to store the embedded data

verbal venture Sep 12, 2024, 12:13 AM

#

rich moth elasticsearch is to store the embedded data

okay and just wondering how would citations be done?

#

is that through the prompt (return me the citations) or is that through indexing metadata (done through code/software engineering)?

rich moth Sep 12, 2024, 12:20 AM

#

verbal venture okay and just wondering how would citations be done?

ya citation in RAG pipelines can be done through indexing the metadata.

#

I built one, but the UI is minimal and it looks like crap

verbal venture Sep 12, 2024, 12:22 AM

#

can you link me the code?

#

and biggest problem with RAG rn is hallucination concerns yeah?

unkempt wigeon Sep 12, 2024, 12:22 AM

#

Found out why the image wasn't showing its shape forgot 2 back slash

rich moth Sep 12, 2024, 12:23 AM

#

honestly, mine doesnt hallucinate. believe it or not.

unkempt wigeon Sep 12, 2024, 12:25 AM

#

So how many areas for weights should I have sorry because it does say three color channels but there's Image size (194,259)

unkempt wigeon Sep 12, 2024, 12:31 AM

#

rich moth I made a capture the flag game using pygame for it, I just wanted to try out som...

Nice

unkempt wigeon Sep 12, 2024, 12:31 AM

#

rich moth check out convolutionnal layers

Thank you

unkempt wigeon Sep 12, 2024, 12:49 AM

#

What should I use for the kernel?

rich moth Sep 12, 2024, 12:52 AM

#

unkempt wigeon What should I use for the kernel?

Not sure what you're asking.

unkempt wigeon Sep 12, 2024, 12:54 AM

#

rich moth Not sure what you're asking.

To sign over the image to help it come to the decision what it is now I'm trying to recreate an experiment I heard of an AI that was showing images of may have had or won't have hurt problems in the future and it reliably told the biological sex trying to create that and for a convolutional neural networks to take images and recognize them you have to have a colonel that goes over the image sliding past on the array my apologies

rich moth Sep 12, 2024, 1:04 AM

#

oh i see are you talking about transforms.Compose ?

#

maybe google that

unkempt wigeon Sep 12, 2024, 1:04 AM

#

Yes I'm sorry

unkempt wigeon Sep 12, 2024, 1:06 AM

#

rich moth maybe google that

https://youtu.be/Lakz2MoHy6o?feature=shared

YouTube

The Independent Code

Convolutional Neural Network from Scratch | Mathematics & Python Code

In this video we'll create a Convolutional Neural Network (or CNN), from scratch in Python. We'll go fully through the mathematics of that layer and then implement it. We'll also implement the Reshape Layer, the Binary Cross Entropy Loss, and the Sigmoid Activation. Finally, we'll use all these objects to make a neural network capable of classif...

▶ Play video

rich moth Sep 12, 2024, 1:07 AM

#

its part of torchvision, transforms

unkempt wigeon Sep 12, 2024, 1:09 AM

#

rich moth its part of torchvision, transforms

?

#

Any videos that may have any use my apologies

serene scaffold Sep 12, 2024, 1:17 AM

#

verbal venture what's the difference between RAG and AI search? And is this RAG? if not what sh...

do you know what RAG is? if you do, can you explain what it is according to your understanding?

verbal venture Sep 12, 2024, 1:18 AM

#

serene scaffold do you know what RAG is? if you do, can you explain what it is according to your...

yeah I know what rag is but i'm wondering if a product like this was built without rag

#

basically asking if there's more methods to local-document AI search than RAG

verbal venture Sep 12, 2024, 1:18 AM

#

rich moth I built one, but the UI is minimal and it looks like crap

if you got the code on git can you link?

serene scaffold Sep 12, 2024, 1:19 AM

#

verbal venture basically asking if there's more methods to local-document AI search than RAG

RAG has to have a document retrieval component, but RAG in itself is not a document retrieval component.

#

If someone says "oh we need some way to search for documents", and someone else says "ok let's use RAG", that doesn't solve the problem. you need to already know how you can retrieve documents in order to create a RAG system.

verbal venture Sep 12, 2024, 1:20 AM

#

yeah you're saying the retrieval is bm25/KNN vector search

verbal venture Sep 12, 2024, 1:20 AM

#

serene scaffold If someone says "oh we need some way to search for documents", and someone else ...

ok I'm asking if there's other conversational search types besides rag

serene scaffold Sep 12, 2024, 1:21 AM

#

that are conversational? Not that I know of.

#

even if someone claimed that there were, I'd want to understand how it works before I agree that it's not RAG.

verbal venture Sep 12, 2024, 1:22 AM

#

so you're saying RAG is the only solution to things like perplexity rn

serene scaffold Sep 12, 2024, 1:25 AM

#

verbal venture so you're saying RAG is the only solution to things like perplexity rn

I didn't say anything like that.
what is "perplexity", in this context?

verbal venture Sep 12, 2024, 1:25 AM

#

ah the search engine

#

perplexity.ai

serene scaffold Sep 12, 2024, 1:26 AM

#

ah

verbal venture Sep 12, 2024, 1:31 AM

#

yeah have you ever tried it?

#

I think the way it works is they return the google search API results then summarize the answers through prompt engineering and cite their sources

#

seems kinda easy technically? 2B valuation

unkempt wigeon Sep 12, 2024, 1:51 AM

#

Does anyone know how to make a efficient kernel using numpy my apologies

rich moth Sep 12, 2024, 2:14 AM

#

verbal venture if you got the code on git can you link?

I havent built a github page for yet.

rich moth Sep 12, 2024, 2:16 AM

#

verbal venture I think the way it works is they return the google search API results then summa...

Thats what mine looks like

unkempt wigeon Sep 12, 2024, 2:27 AM

#

What is the best way of getting Data for the neural network to do its job sorry

serene scaffold Sep 12, 2024, 2:28 AM

#

unkempt wigeon What is the best way of getting Data for the neural network to do its job sorry

this question is too abstract to be answered.

unkempt wigeon Sep 12, 2024, 2:30 AM

#

What I mean is sliding it all across the image and getting the values to put into a relu function for each individual value sure it will slow it down but it might learn to go to the next layer and then the next layer and then the next layer and it will tell me what it is I know it's an over simplification I'm trying to explain it to not be abstract my apologies

rich moth Sep 12, 2024, 2:36 AM

#

unkempt wigeon What I mean is sliding it all across the image and getting the values to put int...

maybe check out. https://www.youtube.com/watch?v=vT1JzLTH4G4

YouTube

Stanford University School of Engineering

Lecture 1 | Introduction to Convolutional Neural Networks for Visua...

Lecture 1 gives an introduction to the field of computer vision, discussing its history and key challenges. We emphasize that computer vision encompasses a wide variety of different tasks, and that despite the recent successes of deep learning we are still a long way from realizing the goal of human-level visual intelligence.

Keywords: Computer...

▶ Play video

unkempt wigeon Sep 12, 2024, 2:37 AM

#

Thank you

rich moth Sep 12, 2024, 2:37 AM

#

np

verbal venture Sep 12, 2024, 3:03 AM

#

rich moth Thats what mine looks like

V dense can you send the code

untold fable Sep 12, 2024, 3:38 AM

#

helllo

past bramble Sep 12, 2024, 7:55 AM

#

63.3s    12    WARNING: All log messages before absl::InitializeLog() is called are written to STDERR
63.3s    13    I0000 00:00:1726126970.029600      62 service.cc:145] XLA service 0x7e5a04003a40 initialized for platform CUDA (this does not guarantee that XLA will be used). Devices:
63.3s    14    I0000 00:00:1726126970.029656      62 service.cc:153]   StreamExecutor device (0): Tesla P100-PCIE-16GB, Compute Capability 6.0
63.5s    15    WARNING: All log messages before absl::InitializeLog() is called are written to STDERR
63.5s    16    I0000 00:00:1726126970.029600      62 service.cc:145] XLA service 0x7e5a04003a40 initialized for platform CUDA (this does not guarantee that XLA will be used). Devices:
63.5s    17    I0000 00:00:1726126970.029656      62 service.cc:153]   StreamExecutor device (0): Tesla P100-PCIE-16GB, Compute Capability 6.0
64.8s    18    I0000 00:00:1726126971.486784      62 device_compiler.h:188] Compiled cluster using XLA!  This line is logged at most once for the lifetime of the process.
65.0s    19    I0000 00:00:1726126971.486784      62 device_compiler.h:188] Compiled cluster using XLA!  This line is logged at most once for the lifetime of the process.

#

79.4s    20    2024-09-12 07:43:06.118002: E tensorflow/core/grappler/optimizers/meta_optimizer.cc:961] layout failed: INVALID_ARGUMENT: Size of values 0 does not match size of permutation 4 @ fanin shape infunctional_1_1/dropout_1/stateless_dropout/SelectV2-2-TransposeNHWCToNCHW-LayoutOptimizer
79.6s    21    2024-09-12 07:43:06.118002: E tensorflow/core/grappler/optimizers/meta_optimizer.cc:961] layout failed: INVALID_ARGUMENT: Size of values 0 does not match size of permutation 4 @ fanin shape infunctional_1_1/dropout_1/stateless_dropout/SelectV2-2-TransposeNHWCToNCHW-LayoutOptimizer
145.9s    22    2024-09-12 07:44:12.611558: E tensorflow/core/grappler/optimizers/meta_optimizer.cc:961] layout failed: INVALID_ARGUMENT: Size of values 0 does not match size of permutation 4 @ fanin shape infunctional_1_1/dropout_1/stateless_dropout/SelectV2-2-TransposeNHWCToNCHW-LayoutOptimizer
146.1s    23    2024-09-12 07:44:12.611558: E tensorflow/core/grappler/optimizers/meta_optimizer.cc:961] layout failed: INVALID_ARGUMENT: Size of values 0 does not match size of permutation 4 @ fanin shape infunctional_1_1/dropout_1/stateless_dropout/SelectV2-2-TransposeNHWCToNCHW-LayoutOptimizer

#

not the first time I get these warnings/messages. I want to know what the reason is

unkempt apex Sep 12, 2024, 9:18 AM

#

bruhh why tf ??

odd stratus Sep 12, 2024, 9:52 AM

#

im pretty happy with the results i got
the a.i. managed to write out this sentence in perfect order
it learnt to write and spell letter by letter

past bramble Sep 12, 2024, 10:11 AM

#

odd stratus im pretty happy with the results i got the a.i. managed to write out this senten...

i think it knows that quick brown fox jumps over lazy dog

odd stratus Sep 12, 2024, 10:11 AM

#

past bramble i think it knows that quick brown fox jumps over lazy dog

pithink perhapsss

past bramble Sep 12, 2024, 10:12 AM

#

odd stratus <:pithink:652247559909277706> perhapsss

what else does it know

unkempt apex Sep 12, 2024, 10:12 AM

#

RuntimeError: Trying to backward through the graph a second time (or directly access saved tensors after they have already been freed). Saved intermediate values of the graph are freed when you call .backward() or autograd.grad(). Specify retain_graph=True if you need to backward through the graph a second time or if you need to access saved tensors after calling backward.

past bramble Sep 12, 2024, 10:12 AM

#

pithink

unkempt apex Sep 12, 2024, 10:12 AM

#

anyone know about this??

#

loss.backward(retain_graph=True)

I tried this option also

#

but still error

#

training with batch_size = 32

past bramble Sep 12, 2024, 10:17 AM

#

any resources to learn about training steps in neural network, forward propagation, backward propagation, loss and gradient in detail?

unkempt apex Sep 12, 2024, 10:18 AM

#

past bramble any resources to learn about training steps in neural network, forward propagat...

bruhh, you know that kid 17 year old who has made 5 hour video on maths for DL

#

https://youtu.be/Ixl3nykKG9M?feature=shared

YouTube

Adam Dhalla

The Complete Mathematics of Neural Networks and Deep Learning

A complete guide to the mathematics behind neural networks and backpropagation.

In this lecture, I aim to explain the mathematical phenomena, a combination of linear algebra and optimization, that underlie the most important algorithm in data science today: the feed forward neural network.

Through a plethora of examples, geometrical intuitio...

▶ Play video

#

this is lit.....

past bramble Sep 12, 2024, 10:21 AM

#

nice i wanted in more detail

unkempt apex Sep 12, 2024, 10:22 AM

#

past bramble nice i wanted in more detail

more detail? still>?

past bramble Sep 12, 2024, 10:23 AM

#

unkempt apex more detail? still>?

nop sarcasm, thanks!

odd stratus Sep 12, 2024, 10:29 AM

#

past bramble what else does it know

trying to teach it the bee movie script next lmao

unkempt wigeon Sep 12, 2024, 11:01 AM

#

Anyone here who's create a (CNN) because I can use some help with the creation of colonels my apologies

unkempt apex Sep 12, 2024, 12:30 PM

#

unkempt wigeon Anyone here who's create a (CNN) because I can use some help with the creation o...

wdym?

#

colonels? | who's create?

#

u helping CNN maker , or u wanna see that

quaint rivet Sep 12, 2024, 1:30 PM

#

rich moth maybe use np.isnan to print X and Y to see if it contains any NaN values

i have tried and i didn't get any nan value

desert oar Sep 12, 2024, 3:41 PM

#

unkempt wigeon Anyone here who's create a (CNN) because I can use some help with the creation o...

do you mean "kernels"?

river cape Sep 12, 2024, 3:58 PM

#

Hello guys , so I have this project idea of building an ai model which takes the map of building , example lets say a mall, and it should give me the directions for a particular store in the mall

#

Like for example , I want to visit the nike store in the mall

#

It should the directions to that store from any where inside the mall

#

You could say like its a mini version of Google Maps

#

So if someone could give some ideas , as how to proceed?

odd stratus Sep 12, 2024, 4:07 PM

#

river cape So if someone could give some ideas , as how to proceed?

you need to have an a.i. program that you can run
if you want to use preexisting infrastructure theres a lot of libraries e.g. tensorflow etc.
or make one yourself

then you need to design the layers and layer sizes, then you need to turn your task into a well defined set of outputs
then you need to take in data in a well defined way such that it can be mapped onto the output data

e.g. x+10 = y
input x output y

#

then once you have a lot of training data, test and train the a.i. until you get results you want

desert oar Sep 12, 2024, 4:32 PM

#

river cape Hello guys , so I have this project idea of building an ai model which takes the...

you might want to look up "path finding" algorithms -- this is a classic AI thing that long predates (and generally doesn't require) deep learning

river cape Sep 12, 2024, 4:45 PM

#

desert oar you might want to look up "path finding" algorithms -- this is a classic AI thin...

An ai model which selects the best nearest alogrithm? like Djistra's?

wooden sail Sep 12, 2024, 4:49 PM

#

river cape An ai model which selects the best nearest alogrithm? like Djistra's?

wdym by "best" here?

#

i think dijkstra is optimal regarding complexity for the most general path finding problem

spare forum Sep 12, 2024, 5:05 PM

#

Everything doesn't need so called "AI"

#

oopsies 🙂

rich moth Sep 12, 2024, 7:04 PM

#

man making the game is harder than the AI part.

desert oar Sep 12, 2024, 7:16 PM

#

river cape An ai model which selects the best nearest alogrithm? like Djistra's?

No, the path-finding itself is the AI here

small wedge Sep 12, 2024, 7:26 PM

#

rich moth man making the game is harder than the AI part.

Always lmao

#

And balancing rewards to actually get your agents playing your game instead of finding a niche and exploiting it is just as hard as making the agent too

unkempt wigeon Sep 12, 2024, 7:42 PM

#

desert oar do you mean "kernels"?

Yes

unkempt wigeon Sep 12, 2024, 7:47 PM

#

rich moth man making the game is harder than the AI part.

That's amazing how do you have a reward system so I can add that to my convolutional neural network my apologies

serene scaffold Sep 12, 2024, 7:53 PM

#

unkempt wigeon That's amazing how do you have a reward system so I can add that to my convoluti...

what would this reward function do for your neural network

unkempt wigeon Sep 12, 2024, 7:56 PM

#

Being a point higher than the human player

serene scaffold Sep 12, 2024, 7:56 PM

#

what game is the CNN playing

unkempt wigeon Sep 12, 2024, 7:58 PM

#

A pong because there's two simple outputs up and down but it has to know where the ball is sorry

unkempt wigeon Sep 12, 2024, 8:03 PM

#

serene scaffold what game is the CNN playing

I broke it down into something simple pong because it only has two values that you would really need one for up and zero for down

rich moth Sep 12, 2024, 8:06 PM

#

unkempt wigeon That's amazing how do you have a reward system so I can add that to my convoluti...

im using a DQN with both self and cross-attention mechanisms. i represent player states and team dynamics with vectors, and those vectors are aggregated using attention layers to create dynamic behaviors for the agents. The self-attention focuses on individual agent features, while cross-attention helps coordinate actions based on interactions with teammates and opponents.

rich moth Sep 12, 2024, 8:07 PM

#

unkempt wigeon That's amazing how do you have a reward system so I can add that to my convoluti...

are you talking about a reward system for a CNN?

unkempt wigeon Sep 12, 2024, 8:08 PM

#

Well I was thinking that but it's probably not what's needed for a CNN so I might try training one on games first because I can build any game that I want and I can have it trained on the data that's found so I can get a better idea a feel for how to train them in the future should be a reward based or shipping just how it figures it out by itself my apologies

unkempt wigeon Sep 12, 2024, 8:13 PM

#

rich moth are you talking about a reward system for a CNN?

Can a CNN do that because I know some neural networks need training with reinforcement learning sorry

rich moth Sep 12, 2024, 8:14 PM

#

like a CNN-DQN?

unkempt wigeon Sep 12, 2024, 8:15 PM

#

DQN?

rich moth Sep 12, 2024, 8:15 PM

#

deep q-network

unkempt wigeon Sep 12, 2024, 8:20 PM

#

Yes a deep learning network

rich moth Sep 12, 2024, 8:21 PM

#

What do you want to do with it? Whats your end goal?

unkempt wigeon Sep 12, 2024, 8:27 PM

#

rich moth What do you want to do with it? Whats your end goal?

Teach a deep neral network to do anything but to start maybe games sorry

unkempt wigeon Sep 12, 2024, 8:35 PM

#

rich moth What do you want to do with it? Whats your end goal?

Well if I can't get convolution together maybe some type of deep reinforced game playing Network that I could train to do multiple different games old school and new school sorry

#

I only have the training site made I just need to figure out how to make the network I don't know if that needs to be a CNN or can it just be a regular not working that's been put into deep learning my apologies

rich moth Sep 12, 2024, 8:49 PM

#

So I made a simple DQN with a CNN . It actually works pretty damn well lol