violet gull Aug 13, 2024, 9:03 AM

#

youtube or looking up sources

warm iron Aug 13, 2024, 10:22 AM

#

Hi! is it okay to talk about Machine Learning here?

scarlet owl Aug 13, 2024, 10:22 AM

#

warm iron Hi! is it okay to talk about Machine Learning here?

Yep

#

read channel description

warm iron Aug 13, 2024, 10:26 AM

#

has anyone ever had experience working with raw ECG data?

proper crag Aug 13, 2024, 10:49 AM

#

@lyric furnace mate, just keep learning python: understand when you can use loops, if/else, aggregation operations, etc., and even which libraries python has. just trust me, i've seen ml code that uses for loops and still runs to something like 200+ lines even after using libraries

lapis sequoia Aug 13, 2024, 11:00 AM

#

minimal plot of different activations vs accuracy

(mnist, 2 layer perceptron.)

#

.

lapis sequoia Aug 13, 2024, 11:22 AM

#

lapis sequoia .

sin is actually surprisingly good btw

#

so i read up and duckdb is optimized for analytical queries on giant databases whereas sqlite is optimized for single writes

#

which is why duckdb is so slow for logging

#

but nested dicts are still 10 times faster, though I haven't figured out (and thus haven't tested) how to do threaded or async database writes

#

when they remove gil it will be much easier
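A rough sketch of one way to do threaded writes, under assumptions: the calling loop only puts rows on a queue, and a single background thread owns the SQLite connection and does the inserts. The table name `log`, its columns, and the helper names `writer_loop` and `log_metrics` are made up for illustration. It is not a benchmark and says nothing about how this compares with nested dicts or duckdb.

```python
import queue
import sqlite3
import threading

_STOP = object()  # sentinel that tells the writer thread to finish


def writer_loop(db_path, jobs):
    """Drain the queue and write rows from a single dedicated thread."""
    conn = sqlite3.connect(db_path)  # the connection lives only in this thread
    conn.execute("CREATE TABLE IF NOT EXISTS log (step INTEGER, name TEXT, value REAL)")
    while True:
        item = jobs.get()
        if item is _STOP:
            break
        conn.execute("INSERT INTO log VALUES (?, ?, ?)", item)
    conn.commit()  # a single commit at the end avoids per-row commit overhead
    conn.close()


def log_metrics(db_path, rows):
    """Queue rows from the caller's thread; the writer thread does the I/O."""
    jobs = queue.Queue()
    worker = threading.Thread(target=writer_loop, args=(db_path, jobs))
    worker.start()
    for row in rows:
        jobs.put(row)  # no database work happens in the calling thread
    jobs.put(_STOP)
    worker.join()
```

In a real training loop the queue and thread would be created once and kept alive, with `jobs.put(...)` called at each logging step.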

indigo wing Aug 13, 2024, 11:27 AM

#

Hey, does anyone know about semantic search engines?

toxic mortar Aug 13, 2024, 11:36 AM

#

What are your go-to methods to evaluate your classification model's performance on a huge unseen dataset?

lapis sequoia Aug 13, 2024, 11:45 AM

#

lapis sequoia sin is actually surprisingly good btw

do you mean sine function? didn't get it

lapis sequoia Aug 13, 2024, 11:45 AM

#

lapis sequoia do you mean sine function? didn't get it

yeah

#

interesting, i haven't seen it used, and read a few posts yesterday that didn't seem too positive

#

like basically saying the useful part reduces to tanh

#

the periodicity didn't really help if i understood correctly

#

i had the same results as relu with sine in segmentation but maybe its making some tasks harder

#

another interesting function is modulus (abs(output)), haven't tried it

#

interesting

#

also there is a learnable piecewise function https://github.com/PiotrDabkowski/torchpwl

GitHub

GitHub - PiotrDabkowski/torchpwl: Piecewise Linear Functions (PWL) ...

Piecewise Linear Functions (PWL) implementation in PyTorch - PiotrDabkowski/torchpwl

#

i've got this paper in the pipeline https://openreview.net/pdf?id=Sks3zF9eg
talks about that

#

also I tried an activation function that takes maximum among first half of the channels and second half of the channels, and minimum, and concatenates them, and it worked, although wasnt as good as relu
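A minimal sketch of the activation described above, as I read it: split the channels into two halves, pair channel i with channel i + half, and output all the pairwise maxima followed by all the pairwise minima. The function name `maxmin_activation` and the plain-list input are assumptions for illustration; a real layer would do the same thing on tensors along the channel axis.

```python
def maxmin_activation(channels):
    """Pair channel i with channel i + half; emit the maxima, then the minima."""
    if len(channels) % 2 != 0:
        raise ValueError("expected an even number of channels")
    half = len(channels) // 2
    first, second = channels[:half], channels[half:]
    maxima = [max(a, b) for a, b in zip(first, second)]
    minima = [min(a, b) for a, b in zip(first, second)]
    return maxima + minima  # same number of channels out as in
```

Each output pair is just the input pair sorted, so no value is thrown away, unlike relu, which zeroes the negatives.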

#

its crazy how you can give it any weird model and it will find how to use that model

#

by it i mean gradient descent

#

that repo is piece wise linear units right?

#

yeah

#

with any number of segments, that's somewhat new to me

#

this paper explores many of them, didn't check it yet either https://arxiv.org/pdf/1710.05941

lapis sequoia Aug 13, 2024, 12:36 PM

#

(...) While sinusoidal activation functions have been successfully used for specific applications, they remain largely ignored (...)
[we] describe how the presence of infinitely many and shallow local minima emerges from the architecture.
(...) by showing that for several network architectures the presence of the periodic cycles is largely ignored (...)
etc.

may not be the best paper though (and may be incorrect.), just one i found.

runic parcel Aug 13, 2024, 1:31 PM

#

i hear Cyc is a knowledge database, but can i use it to train my model? how can i get the code?

lapis sequoia Aug 13, 2024, 2:19 PM

#

can anyone help me with this error

serene scaffold Aug 13, 2024, 2:20 PM

#

lapis sequoia can anyone help me with this error

In the future, please always show code and other text as text. Not as a screenshot.

This error message means that your val_logs variable refers to None. Not to a dict.

lapis sequoia Aug 13, 2024, 2:21 PM

#

i think your generator may have run out of data

serene scaffold Aug 13, 2024, 2:21 PM

#

lapis sequoia i think your generator may have run out of data

who are you talking to when you say that?

lapis sequoia Aug 13, 2024, 2:22 PM

#

serene scaffold who are you talking to when you say that?

@lapis sequoia

lapis sequoia Aug 13, 2024, 2:22 PM

#

lapis sequoia i think your generator may have run out of data

how can I fix that?

serene scaffold Aug 13, 2024, 2:22 PM

#

lapis sequoia how can I fix that?

Show the whole code

#

!paste

arctic wedgeBOT Aug 13, 2024, 2:22 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

lapis sequoia Aug 13, 2024, 2:23 PM

#

you need to make sure that steps_per_epoch * epochs is no more than the number of batches, it's a common error, just read the docs for PyDataset

lapis sequoia Aug 13, 2024, 2:23 PM

#

serene scaffold Show the whole code

okay

#

the generator values are consumed once (until it runs out of data), so that's why. with tf.data you can use .repeat(), idk if there is anything like that for PyDataset.

#

import tensorflow as tf
from tensorflow.keras.preprocessing.image import ImageDataGenerator

tf.random.set_seed(42)

# Preprocess the data (rescale pixel values from the 0-255 range to 0-1)
train_datagen = ImageDataGenerator(rescale=1./255)
valid_datagen = ImageDataGenerator(rescale=1./255)

train_dir = '/content/pizza_steak/train'
test_dir = '/content/pizza_steak/test'

# Import data from directories and turn it into batches
train_data = train_datagen.flow_from_directory(directory=train_dir, batch_size=32,
                                               target_size=(224, 224), class_mode="binary", seed=42)

test_data = valid_datagen.flow_from_directory(directory=test_dir, batch_size=32,
                                              target_size=(224, 224), class_mode="binary", seed=42)

# Build a CNN model
model_1 = tf.keras.models.Sequential([
    tf.keras.layers.Conv2D(filters=10, kernel_size=3, activation='relu', input_shape=(224, 224, 3)),
    tf.keras.layers.Conv2D(10, 3, activation='relu'),
    tf.keras.layers.MaxPool2D(pool_size=2, padding='valid'),
    tf.keras.layers.Conv2D(10, 3, activation='relu'),
    tf.keras.layers.Conv2D(10, 3, activation='relu'),
    tf.keras.layers.MaxPool2D(2),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(1, activation='sigmoid')
])

# Compile our model
model_1.compile(loss='binary_crossentropy',
                optimizer=tf.keras.optimizers.Adam(),
                metrics=['accuracy'])

# Fit the model
history_1 = model_1.fit(train_data,
                        epochs=5,
                        steps_per_epoch=len(train_data),
                        validation_data=test_data,
                        validation_steps=len(test_data))

serene scaffold Aug 13, 2024, 2:24 PM

#

@lapis sequoia please read this: #data-science-and-ml message

lapis sequoia Aug 13, 2024, 2:25 PM

#

i think this could work (not fully sure.):

steps_per_epoch = (len(train_data) // batch_size) // epochs,

lapis sequoia Aug 13, 2024, 2:29 PM

#

lapis sequoia i think this could work (not fully sure.): `steps_per_epoch = (len(train_data)/...

not working

#

did you add //epochs

lapis sequoia Aug 13, 2024, 2:30 PM

#

lapis sequoia did you add `//epochs`

yes

#

i think that steps_per_epoch * batch size has to be less than the size of the training data

#

the same applies to validation steps

#

actually you could try both with the number 30 and check @lapis sequoia ?

#

otherwise you may have to ask either in a separate question in #1035199133436354600 or maybe in a forum (or wait for others to help)

lapis sequoia Aug 13, 2024, 2:39 PM

#

lapis sequoia actually you could try both with the number `30` and check @lapis sequoia...

by taking batch size of 30?

#

no sorry, by passing into steps_per_epoch a number smaller than the number of batches generated (not the batch size) @lapis sequoia
same applies to validation_steps, using a number smaller than the number of test batches generated (it should actually be less than len(data)//batch_size for each case, if i understand correctly).
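To make the arithmetic concrete, here is a toy simulation (no Keras involved) of a one-shot generator being asked for more steps than it has batches. The sample count of 1000 and the helper names `batches_available` and `run_epochs` are made up for illustration, and this only models the "generator is consumed once" case discussed above, not how every Keras data source behaves.

```python
import math


def batches_available(n_samples, batch_size):
    """Batches one pass over the data yields, counting a final partial batch."""
    return math.ceil(n_samples / batch_size)


def run_epochs(n_batches, steps_per_epoch, epochs):
    """Simulate a one-shot (non-repeating) generator; return steps done per epoch."""
    source = iter(range(n_batches))  # exhausted after n_batches items
    done = []
    for _ in range(epochs):
        steps = 0
        for _ in range(steps_per_epoch):
            if next(source, None) is None:
                break  # the point where training would report running out of data
            steps += 1
        done.append(steps)
    return done


n_batches = batches_available(1000, 32)             # hypothetical dataset size
starved = run_epochs(n_batches, n_batches, 5)       # later epochs get nothing
fits = run_epochs(n_batches, n_batches // 5, 5)     # steps * epochs <= batches
```

With these numbers only the first epoch in `starved` gets any batches, while every epoch in `fits` completes all of its steps.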

lapis sequoia Aug 13, 2024, 2:42 PM

#

lapis sequoia no sorry, by passing into `steps_per_epoch` a number smaller than the number...

thanks, it worked

#

no problem, you've got several other errors

#

@lapis sequoia can you please explain to me what the issue was with the previous code?

#

i think it expects an Input layer

#

yeah let me see if i can find a thread explaining it, i can't now

lapis sequoia Aug 13, 2024, 2:44 PM

#

lapis sequoia yeah let me see if i can find a thread explaining it, i can't now

yeah sure

#

Stack Overflow

tensorflow:Your input ran out of data

I am working on a seq2seq keras/tensorflow 2.0 model. Every time the user inputs something, my model prints the response perfectly fine. However on the last line of each response I get this:

You:

#

if you ever use tf.data.Dataset instead, it's got this option (extracted from link.):

If you're using a tf.data.Dataset, you can also add the repeat() method, but be careful: it will loop indefinitely (unless you specify a number)
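A plain-Python analogy for what the quote says about repeating, not the tf.data implementation itself: with no count the batches loop forever and the consumer has to decide when to stop, and with a count they loop exactly that many times. The helper name `repeat_batches` is made up for illustration.

```python
from itertools import chain, cycle, islice, repeat


def repeat_batches(batches, count=None):
    """Mimic the idea of repeating a dataset: forever, or `count` times if given."""
    if count is None:
        return cycle(batches)  # never ends; the consumer must stop itself
    return chain.from_iterable(repeat(batches, count))


# An endless stream is only safe when something bounds it, e.g. a step count.
first_five = list(islice(repeat_batches(["a", "b"]), 5))
twice = list(repeat_batches(["a", "b"], 2))
```

This is why an indefinitely repeating dataset is normally paired with an explicit number of steps per epoch.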

#

it can also happen when using XLA (incomplete batches break it); in that case use drop_remainder=True. not your case though.

oblique isle Aug 13, 2024, 2:54 PM

#

Guys, what's better for analyzing academic papers? Claude or GPT-4?

lapis sequoia Aug 13, 2024, 3:05 PM

#

oblique isle Guys, what's better for analyzing academic papers? Claude or GPT-4?

idk if this is the channel to ask.

#

https://community.openai.com/t/gpt4-comparison-to-anthropic-opus-on-benchmarks/726147

OpenAI Developer Forum

Gpt4 comparison to anthropic Opus on benchmarks

In a comparative assessment of Claude 3 Opus and GPT-4’s capabilities, Claude 3 Opus generally demonstrates superior performance across a spectrum of tasks that test for knowledge and reasoning abilities. Claude 3 Opus consistently outperforms GPT-4, with an especially notable advantage in complex reasoning and coding tasks, suggesting it is bet...

#

https://github.com/openai/simple-evals?tab=readme-ov-file#benchmark-results

GitHub

GitHub - openai/simple-evals

Contribute to openai/simple-evals development by creating an account on GitHub.

#

claude then, sonnet is actually higher apparently https://www.artificialintelligence-news.com/news/anthropics-claude-3-5-sonnet-beats-gpt-4o-most-benchmarks/

AI News

Anthropic's Claude 3.5 Sonnet beats GPT-4o in most benchmarks

Anthropic has launched Claude 3.5 Sonnet, its mid-tier model that outperforms competitors and even surpasses Anthropic's current top-tier Claude 3 Opus in various evaluations.

lapis sequoia Aug 13, 2024, 3:22 PM

#

lapis sequoia claude then, sonnet is actually higher apparently https://www.artificialintellig...

what do they do for benchmarks

#

no idea, maybe they say more in the original post @lapis sequoia https://www.anthropic.com/news/claude-3-5-sonnet

Introducing Claude 3.5 Sonnet

Introducing Claude 3.5 Sonnet—our most intelligent model yet. Sonnet now outperforms competitor models and Claude 3 Opus on key evaluations, at twice the speed.

lapis sequoia Aug 13, 2024, 4:16 PM

#

summary on sine vs tanh (paper is here https://openreview.net/pdf?id=Sks3zF9eg), pretty interesting read

tldr: sine seems better for intuitively periodic tasks (like addition), and comparable to tanh in standard cases
(not surprising that it kinda works in many tasks, but not due to periodicity.)
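A quick numeric check of why sine can stand in for tanh in the central region: near zero both are close to the identity, and they only part ways further out, where tanh saturates and sine turns back down. This compares the two functions only. It says nothing about training behaviour, and the helper name `max_gap` is made up for illustration.

```python
import math


def max_gap(lo, hi, n=1001):
    """Largest |sin(x) - tanh(x)| over n evenly spaced points in [lo, hi]."""
    step = (hi - lo) / (n - 1)
    xs = (lo + i * step for i in range(n))
    return max(abs(math.sin(x) - math.tanh(x)) for x in xs)


central = max_gap(-0.5, 0.5)  # small gap: the two curves nearly coincide here
wide = max_gap(-3.0, 3.0)     # large gap: sine has turned, tanh has saturated
```

If the pre-activations mostly stay in the central region, a network has little reason to behave differently with one or the other.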

lapis sequoia Aug 13, 2024, 6:03 PM

#

is random grid search just straight up better than quasi random search?

#

because it has the lowest discrepancy possible

#

and quasi random search is better than random search

#

so random grid search is the best?
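One thing worth checking before ranking them, sketched below under assumptions: a regular grid only ever tries a few distinct values of each individual hyperparameter, while a quasi-random sequence with the same budget tries a new value of every hyperparameter at every point. That matters when only one or two hyperparameters really affect the result. I am reading "grid" here as a plain regular grid. The Halton sequence stands in for "quasi random search", and all function names are made up for illustration.

```python
def van_der_corput(index, base):
    """Radical inverse of `index` in `base`: a low-discrepancy value in [0, 1)."""
    value, denom = 0.0, 1.0
    while index > 0:
        index, digit = divmod(index, base)
        denom *= base
        value += digit / denom
    return value


def halton_2d(n):
    """First n points of the 2-D Halton sequence (bases 2 and 3)."""
    return [(van_der_corput(i, 2), van_der_corput(i, 3)) for i in range(1, n + 1)]


def grid_2d(per_axis):
    """Regular grid with `per_axis` evenly spaced values on each axis."""
    ticks = [(k + 0.5) / per_axis for k in range(per_axis)]
    return [(x, y) for x in ticks for y in ticks]


def distinct_per_axis(points):
    """How many different values each hyperparameter actually gets tried at."""
    return tuple(len({p[axis] for p in points}) for axis in range(2))


grid_coverage = distinct_per_axis(grid_2d(3))      # 9 trials on a 3 x 3 grid
halton_coverage = distinct_per_axis(halton_2d(9))  # 9 quasi-random trials
```

Both use nine trials, but the grid spends them on three values per axis while the Halton points cover nine, so low discrepancy in the full space is not the only thing to compare.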

unkempt wigeon Aug 13, 2024, 6:31 PM

#

#===[imports]===#
import numpy as np
#===============#

X = np.array([0.1, 0.2, 0.3, 0.4])


converted_data0=np.asarray(X)

print(converted_data0)

serene scaffold Aug 13, 2024, 6:40 PM

#

unkempt wigeon ```py #===[imports]===# import numpy as np #===============# X = np.array([0.1,...

is there a question?

unkempt wigeon Aug 13, 2024, 6:41 PM

#

how can i get the array to collect to the data and run it through the network?

serene scaffold Aug 13, 2024, 6:41 PM

#

unkempt wigeon how can i get the array to collect to the data and run it through the network?

arrays don't "do things".

#

and what network?

lapis sequoia Aug 13, 2024, 6:46 PM

#

lapis sequoia summary on sine vs tanh (paper is here https://openreview.net/pdf?id=Sks3zF9eg),...

yeah, I think the best example is making a neural network predict sin(x) from x, and sin activation works the best for it

unkempt wigeon Aug 13, 2024, 7:09 PM

#

the neurons

serene scaffold Aug 13, 2024, 7:10 PM

#

unkempt wigeon the neurons

there is nothing in your code that appears to be neurons.

unkempt wigeon Aug 13, 2024, 7:19 PM

#

I'm working on the array to increase the speed; this was to test its use before joining the main code

serene scaffold Aug 13, 2024, 7:19 PM

#

unkempt wigeon I'm working on the array to increase the speed; this was to test its use before jo...

Are you communicating with us through an automated translator?

unkempt wigeon Aug 13, 2024, 7:20 PM

#

no

serene scaffold Aug 13, 2024, 7:21 PM

#

There is no reason to have converted_data0=np.asarray(X) in your code. X is already an array.

#

Try writing more code that represents layers of a network, and write code to send an array through the network.

unkempt wigeon Aug 13, 2024, 7:32 PM

#

#===[imports]===#
import sys
import numpy as np
import matplotlib
#===============#

#===[neuron network]===#
np.random.seed(0)

X = [[1, 2 ,3,2.5],
    [2.0,5.0,-1.0, 2.0],
    [-1.5, 2.7, 3.3, -0.8]]

class Layer_Dense:
    def __init__(self, n_inputs, n_neurons):
        self.weights =0.10 * np.random.randn(n_inputs, n_neurons)
        self.biases = np.zeros((1, n_neurons))

    def forward(self, inputs):
        self.output = np.dot(inputs, self.weights) + self.biases

layer0 = Layer_Dense(4,5)              
layer1 = Layer_Dense(5,9)
layer2 = Layer_Dense(9,4)
layer3 = Layer_Dense(4,2)


layer0.forward(X)
layer1.forward(layer0.output)
layer2.forward(layer1.output)
layer3.forward(layer2.output)

print(layer3.output)

serene scaffold Aug 13, 2024, 7:33 PM

#

unkempt wigeon ```py #===[imports]===# import sys import numpy as np import matplotlib #======...

did you verify that this works?

unkempt wigeon Aug 13, 2024, 7:33 PM

#

yes

#

[[ 5.86410565e-03 4.20239779e-05]
[ 4.60184756e-03 2.41869992e-03]
[ 1.37659937e-02 -1.03951813e-02]]

#

my apoliges

serene scaffold Aug 13, 2024, 7:34 PM

#

unkempt wigeon yes

do you understand why it works?

unkempt wigeon Aug 13, 2024, 7:36 PM

#

yes, it's the outputs combined from the neurons, getting all possible outputs from the set inputs. my apoliges

serene scaffold Aug 13, 2024, 7:37 PM

#

unkempt wigeon yes its the outputs combined from the neurons getting all posible outputs from...

why do you end every message with "my apoliges"?

unkempt wigeon Aug 13, 2024, 7:38 PM

#

segilopa ym wonk t'nod i

serene scaffold Aug 13, 2024, 7:38 PM

#

what?

main fox Aug 13, 2024, 7:40 PM

#

I thought it was another language but it's just reversed lol

serene scaffold Aug 13, 2024, 7:40 PM

#

unkempt wigeon segilopa ym wonk t'nod i

are you trolling us?

unkempt wigeon Aug 13, 2024, 7:44 PM

#

no my apoliges

serene scaffold Aug 13, 2024, 7:45 PM

#

unkempt wigeon no my apoliges

I take time out of my work day to answer questions here. so please do not shitpost.

unkempt wigeon Aug 13, 2024, 7:46 PM

#

my apoliges

serene scaffold Aug 13, 2024, 7:46 PM

#

unkempt wigeon ```py #===[imports]===# import sys import numpy as np import matplotlib #======...

so, what do you want to do now?

unkempt wigeon Aug 13, 2024, 7:47 PM

#

make it be able to learn colors from images and other things too

serene scaffold Aug 13, 2024, 7:48 PM

#

unkempt wigeon make it be able to learn colors from images and other things too

can you give an example of an image and what color you want it to learn?

unkempt wigeon Aug 13, 2024, 7:49 PM

#

unkempt wigeon make it be able to learn colors from images and other things too

#

serene scaffold Aug 13, 2024, 7:50 PM

#

why is that color the learned output for that image?

unkempt wigeon Aug 13, 2024, 7:52 PM

#

fox faces and the color green to start: green because you recommended it, and foxes as their faces are unique in shape and proportions

unkempt apex Aug 13, 2024, 7:52 PM

#

what the hell is going on !!😂

serene scaffold Aug 13, 2024, 7:53 PM

#

unkempt wigeon fox faces and the color green to start: green because you recommended it, and fox...

so you want it to recognize that the image contains a fox face, and when that's the case, you want the model to output green?

unkempt wigeon Aug 13, 2024, 7:56 PM

#

maybe i should teach it colors first my apoliges

unkempt apex Aug 13, 2024, 7:57 PM

#

bruhhh....

serene scaffold Aug 13, 2024, 7:57 PM

#

unkempt wigeon maybe i should teach it colors first my apoliges

you don't have to keep apologizing
what would it mean to teach colors to the model?

unkempt apex Aug 13, 2024, 7:57 PM

#

bro is high on something I guess ..

serene scaffold Aug 13, 2024, 7:58 PM

#

unkempt apex bro is high on something I guess ..

to be fair, I did tell them to start with green. #1253470566107709480 message

unkempt apex Aug 13, 2024, 7:59 PM

#

and in that post, he also apologizes..

unkempt wigeon Aug 13, 2024, 8:02 PM

#

serene scaffold you don't have to keep apologizing what would it mean to teach colors to the mod...

if it were to be given a photo it could discern the colors in that photo, for example:

#

#

list of colors

#

primarily green

#

secondary blue

#

third white

#

fourth is brown

#

my apoliges

unkempt apex Aug 13, 2024, 8:05 PM

#

bruhhh..

unkempt wigeon Aug 13, 2024, 8:06 PM

#

my apoliges

unkempt apex Aug 13, 2024, 8:07 PM

#

this way, no one will help you

#

you only apologize!

unkempt wigeon Aug 13, 2024, 8:10 PM

#

im sorry

#

i can figure out the rest i just need help help with one color my apoliges

#

@unkempt apex

unkempt apex Aug 13, 2024, 8:28 PM

#

unkempt wigeon @unkempt apex

writing assignments sir!

#

just post the questions properly, so other guys will look into it

#

without apologies

unkempt wigeon Aug 13, 2024, 8:32 PM

#

I just need help with one color input and then I can use a different output later

serene scaffold Aug 13, 2024, 8:37 PM

#

unkempt wigeon if it were to be given a photo it could discern the colors in that photo, for example:

you don't need ML for that. you can write code that tells you the color of each pixel and gives you the count for each color
you'll want to cluster them so that shades of what you consider green, blue, yellow, etc. are counted the same.
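A minimal sketch of that idea with no ML: map every pixel to the closest entry in a small hand-picked palette, then count. Note this swaps real clustering (for example k-means) for fixed nearest-reference-color assignment, which is the simplest version of "count shades of green as green". The palette values, the function names, and the assumption that pixels arrive as a list of (R, G, B) tuples are all made up for illustration. Loading an actual image is not shown.

```python
from collections import Counter

# Hypothetical reference palette; pick whatever shades you consider each color.
PALETTE = {
    "green": (0, 128, 0),
    "blue": (0, 0, 255),
    "white": (255, 255, 255),
    "brown": (139, 69, 19),
}


def nearest_color(pixel, palette=PALETTE):
    """Name of the palette entry closest to `pixel` in squared RGB distance."""
    def distance(name):
        return sum((a - b) ** 2 for a, b in zip(pixel, palette[name]))
    return min(palette, key=distance)


def count_colors(pixels, palette=PALETTE):
    """Count pixels per named color, most common first."""
    return Counter(nearest_color(p, palette) for p in pixels).most_common()


pixels = [(10, 120, 10), (0, 140, 5), (5, 5, 250), (250, 250, 250)]
ranking = count_colors(pixels)
```

The first entry of `ranking` is the primary color, the second the secondary one, and so on, which matches the "list of colors" output described earlier.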

unkempt wigeon Aug 13, 2024, 8:46 PM

#

im sorry

serene scaffold Aug 13, 2024, 9:00 PM

#

unkempt wigeon im sorry

why are you sorry

unkempt wigeon Aug 13, 2024, 9:06 PM

#

for bothering you

rough grove Aug 13, 2024, 11:05 PM

#

should i learn pytorch or tensorflow first

lapis sequoia Aug 13, 2024, 11:10 PM

#

serene scaffold I take time out of my work day to answer questions here. so please do not shitpo...

you chose to do so, didn't you?

twin acorn Aug 13, 2024, 11:58 PM

#

#1273012012258951178 message

need help with this, if you read the post it has details

#

ive been ignored for hours, any help is appreciated

serene scaffold Aug 14, 2024, 12:37 AM

#

lapis sequoia you chose to do so, didn't you?

I'm not sure what point you're making. I was in the process of helping that person, and they posted a message where the letters were reversed, and I asked them not to shitpost.

serene scaffold Aug 14, 2024, 12:38 AM

#

rough grove should i learn pytorch or tensorflow first

pytorch and tensorflow are two libraries that do the same thing. it's not a foregone conclusion that you need to know both. I recommend focusing on one.

but I also recommend learning a lot of other things before you get anywhere near neural networks.

rough grove Aug 14, 2024, 12:48 AM

#

serene scaffold pytorch and tensorflow are two libraries that do the same thing. it's not a fore...

I know the math behind backprop and all that good stuff but which one should I focus on

serene scaffold Aug 14, 2024, 12:48 AM

#

rough grove I know the math behind backprop and all that good stuff but which one should I f...

I use pytorch every day and have never been asked to use tensorflow in over five years.

rough grove Aug 14, 2024, 12:49 AM

#

serene scaffold I use pytorch every day and have never been asked to use tensorflow in over five...

Do u have a job in ML? What do u do cuz I wanna pursue a career in it

serene scaffold Aug 14, 2024, 1:15 AM

#

rough grove Do u have a job in ML? What do u do cuz I wanna pursue a career in it

I work in language ai

main fox Aug 14, 2024, 2:52 AM

#

serene scaffold I work in language ai

Research or industry? Also, how much regex do you use? I recently found myself with a project that initially sounded like it would need NER but regex worked very well.

unkempt apex Aug 14, 2024, 5:05 AM

#

rough grove should i learn pytorch or tensorflow first

come on , always pytorch..

rich river Aug 14, 2024, 5:29 AM

#

any data mining projects recommended?

faint quail Aug 14, 2024, 6:43 AM

#

whats the point of data mining/hoarding

indigo wing Aug 14, 2024, 7:22 AM

#

faint quail whats the point of data mining/hoarding

to gather data, to make sense of unstructured data mostly. Like whether you require that column in your dataset. Example: you want to find the avg height of 11-21 yr olds, and you take a lot of data that contains their names, age, sex, bmi, address etc. Now which of those do you want, what's the dtype of the data, do you need to create more columns? This is mostly what data mining looks like
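The height example can be sketched in a few lines. The records below are invented, and the field names (`height_cm` and so on) are assumptions. The point is only that two of the columns matter for the question, and that one value has the wrong dtype and needs converting.

```python
from statistics import mean

# Made-up records; the extra columns stand in for data you may not need.
records = [
    {"name": "A", "age": 12, "sex": "F", "height_cm": 150.0, "address": "..."},
    {"name": "B", "age": 19, "sex": "M", "height_cm": 176.0, "address": "..."},
    {"name": "C", "age": 34, "sex": "F", "height_cm": 165.0, "address": "..."},
    {"name": "D", "age": 21, "sex": "M", "height_cm": "181"},  # wrong dtype on purpose
]


def average_height(rows, min_age=11, max_age=21):
    """Mean height over the age range, using only the two columns that matter."""
    heights = [float(r["height_cm"]) for r in rows if min_age <= r["age"] <= max_age]
    return mean(heights)


avg = average_height(records)
```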

covert cave Aug 14, 2024, 7:52 AM

#

hii, how can I solve this error:
FileNotFoundError: [Errno 2] No such file or directory: 'C:\programs\anaconda3\Lib\site-packages\matplotlib\backends\web_backend\js\mpl.js'
for this code:
%matplotlib notebook
plt.plot(y_test, label='Real values')
plt.plot(california_y_predicted, label='guess values')
plt.legend();

lapis sequoia Aug 14, 2024, 9:07 AM

#

lapis sequoia yeah, I think the best example is making a neural network predict sin(x) from x,...

I'm not sure that's a good example though; that shows it's good at predicting its own behaviour.
But if it indeed is good at tasks with some periodicity built-in, then good enough for the sine.

#

this paper has a lot of cool stuff with activations, if anyone wants to waste their time, LLMs can certainly summarise it though.
https://arxiv.org/pdf/1710.05941

paper garnet Aug 14, 2024, 10:27 AM

#

does anyone here know machine learning libraries like TensorFlow, PyTorch, scikit-learn #data-science-and-ml

lapis sequoia Aug 14, 2024, 10:42 AM

#

are people here more in the camp of illusionists, materialists, reductionists, panpsychists, dualists,... ?

spare forum Aug 14, 2024, 10:49 AM

#

paper garnet does anyone here know machine learning libraries like TensorFlow, PyTorch, sciki...

Ask your question don't ask to ask

lapis sequoia Aug 14, 2024, 10:55 AM

#

lapis sequoia are people here more in the camp of illusionists, materialists, reductionists, p...

im illusionist

lapis sequoia Aug 14, 2024, 10:56 AM

#

lapis sequoia im illusionist

interesting, sad to lose dennett

lapis sequoia Aug 14, 2024, 10:57 AM

#

lapis sequoia interesting, sad to lose dennett

yeah

#

but I havent read him

#

i only read parfit

#

I just think if you are a materialist and not illusionist then you can't not be a dualist

lapis sequoia Aug 14, 2024, 11:08 AM

#

lapis sequoia but I havent read him

have you read marvin minsky, and what from parfit?

lapis sequoia Aug 14, 2024, 11:12 AM

#

lapis sequoia have you read marvin minsky, and what from parfit?

reasons and persons from parfit

#

haven't read anything else on phil of mind

#

i thought the personal identity chapter from reasons and persons was good because i agreed with it

#

this seems a nice article about meta learning https://jameskle.com/writes/meta-learning-is-all-you-need

James Le

Meta-Learning Is All You Need — James Le

Meta-learning , also known as learning how to learn , has recently emerged as a potential learning paradigm that can learn information from one task and generalize that information to unseen tasks proficiently. During this quarantine time, I started watching lectures on Stanford’s CS 330

lapis sequoia Aug 14, 2024, 11:16 AM

#

lapis sequoia i thought the personal identity chapter from reasons and persons was good becaus...

ill take a look one day, didn't know that guy, seems very interesting.

lapis sequoia Aug 14, 2024, 11:18 AM

#

lapis sequoia ill take a look one day, didn't know that guy, seems very interesting.

basically he takes the teleportation paradox and then goes very deep on a whole bunch of similar arguments that convincingly prove counter intuitive things about personal identity

#

actually he invented the teleportation paradox

#

and his other view is utilitarianism and he also has a whole bunch of very interesting and weird paradoxes even though i don't care too much about phil of ethics

#

that's cool, what are your main areas of interest @lapis sequoia ?

lapis sequoia Aug 14, 2024, 11:23 AM

#

lapis sequoia that's cool, what are your main areas of interest @lapis sequoia ?

I think just paradoxes and thought experiments in general because its very interesting how counter intuitive they are

lapis sequoia Aug 14, 2024, 11:25 AM

#

lapis sequoia I think just paradoxes and thought experiments in general because its very inter...

nice. recently read 'I Am a Strange Loop' by Douglas Hofstadter, he loves paradoxes. i like them too.

lapis sequoia Aug 14, 2024, 11:27 AM

#

lapis sequoia nice. recently read 'I Am a Strange Loop' by Douglas Hofstadter, he loves paradox...

nice, i want to read it and weirdness of the world by eric schwitzgebel

lapis sequoia Aug 14, 2024, 11:29 AM

#

lapis sequoia nice, i want to read it and weirdness of the world by eric schwitzgebel

a review on amazon says:

The word "Bizarre" is used 188 times on its 360-odd pages...

#

but can still be good

lapis sequoia Aug 14, 2024, 11:30 AM

#

lapis sequoia a review on amazon says: > The word "Bizarre" is used 188 times on its 360-odd...

i actually havent even read the reviews i just liked the title

sullen marsh Aug 14, 2024, 11:31 AM

#

Can anyone provide a learning path for an AI/ML engineer, from zero to hero?

fiery bane Aug 14, 2024, 12:30 PM

#

sullen marsh Can anyone provide a learning path for an AI/ML engineer, from zero to hero?

https://kidger.site/thoughts/just-know-stuff/

Patrick Kidger

Personal Website. Math, SciML, scuba diving!

fiery bane Aug 14, 2024, 12:31 PM

#

rich river any data mining projects recommended?

what do you like?

rich river Aug 14, 2024, 12:32 PM

#

fiery bane what do you like?

all are OK

fiery bane Aug 14, 2024, 12:36 PM

#

like, find some topic that you like, sports, movies, etc, and then do data mining on that topic

serene scaffold Aug 14, 2024, 12:46 PM

#

main fox Research or industry? Also, how much regex do you use? I recently found myself w...

I use regex a lot, but mostly to parse semi-structured data.
I do research in industry. I don't work for a university.
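As a small illustration of regex on semi-structured text (not serene scaffold's actual code), this parses header lines shaped like the ones in this very log, i.e. a name followed by a date and time. The pattern and the helper name `parse_header` are assumptions written for this example.

```python
import re

# Hypothetical pattern for header lines shaped like "name Mon D, YYYY, H:MM AM/PM".
HEADER = re.compile(
    r"^(?P<user>.+?) "
    r"(?P<date>[A-Z][a-z]{2} \d{1,2}, \d{4}), "
    r"(?P<time>\d{1,2}:\d{2} [AP]M)$"
)


def parse_header(line):
    """Return the named fields as a dict, or None if the line is not a header."""
    match = HEADER.match(line)
    return match.groupdict() if match else None


fields = parse_header("serene scaffold Aug 14, 2024, 12:46 PM")
```

The fixed date and time shape does the work here, which is typical of cases where a regex is enough and a learned tagger is not needed.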

serene scaffold Aug 14, 2024, 12:48 PM

#

indigo wing to gather data, to make sense of unstructured data mostly. Like whether you requ...

"do you need to create more columns?" you're talking about feature engineering. data mining is when you form insights based on analysis of large amounts of data.

viscid socket Aug 14, 2024, 12:58 PM

#

Does anyone know somewhere that I can download/mine large amount of resumes? I am thinking of making an anonymous resume dataset for SWOT analysis

lapis sequoia Aug 14, 2024, 1:17 PM

#

viscid socket Does anyone know somewhere that I can download/mine large amount of resumes? I a...

idk but noticed there are several discord communities just about that

hushed canopy Aug 14, 2024, 2:32 PM

#

Hi guys. I need books (or other resources) to learn Data Structures and Algorithms.
Please recommend.

serene scaffold Aug 14, 2024, 2:34 PM

#

hushed canopy Hi guys. I need books (or other resources) to learn Data Structures and Algorith...

wrong channel; see #algos-and-data-structs

lapis sequoia Aug 14, 2024, 2:42 PM

#

dropout 4all

indigo wing Aug 14, 2024, 2:55 PM

#

serene scaffold "do you need to create more columns?" you're talking about feature engineering. ...

yeah sorry got a bit mismatched

serene grail Aug 14, 2024, 2:59 PM

#

lapis sequoia dropout 4all

So is dropout commonly used? From "it's used in basically every NN" to "it's almost never used", how common is it?

serene scaffold Aug 14, 2024, 3:16 PM

#

serene grail So is dropout commonly used? From "it's used in basically every NN" to "it's alm...

I think it's used pretty often; read system papers and develop your own sense for this.

lapis sequoia Aug 14, 2024, 3:39 PM

#

serene grail So is dropout commonly used? From "it's used in basically every NN" to "it's alm...

there are some rules for when to apply it, but it's a common regulariser, and mostly useful for networks that are prone to overfitting (that's why it was invented.)

#

it may not work well with ReLUs in very deep networks, but im unsure whether this is fully established.
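For reference, the mechanism itself is tiny. Below is a sketch of the "inverted" variant, which rescales the kept units during training so nothing has to change at inference. As far as I know the original paper instead scales the weights at test time, so treat this as one common formulation rather than the paper's exact procedure. The function name and plain-list input are for illustration only.

```python
import random


def dropout(activations, rate, training=True, rng=random):
    """Inverted dropout: zero each unit with probability `rate`, rescale the rest."""
    if not 0.0 <= rate < 1.0:
        raise ValueError("rate must be in [0, 1)")
    if not training or rate == 0.0:
        return list(activations)  # inference: pass values through unchanged
    keep = 1.0 - rate
    # Dividing by `keep` leaves the expected value of each unit unchanged.
    return [a / keep if rng.random() < keep else 0.0 for a in activations]


dropped = dropout([1.0] * 1000, rate=0.5, rng=random.Random(0))
```

Because a different random subset is silenced on every call, no single unit can be relied on, which is the regularising effect mentioned above.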

serene grail Aug 14, 2024, 3:40 PM

#

I see, thank you both

runic parcel Aug 14, 2024, 3:41 PM

#

can anyone please help me with my langchain?
https://discord.com/channels/267624335836053506/1273305302954934334

lapis sequoia Aug 14, 2024, 3:41 PM

#

you are welcome, you may find the original paper's abstract readable https://www.cs.toronto.edu/~rsalakhu/papers/srivastava14a.pdf
it's got many top researchers right there.

green herald Aug 14, 2024, 4:43 PM

#

Hello, I'm in search of a DE mentor. Is this a good place to ask?

fallow frost Aug 14, 2024, 4:59 PM

#

how do I even answer:
"What is your experience managing batch and incremental data ingestion processes?"

#

is incremental live data?

green herald Aug 14, 2024, 5:03 PM

#

I don't know much. I know Python well but not much pipes and data flows.

green herald Aug 14, 2024, 5:21 PM

#

As I see, I need to know SQL well and be able to work with ETL in the cloud.

spare forum Aug 14, 2024, 7:07 PM

#

green herald As I see, I need to know SQL well and be able to work with ETL in the cloud.

having the basics of sql + data modeling, being familiar with notions like data warehouse, data lake... and also some common tools

unkempt apex Aug 14, 2024, 7:43 PM

#

!paste

arctic wedgeBOT Aug 14, 2024, 7:43 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

unkempt apex Aug 14, 2024, 7:43 PM

#

https://paste.pythondiscord.com/EDYA

#

is this good U-Net ?

#

because I'm confused about the output layer

unkempt apex Aug 14, 2024, 8:05 PM

#

yeah updating..

#

?

#

https://towardsdatascience.com/cook-your-first-u-net-in-pytorch-b3297a844cf3

Medium

Cook your First U-Net in PyTorch

A magic recipe to empower your image segmentation projects

#

https://www.sciencedirect.com/science/article/pii/S2772671123001390#tbl0001

#

and also this one
blue arrows represent conv

#

yeah like that!

#

that paper is really nice.

unkempt apex Aug 14, 2024, 8:30 PM

#

dataset is taking so much time to upload

#

on kaggle

#

what if I restart my session, will that 4GB dataset be gone?

#

shit then what's the best way?

#

I have already

#

download in kaggle?

#

how?

smoky basalt Aug 14, 2024, 9:43 PM

#

where should i start? i need help. i already know intermediate python and custom tkinter, and i need some help starting projects in data science and ai

green herald Aug 14, 2024, 10:34 PM

#

I don't think you can have a national flag icon.

spring field Aug 14, 2024, 10:36 PM

#

green herald I don't think you can have a national flag icon.

if you wish to get in contact with the moderation team, you can DM @sonic vapor

lapis sequoia Aug 14, 2024, 11:26 PM

#

I think I might have come up with a machine learning concept, I don't think I've heard this concept anywhere else.

Concept: The machine gets a bunch of information and if the information is relevant to the task, it will store and keep the info. Else if it isn't relevant to the task, it will save it just in case it's useful for another task. Else, it will delete the info as it's not needed.

green herald Aug 14, 2024, 11:41 PM

#

smoky basalt where should i start i need help, i already know intermediate python and know cu...

The moderator says it is OK.

spring field Aug 14, 2024, 11:46 PM

#

lapis sequoia I think I might have came up with a machine learning concept, I don't think I he...

between relevant and irrelevant what is the "else"

#

I'm also not sure how that's related to machine learning, the machine learning would be determining whether information is relevant or not

lapis sequoia Aug 14, 2024, 11:55 PM

#

spring field between relevant and irrelevant what is the "else"

For example, let's say your friend tells you he picked up a pen. It's information but it's extremely useless.
There are 3 sections.
Useful - Useful to the task
Junk - Stores the info, in case it's relevant to another task
Trash - Completely useless information

I think this concept can go well with reinforcement learning. Dealing with the information efficiently..?

Sorry, I don't know much about machine learning, but the concept of it intrigues me.

#

sorry, i wasnt done explaining

spring field Aug 14, 2024, 11:56 PM

#

There are 3 sections.
Useful
Junk

I'm not quite following
we're back to two states, initially you implied at least 3 different states (relevant, irrelevant, else (which isn't really possible, since the other two states should cover everything already))

smoky basalt Aug 14, 2024, 11:58 PM

#

green herald The moderator says it is OK.

ur the one that is getting mad

#

im learning tensors

#

i learnt how to make tensors 😂

spring field Aug 15, 2024, 12:00 AM

#

lapis sequoia For example, lets say your friend tells you he picked up a pen. It's information...

and what makes the information included in the example you gave completely useless?

lapis sequoia Aug 15, 2024, 12:02 AM

#

spring field and what makes the information included in the example you gave completely usele...

It's useless because it's not relevant to any task and the information given cannot do any other tasks.

Although, if he asks you if he picked up a pen, it can be useful information.

spring field Aug 15, 2024, 12:03 AM

#

lapis sequoia It's useless because it's not relevant to any task and the information given can...

exactly, so, how do you determine whether it's potentially useful?

#

it either is useful or it is not

lapis sequoia Aug 15, 2024, 12:03 AM

#

True, true

#

Then I would need a system that can detect fake information

#

Because in that scenario, fake information can be relevant.

left tartan Aug 15, 2024, 2:26 AM

#

lapis sequoia Because in that scenario, fake information can be relevant.

I think you're asking: how can I detect which variables are significant and which ones are noise (irrelevant). Is that right? Aka feature importance

rich river Aug 15, 2024, 2:32 AM

#

fiery bane what do you like?

I'm especially interested in learning GBDT and XGBoost (and CRF), if any projects cover them it would be better

lapis sequoia Aug 15, 2024, 2:58 AM

#

left tartan I think you're asking: how can I detect which variables are significant and whic...

Yes.
Although, I was thinking of storing all its knowledge in a list.

A method I thought of to get rid of the fake information is to put the program through a test and see if it can successfully complete it with no errors. Once it passes, it will keep the information.
Though, I would constantly need to create a test.

That's why I need to come up with an efficient method that can make sure the machine doesn't store false information.

I based this upon how we learn. Let's say we read a book, we absorb all its information and we store it as useful and good. And when we encounter fake info, we can just dismiss it with our knowledge from the book and disprove it.

But I'm still struggling to see what this method would be useful for.

Sorting data?

left tartan Aug 15, 2024, 2:58 AM

#

Fake is probably not the word you mean then; fake means (to me) inaccurate or misleading data, vs 'irrelevant' (noise).

lapis sequoia Aug 15, 2024, 3:04 AM

#

Yeah. Sorry about that. Got a little off track.

#

This honestly sounds like it can be used in sorting data

#

I wish I knew more about machine learning. Anyone got any resources I can use? Thanks.

robust jungle Aug 15, 2024, 4:01 AM

#

does anyone have any tips on making template matching characters more reliable? I want to be able to identify characters in a game menu, which has a consistent font. Currently I'm using cv2.matchTemplate alongside a collection of rendered characters in that font. To my eye the characters look to be about the same size as the ones in the image, and I'm using Image.convert to make sure the colors match. Any ideas?

fiery bane Aug 15, 2024, 4:15 AM

#

rich river I'm especially interested in learning GBDT and XGBoost (and CRF), if any project...

ok, so

find a topic that you like. If you need a list of topics, go to https://paperswithcode.com/sota
find a dataset in that topic
go crazy

Papers with Code - Browse the State-of-the-Art in Machine Learning

11332 leaderboards • 5039 tasks • 10405 datasets • 137835 papers with code.

iron sparrow Aug 15, 2024, 5:37 AM

#

Thanks for passing this along

runic parcel Aug 15, 2024, 5:52 AM

#

How is Geospy Ai model trained? What data did they use and how can it be done?

rigid timber Aug 15, 2024, 8:19 AM

#

can anyone help me find a pre trained model for a medical chatbot

unkempt apex Aug 15, 2024, 8:44 AM

#

rigid timber can anyone help me find a pre trained model for a medical chatbot

for what uses?

#

there are some that predict diseases based on symptoms

rigid timber Aug 15, 2024, 9:21 AM

#

unkempt apex there are some that predict diseases based on symptoms

Exactly that

wise bane Aug 15, 2024, 9:45 AM

#

without getting too deep into what i want to do, it's basically a "machine learning algorithm" that can differentiate between blood slides with cancer and without cancer, using a control data set and a data set that has cancer. how can i achieve this?

unkempt apex Aug 15, 2024, 11:33 AM

#

rigid timber Exactly that

https://huggingface.co/abhirajeshbhai/symptom-2-disease-net

abhirajeshbhai/symptom-2-disease-net · Hugging Face

#

also check the Med-BERT

unkempt apex Aug 15, 2024, 12:37 PM

#

got this while loading dataset using DataLoader

rigid timber Aug 15, 2024, 12:50 PM

#

unkempt apex ```ValueError: num_samples should be a positive integer value, but got num_sampl...

what does that imply?

spare forum Aug 15, 2024, 1:01 PM

#

Show the code too uh

#

torch Dataloader?

#

Seems like you passed empty data or idk

fiery bane Aug 15, 2024, 1:09 PM

#

lapis sequoia I wish I knew more about machine learning. Anyone got any resources I can use? T...

how deep you want to go?

fiery bane Aug 15, 2024, 1:09 PM

#

lapis sequoia I wish I knew more about machine learning. Anyone got any resources I can use? T...

Is this list good enough? https://kidger.site/thoughts/just-know-stuff/

Patrick Kidger

Personal Website. Math, SciML, scuba diving!

unkempt apex Aug 15, 2024, 1:24 PM

#

rigid timber what does that imply?

nvm , I deleted dataset and uploaded again now works

rigid timber Aug 15, 2024, 3:01 PM

#

unkempt apex nvm , I deleted dataset and uploaded again now works

I'll try it myself as well, is the model any good?

runic parcel Aug 15, 2024, 4:20 PM

#

can anyone help me in my langchain problem?
https://discord.com/channels/267624335836053506/1273677397220261960

raw pasture Aug 15, 2024, 4:29 PM

#

Hey guys, who is good at machine learning and can help me with a project? Anyone?

serene scaffold Aug 15, 2024, 4:31 PM

#

raw pasture Hey guys, who is good at machine learning and can help me with a project? Anyone?

always ask your actual question. don't ask to ask.

raw pasture Aug 15, 2024, 4:35 PM

#

okay

lapis sequoia Aug 15, 2024, 4:37 PM

#

this is quite cool, hard to get all though (for me) https://en.wikipedia.org/wiki/Universal_approximation_theorem

Universal approximation theorem

In the mathematical theory of artificial neural networks, universal approximation theorems are theorems of the following form: Given a family of neural networks, for each function f from a certain function space, there exists a sequence of neural networks ...

lapis sequoia Aug 15, 2024, 5:50 PM

#

fiery bane how deep you want to go?

I just want to learn Reinforcement Learning

unkempt apex Aug 15, 2024, 5:52 PM

#

lapis sequoia I just want to learn Reinforcement Learning

then learn that

lapis sequoia Aug 15, 2024, 5:54 PM

#

alr, sounds good

unkempt apex Aug 15, 2024, 6:00 PM

#

#

again this stupid error

although I am creating the notebook directly from the dataset

unkempt apex Aug 15, 2024, 6:14 PM

#

spare forum torch Dataloader?

..

spare forum Aug 15, 2024, 6:14 PM

#

?

#

asking a question because you dropped the error with 0 code

unkempt apex Aug 15, 2024, 6:15 PM

#

@spare forum the code is in pic

#

do you need that class code of how I am loading data?

spare forum Aug 15, 2024, 6:19 PM

#

it's with datasets.ImageFolder from torchvision no ? or handmade

unkempt apex Aug 15, 2024, 6:20 PM

#

spare forum it's with datasets.ImageFolder from torchvision no ? or handmade

yeah from torchvision

#

the dataset class inherits from Dataset in torch.utils.data

spare forum Aug 15, 2024, 6:20 PM

#

check result maybe idk

#

I don't really know the problem

#

I would do something like from torchvision import datasets, then the code is very similar: datasets.ImageFolder(root="...", transform=train_transform)

#

but something went off with this I guess

unkempt apex Aug 15, 2024, 7:03 PM

#

spare forum check result maybe idk

that was a very stupid error

#

the train images were .jpg and I was checking for .png 😂
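
#

A minimal sketch of a guard against this kind of mismatch, assuming the images sit in one folder tree: collect files by a set of accepted extensions and fail early when nothing matches, instead of letting the DataLoader complain about num_samples=0 later. The helper name list_images, the extension set and the demo file names are made up for illustration.

```python
import tempfile
from pathlib import Path

# Hypothetical set of accepted extensions; adjust to the dataset at hand.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def list_images(root):
    """Return sorted image paths under root, or raise if none are found."""
    files = sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
    )
    if not files:
        raise FileNotFoundError(
            f"no files with extensions {sorted(IMAGE_EXTENSIONS)} under {root}"
        )
    return files


# Small demo on a throwaway folder: one .jpg and one unrelated file.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "road_001.jpg").write_bytes(b"")
    (Path(tmp) / "notes.txt").write_text("not an image")
    found = [p.name for p in list_images(tmp)]

print(found)
```

The same check could sit in a custom Dataset's __init__, so an empty file list fails right where the paths are collected.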

spare forum Aug 15, 2024, 7:04 PM

#

☠️

unkempt apex Aug 15, 2024, 7:53 PM

#

so now, the project is road extraction from satellite images
where
/train -> satellite images and masks(label)
/test -> sat images
/valid -> sat images

#

so how can I train my model?
because we can't calculate validation loss, as there are no masks to validate against or even simply compare with
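
#

One common way around this, assuming /valid and /test really ship without masks: hold out part of the labeled /train pairs and compute validation loss on that held-out part. A rough sketch with plain lists of paths (the file names and the split_pairs helper are made up; torch.utils.data.random_split does the equivalent for Dataset objects):

```python
import random


def split_pairs(pairs, val_fraction=0.2, seed=0):
    """Shuffle (image, mask) pairs and split them into train and validation lists."""
    pairs = list(pairs)
    random.Random(seed).shuffle(pairs)  # fixed seed keeps the split reproducible
    n_val = max(1, int(len(pairs) * val_fraction))
    return pairs[n_val:], pairs[:n_val]


# Hypothetical file names, only to show the shapes involved.
pairs = [(f"train/{i}_sat.jpg", f"train/{i}_mask.png") for i in range(10)]
train_pairs, val_pairs = split_pairs(pairs, val_fraction=0.2)
print(len(train_pairs), len(val_pairs))
```

The unlabeled /valid and /test images can then still be used for visual inspection of predictions, just not for a loss.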

faint quail Aug 15, 2024, 10:39 PM

#

how to do backpropagation with tensorflow

this is what I have so far

class Conv2d(Layer):
    def __init__(self, depth, kernel_shape=[3, 3], stride=1, variance="He"):
        self.kernel_shape = np.array(kernel_shape)
        self.variance = variance
        self.depth = depth
        self.stride = stride

    def forward(self, input_activations, training=True):
        output_activations = self.biases.copy()
        # for i, kernels in enumerate(self.kernels):
        #     for kernel, channel in zip(kernels, input_activations):
        #         output_activations[i] += scipy.signal.correlate2d(channel, kernel, "valid")

        start_time = time.time()

        output_activations += tf.nn.conv2d(
            input_activations.reshape(*input_activations.shape, -1).T, 
            np.flip(self.kernels.T, (0, 1)), 
            strides=[1, 1, 1, 1], 
            padding='VALID'
        )[0].numpy().T

        end_time = time.time()

        if training:
            self.output_activations = output_activations

        return output_activations

    def backward(self, input_activations, node_values):
        new_node_values = np.zeros(input_activations.shape)
        kernels_gradient = np.zeros(self.kernels.shape)

        for i, (kernels, kernel_node_values) in enumerate(zip(self.kernels, node_values)):
            for j, (image, kernel) in enumerate(zip(input_activations, kernels)):
                
                kernels_gradient[i, j] = scipy.signal.correlate2d(image, kernel_node_values, "valid")
                new_node_values[j] += scipy.signal.convolve2d(kernel_node_values, kernel, "full")

        kernels_biases_gradient = node_values
        return new_node_values, [kernels_gradient, kernels_biases_gradient]

The forward pass is really fast, but the full convolution in the backward pass is really slow, and I can't figure out how to write it using tensorflow, which is much faster

lapis sequoia Aug 15, 2024, 11:46 PM

#

wise bane without getting too deep into what i want to do, its basically a "machine learni...

I didn't see a response but you want to look at classification models. I haven't read this but it looks like a good start: https://jonaac.github.io/works/deepxgboost.html

agile anvil Aug 16, 2024, 12:53 AM

#

Does anyone here have ChatGPT Advanced Voice and would be willing to help do something like https://www.youtube.com/watch?v=MB-IGShzNzA but for geolocating UK accents?

YouTube

Abdelkader Bouzidi

Chat gpt4o new Advanced Voice Mode recognizing different accents


faint quail Aug 16, 2024, 1:01 AM

#

faint quail how to do backpropagation with tensorflow this is what I have so far ```py cla...

nvm I figured it out

    def forward(self, input_activations, training=True):
        output_activations = self.biases.copy()
        output_activations += tf.nn.conv2d(
            input_activations.reshape(*input_activations.shape, -1).T, 
            np.flip(self.kernels.T, (0, 1)), 
            strides=[1, 1, 1, 1], 
            padding='VALID'
        )[0].numpy().T

        if training:
            self.output_activations = output_activations

        return output_activations

    def backward(self, input_activations, node_values):
        # Gradient w.r.t. the layer input, computed by TensorFlow instead of the scipy loop.
        new_node_values = tf.nn.conv2d_backprop_input(
            [1, *input_activations.shape[::-1]],
            filters = self.kernels.T,
            out_backprop = node_values.reshape(*node_values.shape, -1).T,
            strides = [1, 1, 1, 1],
            padding = "VALID",
        ).numpy()[0].T

        kernels_gradient = tf.nn.conv2d_backprop_filter(
            input_activations.reshape(*input_activations.shape, -1).T,
            self.kernels.shape[::-1],
            out_backprop = node_values.reshape(*node_values.shape, -1).T,
            strides = [1, 1, 1, 1],
            padding = "VALID",
        ).numpy().T

        kernels_biases_gradient = node_values
        return new_node_values, [kernels_gradient, kernels_biases_gradient]

merry mica Aug 16, 2024, 1:20 AM

#

@lapis swift

wise bane Aug 16, 2024, 7:43 AM

#

lapis sequoia I didn't see a response but you want to look at classification models. I haven't...

thanks ill check it out

lapis sequoia Aug 16, 2024, 7:43 AM

#

quite a beautiful theorem, the universal approximation theorem in 2 short paragraphs

#

Any continuous fn on a compact subset of Euclidean space can be approximated arbitrarily well by a 1-hidden-layer neural network, given enough neurons (infinitely many in the limit).
Interestingly, many theorems state that the activation fn must be non-polynomial, which many papers seem to ignore (they test polynomial fns and fail).

mild bear Aug 16, 2024, 7:54 AM

#

Hello, I am currently working on a research project and I've run into a problem with my minimax algorithm. Some backstory: the project aims to integrate minimax strategies into the selection phase of the MCTS algorithm implemented in Michael Hu's AlphaZero "clone/model". The MCTS used in AlphaZero is a bit different from a traditional MCTS algorithm in these ways:
1.) After the search reaches a leaf node, there is no rollout. Instead, AlphaZero uses the neural network to evaluate the board position and uses that as an estimated game result to update the statistics in the search tree.
2.) When expanding a leaf node, all children are expanded in a single operation, rather than the standard MCTS, which expands one child at a time. This means that after node expansion, a leaf node immediately becomes fully expanded.
3.) AlphaZero uses a slightly different UCT algorithm to select the best child during the selection phase, which incorporates the prior action probabilities from the output of the neural network.
There's a lot of code, but the main issue I'm having is that my minimax function for some reason doesn't work: it does not select the correct max or min values based on the evaluation value obtained at the terminal state

#

The minimax function with alpha-beta pruning:

#

def minimax(
        env,
        node: Node,
        depth: int,
        alpha: float,
        beta: float,
        maximizing_player: bool,
        eval_func: Callable[[np.ndarray], Tuple[Iterable[np.ndarray], float]]
) -> float:

    # Base case: if we reach the maximum depth or the node is terminal (not expanded)
    if depth == 0 or not node.is_expanded:
        # Use the environment's observation and eval_func for terminal state evaluation
        observation = env.observation()  # Get the observation from the environment
        _, value = eval_func(observation)

        return value

    # Get the legal moves (i.e., child nodes that are expanded)
    legal_moves = np.where(node.child_N > 0)[0]

    if maximizing_player:
        max_eval = float('-inf')
        for move in legal_moves:
            child_node = node.children[move]
            eval = minimax(env, child_node, depth - 1, alpha, beta, False, eval_func)
            max_eval = max(max_eval, eval)
            alpha = max(alpha, eval)
            if beta <= alpha:
                break  # Beta cut-off
        return max_eval
    else:
        min_eval = float('inf')
        for move in legal_moves:
            child_node = node.children[move]
            eval = minimax(env, child_node, depth - 1, alpha, beta, True, eval_func)
            min_eval = min(min_eval, eval)
            beta = min(beta, eval)
            if beta <= alpha:
                break  # Alpha cut-off
        return min_eval

lapis sequoia Aug 16, 2024, 8:01 AM

#

i wish 5+ lines code blocks were collapsed by default

mild bear Aug 16, 2024, 8:02 AM

#

could i do that?

lapis sequoia Aug 16, 2024, 8:03 AM

#

i don't think so, not your fault at all; you can paste a link but then less people would see it, so it's fine :-)

mild bear Aug 16, 2024, 8:03 AM

#

The selection function:

#

def best_child(
        env,
        node: Node,
        legal_actions: np.ndarray,
        c_puct_base: float,
        c_puct_init: float,
        child_to_play: int,
        eval_func: Callable[[np.ndarray], Tuple[Iterable[np.ndarray], Iterable[float]]],
        alpha: float = 0.5,
        minimax_depth: int = 2,
) -> Node:
    
    if not node.is_expanded:
        raise ValueError('Expand leaf node first.')

    ucb_scores = -node.child_Q() + node.child_U(c_puct_base, c_puct_init)

    # Initialize the minimax scores for legal actions
    minimax_values = np.full_like(ucb_scores, fill_value=-9999.0, dtype=np.float32)

    # Apply minimax to the legal actions only
    for move in range(len(legal_actions)):
        if legal_actions[move] == 1:
            if move in node.children:
                minimax_values[move] = minimax(env, node.children[move], minimax_depth, float('-inf'), float('inf'),
                                               node.to_play == 1, eval_func)
            else:
                # If the child node does not exist, treat it as a leaf with no Minimax value
                minimax_values[move] = 0

    # Combine the UCB scores and Minimax values with a weighted sum
    combined_scores = (1 - alpha) * ucb_scores + alpha * minimax_values

    # Exclude illegal actions by setting the combined scores to -9999
    combined_scores = np.where(legal_actions == 1, combined_scores, -9999)

    # Select the move with the highest combined score
    move = np.argmax(combined_scores)

    assert legal_actions[move] == 1

    if move not in node.children:
        node.children[move] = Node(to_play=child_to_play, num_actions=node.num_actions, move=move, parent=node)

    return node.children[move]

#

Thanks in advance for any help🙏
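
#

One thing that may be worth checking in the minimax above: env is passed down the recursion unchanged, so env.observation() at a leaf would describe the position env is currently in rather than the leaf's own position. If that is the case, every leaf gets the same value and max/min cannot tell moves apart. (Separately, node.child_N > 0 picks visited children, which is not necessarily the same as legal moves.) A minimal sketch of alpha-beta where each node carries its own evaluation; TreeNode is a hypothetical stand-in, not the Node class from the project:

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class TreeNode:
    """Hypothetical stand-in for a search-tree node; value is this node's own evaluation."""
    value: float = 0.0
    children: Dict[int, "TreeNode"] = field(default_factory=dict)


def alphabeta(node, depth, alpha, beta, maximizing):
    # Leaf or depth limit: return the evaluation of *this* node's position.
    if depth == 0 or not node.children:
        return node.value
    if maximizing:
        best = float("-inf")
        for child in node.children.values():
            best = max(best, alphabeta(child, depth - 1, alpha, beta, False))
            alpha = max(alpha, best)
            if beta <= alpha:
                break  # the minimizing parent will never allow this branch
        return best
    best = float("inf")
    for child in node.children.values():
        best = min(best, alphabeta(child, depth - 1, alpha, beta, True))
        beta = min(beta, best)
        if beta <= alpha:
            break  # the maximizing parent already has a better option
    return best


# Tiny tree: the root maximizes, its two children minimize over their leaves.
left = TreeNode(children={0: TreeNode(3.0), 1: TreeNode(5.0)})
right = TreeNode(children={0: TreeNode(2.0), 1: TreeNode(9.0)})
root = TreeNode(children={0: left, 1: right})

root_value = alphabeta(root, 2, float("-inf"), float("inf"), True)
print(root_value)
```

In the real code the equivalent would be evaluating the observation that belongs to the child node (for example by stepping a copy of the environment with the child's move), but how to do that depends on the env API, which isn't shown here.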

verbal oar Aug 16, 2024, 8:13 AM

#

hmm if a classical use of pca was face recognition
and if pca is dimensionality reduction, and its neural equivalent is the autoencoder, are autoencoders used for face recognition?

#

I meant extracting features with pca

#

hmm but with neural networks there is no need for feature extraction

quaint rivet Aug 16, 2024, 9:45 AM

#

stuck at this error

#

ModuleNotFoundError                       Traceback (most recent call last)

<ipython-input-3-87b76c9d6778> in <cell line: 12>()
     10 
     11 from mrcnn.config import Config
---> 12 from mrcnn.model import modellib, utils

/content/mrcnn/model.py in <module>
     21 import keras.backend as K
     22 import keras.layers as KL
---> 23 import keras.engine as KE
     24 import keras.models as KM
     25 

ModuleNotFoundError: No module named 'keras.engine'

#

i'm trying to use mask rcnn code. But i don't know why it's giving me this error

deep iris Aug 16, 2024, 9:59 AM

#

What is Batching of LLM Jobs, How Can It Reduce LLM Inference Cost, and How Can It Help Overcome Challenges Like Rate Limiting and GPU Utilization?

In this article, I explain all the above concepts. Please have a read and let me know your thoughts.
https://blog.cuminai.com/unlocking-the-power-of-job-batching-transforming-ai-workloads-2220b8c05e4f

Medium

Unlocking the Power of Job Batching: Transforming AI Workloads

Understanding what is LLM batching API, How it can be helpful? what are the different use cases of it? What can be possible cost saving?

lapis sequoia Aug 16, 2024, 10:06 AM

#

maybe that means that all components of C, W, b can be found?

#

no, the fn just needs to be continuous

#

that was my 1st interp. but i think it just means that sigma would get undefined constants multiplying it

#

if it means that though, it makes sense that it's calling it sigma, and those were proven much later

#

ReLU, GeLU and so on, in fact some were proven only recently!

#

and also for discontinuous functions (2023)

#

it's the form most often quoted; the 1st theorem was proven only for sigmoid iirc

#

it was extended to relu etc

#

Also, certain non-continuous activation functions can be used to approximate a sigmoid function, which then allows the above theorem to apply to those functions. For example, the step function works. In particular, this shows that a perceptron network with a single infinitely wide hidden layer can approximate arbitrary functions.

#

true, i thought it was for relus; those were proven later, i may get the paragraph

#

are you on the wikipage?

#

im not sure why, but the first was proven for sigmoids, this is the line:

The first examples were the arbitrary width case. George Cybenko in 1989 proved it for sigmoid activation functions
the paper is paid though

#

maybe but they can be used though, it's just proven later on from what i read there, then that's why we use XLUs

#

yes, plus people use x^2 as well from what i understand

#

you may need to check that paper, kinda funny to put it under a paywall since it's got 30K citations, well maybe thats why

#

for me visually, it makes sense that ReLUs can approximate any fn, more than sigmoids

cinder tangle Aug 16, 2024, 10:28 AM

#

Hi,
I am working on a small project related to RAG and am stuck (Apparently cuz I don't know much)

I used mxbai-embed-large as embeddings and Chroma db as Vector store all goes well to this point.

Issue: When I try to retrieve data with similarity threshold it returns 0 docs and without threshold and k it always returns 4 docs no matter the query.
What is it that I am doing wrong?
Here's my code:

Vector Store Creation File:

# Load Docs and then store embeddings in the Chroma DB
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.document_loaders import PyMuPDFLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_chroma import Chroma

embeddings = OllamaEmbeddings(
    base_url="http://43.204.231.131:11434",
    model="mxbai-embed-large",
)

loader = PyMuPDFLoader("./data/aliceShort.pdf")
data = loader.load()
# print(len(data))

text_splitter = RecursiveCharacterTextSplitter(
    # Set a really small chunk size, just to show.
    chunk_size=300,
    chunk_overlap=100,
    length_function=len,
    add_start_index=True,
)

chunks = text_splitter.split_documents(data)
print(f"Split {len(data)} documents into {len(chunks)} chunks.")


db = Chroma.from_documents(chunks, embeddings,persist_directory="./chroma_langchain_db")

query = "Who is Alice?"
docs = db.similarity_search(query)
print(docs[0].page_content)

Query File:

from langchain_community.embeddings import OllamaEmbeddings
from langchain_chroma import Chroma

embeddings = OllamaEmbeddings(
    base_url="http://65.2.37.27:11434",
    model="mxbai-embed-large",
)
db = Chroma(persist_directory="./chroma_langchain_db", embedding_function=embeddings)
query_text="Who is Alice?"
retriever = db.as_retriever(
    search_type="similarity_score_threshold", search_kwargs={"score_threshold": 0.1})
docs = retriever.invoke(query_text)
print(len(docs))
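
#

Two things that might explain this, both hedged: as far as I know similarity_search uses k=4 when no k is given, which would account for the constant 4 docs; and the score threshold is applied to relevance scores derived from distances, which can come out negative when the embeddings are not unit-normalised, so everything gets filtered. The sketch below only illustrates that interaction in plain Python. The 1 - d / sqrt(2) mapping is an assumption about the L2 case, and the distances are made up.

```python
import math


def l2_relevance(distance):
    """Assumed distance-to-score mapping for L2 distance: 1 - d / sqrt(2)."""
    return 1.0 - distance / math.sqrt(2)


def retrieve(scored_docs, k=4, score_threshold=None):
    """Keep the k best (doc, score) pairs, optionally dropping low scores first."""
    ranked = sorted(scored_docs, key=lambda pair: pair[1], reverse=True)
    if score_threshold is not None:
        ranked = [pair for pair in ranked if pair[1] >= score_threshold]
    return ranked[:k]


# Made-up distances, as they might look for embeddings that are not unit-normalised.
distances = {"chunk_a": 210.0, "chunk_b": 250.0, "chunk_c": 260.0,
             "chunk_d": 300.0, "chunk_e": 310.0, "chunk_f": 400.0}
scored = [(doc, l2_relevance(d)) for doc, d in distances.items()]

no_threshold = retrieve(scored)                         # always k docs, whatever the query
with_threshold = retrieve(scored, score_threshold=0.1)  # every score is negative here
print(len(no_threshold), len(with_threshold))
```

If that is what is happening, printing the raw scores (for example via similarity_search_with_relevance_scores) should show it, and creating the collection with cosine distance is one option I believe people use.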

lapis sequoia Aug 16, 2024, 10:33 AM

#

this is also quite interesting lol:

Notice also that the neural network is only required to approximate within a compact set K. The proof does not describe how the function would be extrapolated outside of the region.

#

Well, i guess it does badly outside the region

#

i read it as: the approximation will be accurate within the training set (or the area sufficiently covered by it)

#

yeah, but it's limited unless you have all the data

#

so you may learn any fn fitting the data in a subarea

#

i don't mean infinite data, i mean all the data

#

uhmm..to me it's got a more practical reading as well. since we normally have a subset of the data, that may be fully linear (described by a line/plane/etc), and you may find f for that region K, in any-dimensional space

#

but that's f for K which is the universe for the model, since it hasn't seen outside K

#

no, the dataset (datapoints) would get g to approach f

#

so one works backwards and the dataset points define K for g which will approach to f for K

#

then any datapoint outside K will never be predicted correctly unless one gets lucky

#

that leaky relu and such can approximate any fn seems intuitive:
imagine that all units output 0 apart from some (what ReLU does),
and the active ones add up to a local line that is 'tangent' to the real fn; for the offset, one's got the bias.

#

(the infinite neurons of the single layer are the infinite segments)
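
#

A small sketch of that intuition, assuming a 1-D target and hand-set weights (no training): a single hidden layer of ReLUs whose output weights are the changes in slope at evenly spaced knots reproduces the piecewise-linear interpolant of f on [a, b]. More hidden units means shorter segments and a smaller error inside the interval; outside it the net just continues its last line. The function names are made up for illustration.

```python
import math


def relu(z):
    return max(0.0, z)


def fit_relu_net(f, a, b, n_hidden):
    """Build a 1-hidden-layer ReLU net that linearly interpolates f on [a, b]."""
    knots = [a + (b - a) * i / n_hidden for i in range(n_hidden + 1)]
    slopes = [
        (f(knots[i + 1]) - f(knots[i])) / (knots[i + 1] - knots[i])
        for i in range(n_hidden)
    ]
    # Each hidden unit adds the change in slope at its knot.
    weights = [slopes[0]] + [slopes[i] - slopes[i - 1] for i in range(1, n_hidden)]
    return knots[:-1], weights, f(a)


def predict(x, shifts, weights, bias):
    return bias + sum(w * relu(x - s) for s, w in zip(shifts, weights))


def max_error(f, a, b, n_hidden, n_test=1000):
    shifts, weights, bias = fit_relu_net(f, a, b, n_hidden)
    xs = [a + (b - a) * k / n_test for k in range(n_test + 1)]
    return max(abs(f(x) - predict(x, shifts, weights, bias)) for x in xs)


coarse = max_error(math.sin, 0.0, math.pi, n_hidden=4)
fine = max_error(math.sin, 0.0, math.pi, n_hidden=64)

# Outside the fitted interval [0, pi] the approximation is no longer controlled.
shifts, weights, bias = fit_relu_net(math.sin, 0.0, math.pi, 64)
outside = abs(math.sin(2 * math.pi) - predict(2 * math.pi, shifts, weights, bias))
print(coarse, fine, outside)
```

This is only the easy 1-D picture, not the proof from the papers, but it also shows the "compact set K" caveat: the error shrinks inside [0, pi] and is large at 2*pi.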

#

apparently this is why polynomials fail, i've no idea what it means

#

Paper is MULTILAYER FEEDFORWARD NETWORKS WITH A NON-POLYNOMIAL ACTIVATION FUNCTION CAN APPROXIMATE ANY FUNCTION sorry it's in caps

#

this is another "proof" of universality for maxout networks, it's visual. i can't get it very well, but seems intuitively right?

spare forum Aug 16, 2024, 11:27 AM

#

lapis sequoia Paper is `MULTILAYER FEEDFORWARD NETWORKS WITH A NON-POLYNOMIAL ACTIVATION FUNCT...

Isn't that not so new?

lapis sequoia Aug 16, 2024, 11:29 AM

#

spare forum Isn't that not so new?

it is :-(

#

1992

spare forum Aug 16, 2024, 11:42 AM

#

C(Rn) would be continuous functions Rn->R

#

idk about Sigma_n

lapis sequoia Aug 16, 2024, 11:43 AM

#

yup same, that's why i shared the paper's title below

#

but Density is mentioned in the wikipage as well

#

idk what that is either

lapis sequoia Aug 16, 2024, 11:47 AM

#

spare forum Isn't that not so new?

what's new is the proof for discontinuous fns i think. i can share it if anyone wants to check, it's way too complex for me

spare forum Aug 16, 2024, 11:50 AM

#

lapis sequoia what's new is the proof for discontinous fns i think i can share if anynone want...

I see because the theorem with some strict assumptions (continuous ? ) is older, I have it somewhere in my course (not the entire proof just the result lol)

lapis sequoia Aug 16, 2024, 11:51 AM

#

spare forum I see because the theorem with some strict assumptions (continuous ? ) is older...

yes, that's correct, so you can find out there that NNs cant approximate some specific fns (discontinuous)

#

i mean some crazy fns f, if that's a reply to me: https://www.reddit.com/r/programming/comments/z23f05/comment/ixeg9os/ (just the comments!)

ZMeson's comment on "Why Neural Networks Can Approximate Any Functi...

Explore this conversation and more from the programming community

toxic mortar Aug 16, 2024, 11:56 AM

#

Hey guys,

I’ve created a multiclass classification model and trained it on a labeled dataset. It went pretty well on the local dataset tbh, and I’m now looking to soft-launch it into prod. The input data will be converted into an n-dimensional input vector, which won’t form a convex or regular shape when plotted on a chart (at least my EDA shows that). Since I can’t foresee every possible model input, the model won’t handle every scenario perfectly, which is I guess okay, but I am aiming for a broad use case. That will lead to a number of false positives, which I want to iteratively add to my training data corpus to improve the model over time.

I’m looking for an efficient approach to identify and manage these false positives. I was thinking about:
1) Randomly sampling a subset of the data and labeling it manually to verify whether each prediction is a true positive or a false positive.
2) Getting user feedback to identify misclassified ones.
3) Using clustering techniques with metrics like Silhouette score, Davies-Bouldin Index, Calinski-Harabasz Index (CH), Normalized Mutual Information (NMI), or the Dunn Index.
4) Combining 1) and 3)? Identify some of the false positives, then use clustering to find similar ones, which are possibly also false positives.

My end goal is to create a pipeline that will iteratively improve over time. How would you approach this problem? Thanks!
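
#

A rough sketch of option 1), assuming predictions are logged as (item_id, predicted_label) pairs. It samples per predicted class rather than uniformly (a stratified variant, so rare classes still get reviewed) and then turns the manual labels into a per-class precision estimate. All names and the toy data are made up for illustration.

```python
import random
from collections import defaultdict


def sample_for_review(predictions, per_class=50, seed=0):
    """Pick up to per_class item ids for manual labeling from each predicted class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for item_id, label in predictions:
        by_class[label].append(item_id)
    return {
        label: rng.sample(ids, min(per_class, len(ids)))
        for label, ids in by_class.items()
    }


def precision_from_review(reviewed):
    """reviewed: (predicted_label, true_label) pairs from the manual pass."""
    hits, totals = defaultdict(int), defaultdict(int)
    for predicted, true in reviewed:
        totals[predicted] += 1
        hits[predicted] += int(predicted == true)
    return {label: hits[label] / totals[label] for label in totals}


# Toy prediction log: 100 items predicted "spam", 10 predicted "ham".
predictions = [(i, "spam") for i in range(100)] + [(i, "ham") for i in range(100, 110)]
review_batch = sample_for_review(predictions, per_class=20)

# Toy outcome of a manual review of three sampled items.
reviewed = [("spam", "spam"), ("spam", "ham"), ("ham", "ham")]
precision = precision_from_review(reviewed)
print({label: len(ids) for label, ids in review_batch.items()}, precision)
```

Reviewed items where predicted and true labels differ are the false positives that would go back into the training corpus; classes with low estimated precision are the ones to sample more heavily in the next round.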

gloomy pulsar Aug 16, 2024, 12:30 PM

#

Happy Friday, August 16: It’s all about AI and Automation! 🎉
Hello, I am new here :)

I need your collective wisdom for an AI challenge! 🧠

My mission: Describe images with AI, and I’ve set my sights on LLaVA.

The issue: I’m a bit lost on how to choose the best approach! 🏊‍♂️

Quick context:
• I previously used OpenRouter (which used Fireworks)
• But it’s no longer available 😢
• I’m looking to use Python
• I struggled this morning with PyTorch (persistent DLL file issues) 😅
• My laptop doesn’t have a powerful graphics card

What I’m looking for:

• An API rather than a local solution (too complicated for me right now)
• Cost-effective options
• Technically simple solutions

I’ve already explored a few options:
• Replicate
• Hugging Face
• Fal.ai
• Google Colab...

But I’m a bit confused by all these options and their differences... 🤔

Questions for you, wise developers:
• What would be the best API for using LLaVA in my case?
• How can I navigate through all the variations of LLaVA?
• Do you have a simple comparison of the models (efficiency/cost)?
• Are there other options I might have missed?

I don’t want to dive in headfirst without understanding all possibilities first. Basically, how would you go about researching and choosing the best option?

Thanks in advance for your insights! 🙏✨

Please excuse my English if it’s not perfect, as I’m not a native speaker.

bold snow Aug 16, 2024, 12:32 PM

#

any idea how to get started with machine learning without heavy math background?

agile cobalt Aug 16, 2024, 12:34 PM

#

bold snow any idea how to get started with machine learning without heavy math background?

study math blobshrug
you will probably want to understand a little of calculus and linear algebra, at least enough to understand why things work

lapis sequoia Aug 16, 2024, 12:34 PM

#

Please excuse my English if it’s not perfect, as I’m not a native speaker.

are you sure you aren't native speaker

lapis sequoia Aug 16, 2024, 12:35 PM

#

bold snow any idea how to get started with machine learning without heavy math background?

check fast.ai

agile cobalt Aug 16, 2024, 12:35 PM

#

bold snow any idea how to get started with machine learning without heavy math background?

check the pinned mml-book (mathematics for machine learning)

bold snow Aug 16, 2024, 12:36 PM

#

agile cobalt study math blobshrug you will probably want to understand ...

my problem is I don't know how little I should learn, because I'm not keen on long studying; I learn fast when I'm doing something

lapis sequoia Aug 16, 2024, 12:38 PM

#

my take is that you would rather get started and build an intuition with a library that makes stuff for you

gloomy pulsar Aug 16, 2024, 12:38 PM

#

lapis sequoia Aug 16, 2024, 12:38 PM

#

bc that makes learning easier afterwards, in a way decoupling terminology from concepts.

bold snow Aug 16, 2024, 12:39 PM

#

lapis sequoia my take is that you would rather get started and build an intuition with a libra...

like sklearn or tensorflow?

gloomy pulsar Aug 16, 2024, 12:39 PM

#

lapis sequoia > Please excuse my English if it’s not perfect, as I’m not a native speaker. ar...

now u know with my audio,)

lapis sequoia Aug 16, 2024, 12:39 PM

#

like fast.ai

bold snow Aug 16, 2024, 12:39 PM

#

got it thank you very much

lapis sequoia Aug 16, 2024, 12:39 PM

#

you can also check pytorch slowly, it's great and has got many examples

bold snow Aug 16, 2024, 12:40 PM

#

thank you very much

lapis sequoia Aug 16, 2024, 12:41 PM

#

np, don't just listen to my advice though, that's one angle, the book suggested to you is another, and the book is very good.

agile cobalt Aug 16, 2024, 12:43 PM

#

gloomy pulsar **Happy Friday, August 16: It’s all about AI and Automation! 🎉** Hello i am new...

either fal or replicate should work fine

if you need to test something with 0 costs you can try using the Gradio API for a Hugging Face Space that uses Llava like https://huggingface.co/spaces/llava-hf/llava-4bit

lapis sequoia Aug 16, 2024, 12:51 PM

#

i didn't understand much of those posts (a lot of jargon for me to parse.) but this is somewhat simpler, it's just a small piece of those posts

#

agile cobalt Aug 16, 2024, 12:53 PM

#

• How can I navigate through all the variations of LLaVA?
Using a different model should be as simple as changing the name of the model in one line of code
If you mean finding all variations that exist, browse through Hugging Face models or the models page of the API provider you plan to use
• Do you have a simple comparison of the models (efficiency/cost)?
You can look up benchmarks, but you should never expect benchmark performance to be a very good estimate of performance in real tasks; you must test and benchmark it on your own tasks

lapis sequoia Aug 16, 2024, 12:59 PM

#

The post starts

By the Stone-Weierstrass theorem,
and the image i linked is the theorem.. :-(

#

(from https://arxiv.org/pdf/1302.4389v4, neat paper imho)
The theorem says:

In mathematical analysis, the Weierstrass approximation theorem states that every continuous function defined on a closed interval [a, b] can be uniformly approximated as closely as desired by a polynomial function.
The paper/image says the same, but replaces polynomial with PWL (piecewise linear).

#

(bc maxout networks can approximate any PWL, they say they are universal fn approximators.)
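
A rough illustration of the PWL idea, not the paper's construction: a single maxout unit takes the max over a few affine functions, which gives a convex piecewise-linear function, and adding pieces tightens the fit. (As far as I recall, the paper gets arbitrary continuous PWL functions from the difference of two such convex ones.) The target `x**2`, the interval `[-1, 1]` and the helper names are assumptions for the demo.

```python
def maxout_unit(x, pieces):
    """A maxout unit: the max over several affine functions w*x + b."""
    return max(w * x + b for w, b in pieces)

def tangent_pieces(k, lo=-1.0, hi=1.0):
    """k tangent lines to f(x) = x**2 at evenly spaced points in [lo, hi]."""
    points = [lo + (hi - lo) * i / (k - 1) for i in range(k)]
    # The tangent at a is y = 2*a*x - a**2.
    return [(2 * a, -a * a) for a in points]

def max_error(k, samples=1001):
    """Largest gap between x**2 and the k-piece maxout fit on [-1, 1]."""
    pieces = tangent_pieces(k)
    xs = [-1.0 + 2.0 * i / (samples - 1) for i in range(samples)]
    return max(abs(x * x - maxout_unit(x, pieces)) for x in xs)

for k in (3, 5, 9, 17):
    print(k, round(max_error(k), 5))
```

Each doubling of the number of intervals cuts the worst-case error by about 4x, which is the "as closely as desired" part of the theorem in miniature.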

cinder tangle Aug 16, 2024, 1:31 PM

#

cinder tangle Hi, I am working on a small project related to RAG and am stuck (Apparently cuz ...

Can someone help me on this??? 😒😒😒

lapis sequoia Aug 16, 2024, 1:44 PM

#

sota in CIFAR-100 not improved in several years apparently?

#

https://paperswithcode.com/sota/image-classification-on-cifar-100

Papers with Code - CIFAR-100 Benchmark (Image Classification)

The current state-of-the-art on CIFAR-100 is EffNet-L2 (SAM). See a full comparison of 199 papers with code.

#

we should try to beat it :-). They've got a neat API https://github.com/paperswithcode/paperswithcode-client

GitHub

GitHub - paperswithcode/paperswithcode-client: API Client for paper...

API Client for paperswithcode.com. Contribute to paperswithcode/paperswithcode-client development by creating an account on GitHub.

north drift Aug 16, 2024, 3:16 PM

#

Hey guys!

#

quick question

#

Are you aware of any AutoML libraries that take advantage of CUDA or GPU?

#

I am working with AutoGluon at the moment, seems to be CPU intensive even though I am using GPU parameters. Is there anything that correctly integrates with Nvidia CUDA as per your knowledge?

If so, please drop a reply! Thanks!

wooden sail Aug 16, 2024, 3:18 PM

#

did you install the gpu version and set the configuration to use the gpu?

#

most standard/popular ML libraries CAN use gpus, but require you to set it up correctly

north drift Aug 16, 2024, 3:20 PM

#

yeah, I am using autogluon[all]

spare forum Aug 16, 2024, 3:20 PM

#

I don't have a GPU but I know you can use it with autogluon

north drift Aug 16, 2024, 3:21 PM

#

I see. It seems to be keen on using CPU but lemme look into it further

wooden sail Aug 16, 2024, 3:22 PM

#

different models and optimizers require you to tell them explicitly to use the gpu

spare forum Aug 16, 2024, 3:23 PM

#

It uses the CPU by default; it also uses all available cores with AutoGluon

gloomy pulsar Aug 16, 2024, 4:01 PM

#

agile cobalt > • How can I navigate through all the variations of LLaVA? Using a different mo...

thank you very much @agile cobalt !🙏
Thank you for your valuable response. There is a lot of information in it, and I really like the sentence of yours that I quoted above. Yes, one does get lost among all the proposed models, which is why I wondered how experienced developers handle it. That is not at all my case: I am just trying to run a simple script in Python!

As I am a beginner, I basically depend on the information that you and the AIs give me for guidance. In some cases I wasted too much time on useless choices when there was a very simple two-click solution. It is never easy to make the right decision, in my case to know which way to start, but thanks to your information I am already better equipped.

lapis sequoia Aug 16, 2024, 7:02 PM

#

damn

#

the amount of optimizers in nevergrad is crazy

#

I am looping through all of them to see which one is the best

lapis sequoia Aug 16, 2024, 8:00 PM

#

the results are in

#

I asked all of them to solve a maze

#

the worst one goes to "HaltonSearchPlusMiddlePoint" (which is quasi random search and idk what is middle point)

#

the best one is LargeDiagCMA (evolutionary strategy)

#

whats crazy is

#

Accelerated random search is better than all of them

#

and no library implements it

#

300 algos though thats insane

lapis sequoia Aug 16, 2024, 8:06 PM

#

lapis sequoia Accelerated random search is better than all of them

Actually I just realized that its not true

#

because I let them all have 10 seconds and nevergrad is a bit slower and does 10 times less iterations (even random search)

agile anvil Aug 16, 2024, 9:29 PM

#

I want to repost my #data-science-and-ml message periodically. Is once every couple of weeks, until someone responds, ok?

serene scaffold Aug 16, 2024, 9:49 PM

#

agile anvil I want to repost https://discord.com/channels/267624335836053506/366673247892275...

No

lapis sequoia Aug 16, 2024, 10:08 PM

#

https://corbin-c.medium.com/
Check out the MLP article. Is it accurate enough??

Medium

Corbin Chandler – Medium

Read writing from Corbin Chandler on Medium. My name is Corbin. I am super interested in Machine Learning & Computer Programming! I currently know ~18 coding languages.

agile anvil Aug 16, 2024, 10:36 PM

#

serene scaffold No

Ok, then please let me ask about classifiers in general for https://www.accenthelp.com/blogs/accenthelpblog/british-isles-accent-map

AccentHelp

British Isles Accent Map

British Isles Accent Map When people talk about a ‘British accent’, they tend to be thinking of the upper class Received Pronunciation accent. But what you might not realize is that the UK has a huge variety of accents, and a higher level of linguistic diversity than many other countries. These range from the lilt of t

serene scaffold Aug 16, 2024, 11:26 PM

#

agile anvil Ok, then please let me ask about classifiers in general for https://www.accenthe...

It's possible in principle if you have enough samples of each accent. You might need hours of audio and dozens of distinct speakers per accent.

#

It would be especially helpful if the dataset had speakers who contributed audio samples in more than one accent

nocturne valley Aug 17, 2024, 1:09 AM

#

agile anvil Ok, then please let me ask about classifiers in general for https://www.accenthe...

Is there a training data set for accent identification?

simple tapir Aug 17, 2024, 7:13 AM

#

Hi

#

in yolo

#

model = YOLO("yolov8n.yaml")
model = YOLO("yolov8n.pt")
model = YOLO("yolov8n.yaml").load("yolov8n.pt")

same as

model = YOLO("yolov8n.pt")

?

#

in the first example, we create a model from scratch and transfer the params of the pretrained yolo model

#

In the second one, we directly use the pretrained model

#

Is there any difference between them or are they just same?

lapis sequoia Aug 17, 2024, 8:25 AM

#

some ways u could find out: 1. log the model's weights + arch, 2. inspect the config file, 3. Try on a sample x and compare, 4. Read the docs. But as a guess i'd expect so. @simple tapir

#

the yaml file may be for re-training (or creating a model from scratch.), but can't say for sure.

#

Heuristics and example for when and how to consider dropout: https://www.kaggle.com/code/pavansanagapati/what-is-dropout-regularization-find-out

What is Dropout Regularization? Find out :)

Explore and run machine learning code with Kaggle Notebooks | Using data from [Private Datasource]

lapis sequoia Aug 17, 2024, 10:56 AM

#

Hi, I am looking for a Python-oriented AI notes PPT presentation , like python basics then numpy, pandas, matplotlib libraries.
Thanks

jaunty helm Aug 17, 2024, 11:47 AM

#

lapis sequoia Hi, I am looking for a Python-oriented AI notes PPT presentation , like python ...

I don't have one, but fyi none of what you said (np, pd, matplot) are inherently AI
you might have some luck broadening your search to just 'data science' or something

unique spoke Aug 17, 2024, 12:40 PM

#

Hey guys, Have a question on how I can run my program through the input from my phone's camera

#

Followed this : https://medium.com/@saicoumar/how-to-use-a-smartphone-as-a-webcam-with-opencv-b68773db9ddd

Medium

How to Use a Smartphone as a Webcam with OpenCV

Whether you don’t have a webcam or you want to take your camera wireless, using your smartphone can be a viable alternative. When working…

#

But didnt seem to work for me

indigo wing Aug 17, 2024, 12:52 PM

#

Hey, can someone tell me all the major steps and their minor steps in order? Like what comes first, followed by what. Where do we start from? data gathering > wrangling or elt pipelines > preprocessing, and which parts of it, etc. I am very confused about the details of the process. Like, transformation itself is part of preprocessing, but what else is part of it, and at what point in a project does it come?

#

for ai, ds and dl

lapis sequoia Aug 17, 2024, 1:00 PM

#

The quadratic loss assigns more importance to outliers than to the true data due to its square nature, so alternatives like the Huber, Log-Cosh and SMAE losses are used when the data has many large outliers.
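
A small sketch of that point, using the textbook Huber definition with threshold `delta` (the residual values are made up): the squared loss and its gradient keep growing with the residual, while Huber switches to linear growth, so a single outlier cannot pull the fit harder than `delta`.

```python
def squared_loss(r):
    """Quadratic loss 0.5 * r**2; its gradient with respect to r is r itself."""
    return 0.5 * r * r

def huber_loss(r, delta=1.0):
    """Quadratic for |r| <= delta, linear beyond it."""
    a = abs(r)
    if a <= delta:
        return 0.5 * a * a
    return delta * (a - 0.5 * delta)

def huber_grad(r, delta=1.0):
    """Gradient of the Huber loss: the residual clipped to [-delta, delta]."""
    return max(-delta, min(delta, r))

residuals = [0.1, -0.3, 0.2, 10.0]  # hypothetical; the last one is an outlier

for r in residuals:
    print(f"r={r:5.1f}  squared={squared_loss(r):7.3f} (grad {r:5.1f})  "
          f"huber={huber_loss(r):6.3f} (grad {huber_grad(r):4.1f})")
```

For the outlier the squared loss is 50.0 with gradient 10.0, while Huber gives 9.5 with gradient 1.0; for the small residuals the two losses are identical.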

#

never used the Huber loss but it seems quite common

#

im looking for some good book on constructing/designing/handcrafting loss functions w examples

indigo wing Aug 17, 2024, 1:25 PM

#

lapis sequoia im looking for some good book on constructing/designing/handcrafting loss functi...

See pinned messages on this channel, maybe they can help you

lapis sequoia Aug 17, 2024, 1:29 PM

#

indigo wing See pinned messages on this channel, maybe they can help you

thanks for the heads up, actually one of the books seems to have some stuff at least

rigid timber Aug 17, 2024, 1:31 PM

#

unkempt apex https://huggingface.co/abhirajeshbhai/symptom-2-disease-net

@unkempt apex can I use this model with a flask script?

versed pilot Aug 17, 2024, 2:33 PM

#

agile anvil Ok, then please let me ask about classifiers in general for https://www.accenthe...

This is a bit of a huge task. Accent and dialect are two different things. Accent is how you'd pronounce u in cup, or whether you pronounce the r in car etc. Dialect goes into local vocabulary and maybe even grammar/syntax e.g. supposedly in some places people still said "thou" until fairly recently.

lapis sequoia Aug 17, 2024, 2:56 PM

#

neat article, quite hard at parts https://en.wikipedia.org/wiki/Loss_function; sharing in case anyone wants to discuss it :-)

loud sluice Aug 17, 2024, 3:52 PM

#

Hey, I have a AI+Cybersec Hackathon Problem Statement
I'd like if anyone could give their insights as to how they would approach this and how you would interpret this

1. Automated data collection from RAW images (forensic images) and other formats using disk imaging tools
2. Automate the scanning and analysis of data, including files, system logs, registry entries, network activity etc.
**3. Identify indicators of compromise (IOCs) and related suspicious activities**
**4. Integrate AI/ML algorithms for anomaly detection and pattern recognition. The AI/ML feature should incorporate a scoring system and recommendation engine that allow investigators to quickly focus on the important artifacts.**
5. User-friendly review options should include interactive timelines and graphical summaries, while comprehensive reporting capabilities should allow exports in various formats such as PDF, JSON, and CSV.

Emphasis on the 3rd and 4th point

Thanks

#

correct me if im wrong, ig we have to make an Anomaly detection like tool for Real time packets.

#

but the scoring system part is kinda confusing(pt4)

midnight moon Aug 17, 2024, 4:01 PM

#

Hello everyone, I am a student. I want to work on a product in machine learning. I know programming languages like C, C++, Java and Python. I have also been learning from O'Reilly books and YT channels. How should I get started?

placid gazelle Aug 17, 2024, 4:07 PM

#

hi

runic parcel Aug 17, 2024, 5:23 PM

#

Answer the question based on the above context: {question}"""

SYSTEM_PROMPT = """Based on the following context: {context}, please recommend the best tools for the question: {question}. Provide the tool names only in a Python list format."""```
is this good for my Ai RAG, anything to add or remove for making it a good prompt by prompt engg?

unkempt apex Aug 17, 2024, 5:35 PM

#

rigid timber @unkempt apex can I use this model with a flask script?

sry for late reply!

wdym by using this model?
you can do everything with it, finetune , inference(give input and receive output)
but for flask, then you have to make an endpoint!
just like we use chatGPT api!

but if you are using for yourself then just use

warm river Aug 17, 2024, 5:41 PM

#

ok, which part of math should i practice for a.i ?

#

I am interested in a.i

#

ok

rigid timber Aug 17, 2024, 5:56 PM

#

unkempt apex sry for late reply! wdym by using this model? you can do everything with it, fi...

I was hoping to try it on a locally hosted web application

#

Im quite new to this

unkempt apex Aug 17, 2024, 5:56 PM

#

new on what?

#

web or AI?

hard steppe Aug 17, 2024, 6:00 PM

#

str(helper_llm.invoke(f"write the very short summarize & combine of the DuckDuckGo Search Result's Without Loosing Detail, Result:\n\n\n\n{result}").content)
I am using LangChain; the above is the prompt. How can I tell the AI not to include the preamble "Here is a short summary and combination of the DuckDuckGo search results without losing detail:"?
Current result:

Here is a short summary and combination of the DuckDuckGo search results without losing detail:\n\n**Summary:** OpenAI\'s CEO Sam ... interviews, including of members of the OpenAI Board of Directors.

Result I want:

I can use any model which is available on groq

rigid timber Aug 17, 2024, 6:43 PM

#

unkempt apex web or AI?

AI

unkempt apex Aug 17, 2024, 6:52 PM

#

rigid timber AI

have you loaded that model?

worldly dawn Aug 17, 2024, 8:06 PM

#

@radiant shadow @left tartan here too ^

leaden kayak Aug 17, 2024, 8:54 PM

#

What are your favorite prompt tips when using language and code models?

#

For me it’s « PEP8 style format » after a Python request

serene scaffold Aug 17, 2024, 8:56 PM

#

leaden kayak What are your favorite prompt tips when using language and code models?

What are you trying to do?

leaden kayak Aug 17, 2024, 9:07 PM

#

Getting more quality outputs from local models I use daily

#

It’s a question with broad applications

serene scaffold Aug 17, 2024, 10:17 PM

#

leaden kayak Getting more quality outputs from local models I use daily

Is this only for code generation? Because that doesn't go without saying.

spare forum Aug 17, 2024, 10:41 PM

#

leaden kayak What are your favorite prompt tips when using language and code models?

It's gonna be bad code anyway, just gaining some time but for anything serious no copy and paste

spring field Aug 17, 2024, 11:03 PM

#

I'm just curious, is this what you're expected to do throughout the internship? (it just doesn't seem like you'd be doing much of "generative" AI)

serene scaffold Aug 18, 2024, 1:35 AM

#

Looks like the internship is about building a data pipeline. It doesn't look like you'll be doing anything with any variety of AI. But your pipeline might support people who will.

serene grail Aug 18, 2024, 1:48 AM

#

serene scaffold Looks like the internship is about building a data pipeline. It doesn't look lik...

Are the skills gained from this sort of thing (building a data pipeline and working with data pipelines) usually useful if one wishes to work in the ML field? Do people there still do these things?
Or is it usually completely separate people building the data pipelines and making models?

past meteor Aug 18, 2024, 1:48 AM

#

serene grail Are the skills gained from this sort of thing (building a data pipeline and work...

Yes, they're very relevant

#

In many jobs and roles (all the ones I've had) they were one person doing both

#

You can have a data engineer without a data scientist / ML engineer but not vice versa. If your future employer makes the mistake of hiring a ML person without a data engineer then you'll have to (be willing to) do both

serene grail Aug 18, 2024, 1:51 AM

#

past meteor Yes, they're very relevant

I see, thank you!

agile anvil Aug 18, 2024, 1:55 AM

#

versed pilot This is a bit of a huge task. Accent alone from dialect are two different things...

I disagree, the outlines around classification have been drawn decades ago, by those measuring the first and second formants of vowel shifts: https://www.cambridge.org/core/journals/journal-of-the-international-phonetic-association/article/abs/formant-frequencies-of-vowels-in-13-accents-of-the-british-isles/857541BE2E95A40117CBF24DE5836F6E

Cambridge Core

Formant frequencies of vowels in 13 accents of the British Isles | ...

Formant frequencies of vowels in 13 accents of the British Isles - Volume 40 Issue 1

clear ore Aug 18, 2024, 3:32 AM

#

Hi, how are you? Can you please tell me what I should learn next? I am a little bit confused. I completed a Python bootcamp. Which field is best: Data Science, Data Analyst, Cyber Security, or AI Engineer? About myself: my name is Danish, and I am doing a BS in Information Management at Punjab University Lahore. Thanks!

versed pilot Aug 18, 2024, 4:40 AM

#

agile anvil I disagree, the outlines around classification have been drawn decades ago, by t...

So that limits the scope only to accent, not to dialect, and only to vowels

rigid timber Aug 18, 2024, 6:02 AM

#

unkempt apex have you load that model?

I don’t know how

unkempt apex Aug 18, 2024, 6:54 AM

#

rigid timber I don’t know how

then learn to do that

river cape Aug 18, 2024, 7:55 AM

#

Hi guys is it hard to build an image generation model without the use of any gpt?

buoyant vine Aug 18, 2024, 8:59 AM

#

Hard

#

You need a lot of data

dense lichen Aug 18, 2024, 9:16 AM

#

Hey guys

I just got an idea but don't know where to start from.
So we use a postgresql DB
I was wondering if someone could guide me on how I can like just give a chat prompt for people, and an LLM model could understand based on the schema and table descriptions that I am going to provide.

I want to train the model with the database schema and its descriptions.

What I want to do is help people avoid giving information to all the common platforms they use, like ChatGPT, Claude etc., and just train my own model. This way the users don't have to keep explaining things to the AI to get answers.
I am a professional, however im very new to all these so just wanted to know if this is something already done or any tutorials that could help me with it.

lapis sequoia Aug 18, 2024, 9:47 AM

#

river cape Hi guys is it hard to build an image generation model without the use of any gpt...

did you look into variational autoencoders? (encodes into latent vector, continuous space, and decodes to new data.)

river cape Aug 18, 2024, 10:09 AM

#

lapis sequoia did you look into variational autoencoders? (encodes into latent vector, continu...

Yes, encoder-decoder, but building a model that can handle at least 1% of the data, is it hard?

lapis sequoia Aug 18, 2024, 10:10 AM

#

river cape Yes Encoder-decoder , but buildiing a model that can handle least 1% of the data...

ive no idea im afraid, but maybe smone else here knows

river cape Aug 18, 2024, 10:11 AM

#

lapis sequoia ive no idea im afraid, but maybe smone else here knows

Because I have understood the way it works, but then I wondered: can I build any GPT model from scratch that does at least 2% of the job?

indigo wing Aug 18, 2024, 10:28 AM

#

I ran a model 100 times in dev. I want to set a limit somewhat above the highest observed value so that I can take care of the performance in prod. What is the appropriate percentage to add above the upper bound?

#

example tc 10-100 seconds in dev. There are also ram constraints as 100% ram being used

#

what's the % I should add above the upper bound that's not too extreme a case

#

10% seems too much if it takes the model 10 s, and too much if it takes 1 week

#

I want my data to determine my extremes.

#

So that when I am working on my code in dev, and test it. It will throw error if it takes more than the acceptable range
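
One possible way to let the data set the limit instead of a fixed percentage, sketched under the assumption that the 100 dev timings are available as a list: take the mean plus a few standard deviations, and never go below the slowest run actually observed. The margin then scales with how much the runs vary, whether they take seconds or days. The function names, the factor `k=3` and the sample timings are hypothetical; this is a heuristic, not a guarantee about prod.

```python
import statistics

def runtime_limit(samples, k=3.0):
    """Upper limit: mean + k standard deviations, never below the observed max."""
    mean = statistics.fmean(samples)
    std = statistics.stdev(samples)
    return max(max(samples), mean + k * std)

def check_runtime(elapsed, limit):
    """Raise if a run exceeds the limit derived from earlier runs."""
    if elapsed > limit:
        raise RuntimeError(f"run took {elapsed:.1f}s, limit is {limit:.1f}s")

# Hypothetical dev timings in seconds.
dev_runs = [10.0, 12.0, 11.0, 13.0, 12.0, 14.0, 11.5, 12.5]
limit = runtime_limit(dev_runs)
print(f"limit = {limit:.2f}s")

check_runtime(12.0, limit)  # within the limit, passes silently
```

The same pattern works for peak RAM: collect the measurements, derive a limit from them, and fail the test when a run exceeds it.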

lapis sequoia Aug 18, 2024, 10:48 AM

#

it's quite weird that the loss fn can be considered as just a fn you want to minimise, vs having some relationship to statistics

#

i wish one day ill understand that :-(

#

seems how wikipedia starts the page about Loss

#

maybe learning how linear regression can be seen as an optimisation problem (least squares, by calculus or linear algebra approach.) or some statistics problem can help, idk.
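
A tiny sketch of that link, with made-up data: fitting a line by minimising the mean squared error can be done by calculus (set the derivatives to zero and solve) or by treating the loss as a function to descend step by step, and both land on the same slope and intercept. The learning rate and step count are assumptions chosen so this toy example converges.

```python
def fit_closed_form(xs, ys):
    """Least squares via calculus: set d(MSE)/dw = d(MSE)/db = 0 and solve."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    w = sxy / sxx
    return w, my - w * mx

def fit_gradient_descent(xs, ys, lr=0.05, steps=5000):
    """The same problem seen purely as 'a function you want to minimise'."""
    n = len(xs)
    w = b = 0.0
    for _ in range(steps):
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 7.1, 8.8]  # hypothetical noisy line

w_cf, b_cf = fit_closed_form(xs, ys)
w_gd, b_gd = fit_gradient_descent(xs, ys)
print(round(w_cf, 4), round(b_cf, 4))
print(round(w_gd, 4), round(b_gd, 4))
```

The statistical reading is that minimising squared error is the same as maximum likelihood when the noise around the line is assumed Gaussian, which is one place where "just a function to minimise" and "statistics" meet.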

indigo wing Aug 18, 2024, 11:00 AM

#

river cape Because i have understood the way it works , but then i wondered to can I build ...

what is this percentage for?

compact valley Aug 18, 2024, 11:03 AM

#

Trying to figure out which OS to use for data engineering before I jump into learning
I've been a web dev for few years now and will transition to data by new year

#

I am comfy using any and all win/linux/macos

#

I just wanna know like what is the preference in workstation/laptop setups real data engies use

river cape Aug 18, 2024, 11:18 AM

#

indigo wing what is this percentage for?

100% would require the processing power of idk , maybe a lot , so used the 2%

river cape Aug 18, 2024, 11:20 AM

#

compact valley I am comfy using any and all win/linux/macos

I use Linux; it does the job, not much of a problem installing and running dependencies, and it handles environments too.

compact valley Aug 18, 2024, 11:22 AM

#

river cape I use LInux , does the job for you , no much of problem while installing and run...

What kind of machine do you own? Do you do it on a workstation or a laptop, and what are the specs?

river cape Aug 18, 2024, 11:24 AM

#

compact valley What kind of machine do you own, do you do it on a workstation what are the spec...

Okay, I have the HP-Pavilion gaming laptop, 16gb ram, 512gb ssd and gtx 1650. I have dual booted the pc into Windows and Linux

compact valley Aug 18, 2024, 11:24 AM

#

river cape Okay I have the HP-Pavilion gaming laptop , 16gb ram 512gb ssd and gtx 1650 , I ...

this is what I am trying to avoid, working on win with wsl lol
hoping that unix based MacBook with m3 chip will do it for me

river cape Aug 18, 2024, 11:24 AM

#

river cape Okay I have the HP-Pavilion gaming laptop , 16gb ram 512gb ssd and gtx 1650 , I ...

and for any ml or ds tasks, I use colab

#

One of the best tools

river cape Aug 18, 2024, 11:25 AM

#

compact valley this is what I am trying to avoid, working on win with wsl lol hoping that unix ...

Macbook should run perfectly well

compact valley Aug 18, 2024, 11:26 AM

#

I really really hope so, cuz I have no experience with data engineering tools and hoping that there will be no issues

#

like compatibility and stuff idk, just wanna take the job with me to cafe if I want to
and MacBooks are awesome for that

lapis sequoia Aug 18, 2024, 11:27 AM

#

pretty cool wiki article https://en.wikipedia.org/wiki/Empirical_risk_minimization

Empirical risk minimization

Empirical risk minimization is a principle in statistical learning theory which defines a family of learning algorithms based on evaluating performance over a known and fixed dataset. The core idea is based on an application of the law of large numbers; more specifically, we cannot know exactly how well a predictive algorithm will work in practi...

compact valley Aug 18, 2024, 11:29 AM

#

lapis sequoia pretty cool wiki article https://en.wikipedia.org/wiki/Empirical_risk_minimizati...

I barely understood anything lol

spare forum Aug 18, 2024, 11:29 AM

#

compact valley Trying to figure out which OS to use for data engineering before I jump into lea...

Only windows could be a pain, the rest is ok

compact valley Aug 18, 2024, 11:29 AM

#

spare forum Only windows could be a pain, the rest is ok

and yet the best data engi I know in real life uses 6th gen i5 windows PC... lol

lapis sequoia Aug 18, 2024, 11:30 AM

#

compact valley I barely understood anything lol

1st paragraph?

spare forum Aug 18, 2024, 11:31 AM

#

Most companies force the OS on you anyway. It's just a pain for spark and things like that, it's fine but not the best

compact valley Aug 18, 2024, 11:31 AM

#

lapis sequoia 1st paragraph?

Yes okay I think that explains the term Empirical Risk

lapis sequoia Aug 18, 2024, 11:31 AM

#

Background? i.e 2nd section

compact valley Aug 18, 2024, 11:32 AM

#

spare forum Most company forces you the OS anyway, It's just a pain for spark and things lik...

So should I get a MacBook Pro M3Pro chip 18gb ram lol

lapis sequoia Aug 18, 2024, 11:32 AM

#

ive got a mac mini w silicon chip, they are quite nice and cheap from what i see (2nd hand)

spare forum Aug 18, 2024, 11:32 AM

#

Sounds good

lapis sequoia Aug 18, 2024, 11:44 AM

#

if one assumes P(x,y) -or joint probability distribution- exists it seems that P(y|x) (or conditional distribution) makes sense for DL. That should be what the model ends up estimating.

#

P(y|x) is just a slice of P(x,y)
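
A toy version of the "slice" idea, with a made-up joint table over two inputs and two labels: fixing `x` picks out a slice of the joint, and dividing by the slice's total, which is `P(x)`, turns it into the conditional. The table values and the function name are hypothetical.

```python
# Hypothetical joint distribution P(x, y); the four entries sum to 1.
joint = {
    ("cat_img", "cat"): 0.35, ("cat_img", "dog"): 0.05,
    ("dog_img", "cat"): 0.10, ("dog_img", "dog"): 0.50,
}

def conditional(joint, x):
    """P(y | x): take the slice of the joint at x and renormalise by P(x)."""
    slice_at_x = {y: p for (xi, y), p in joint.items() if xi == x}
    p_x = sum(slice_at_x.values())
    return {y: p / p_x for y, p in slice_at_x.items()}

print(conditional(joint, "cat_img"))
print(conditional(joint, "dog_img"))
```

So the slice alone is not yet a distribution over `y`; the renormalisation by `P(x)` is what makes it one.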

#

i don't think the pixels are independent variables though..

radiant rock Aug 18, 2024, 11:56 AM

#

hey guys does anyone know any ai tools that automatically create flashcards for study? i like quizlet but it doesn't include everything

spare forum Aug 18, 2024, 12:05 PM

#

lapis sequoia if one assumes `P(x,y)` -or joint probability distribution- exists it seems that...

We rarely know the P(y|x) distribution; when we do, that's exactly where we can theoretically use the Bayes classifier directly

lapis sequoia Aug 18, 2024, 12:08 PM

#

spare forum We rarely know the P(y|x) distribution , when we do that's exactly where theoric...

isn't this the trained model? i.e f(X); say for a classification task.

#

im trying to map this fn to DL as well (the Risk), i'd say the integral is normally a sum, L the loss but can't see dP(x,y) mapping to anything

#

if one has \int sin(x)dx then maps to \sum sin(x) delta
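
For reference, the risk and its sample-based estimate can be written side by side. This is a sketch in the thread's notation, where L is the loss, f the model and P(x, y) the joint distribution; the sample size n and the index i are introduced here.

```latex
R(f) = \int L\bigl(f(x), y\bigr)\, dP(x, y)
     = \mathbb{E}_{(x, y) \sim P}\bigl[ L(f(x), y) \bigr]
\qquad
\hat{R}_n(f) = \frac{1}{n} \sum_{i=1}^{n} L\bigl(f(x_i), y_i\bigr)
```

dP(x, y) just means "weight each (x, y) by how likely it is". When the n training examples are drawn from P, each one gets weight 1/n, which plays the role that delta plays in the Riemann-sum analogy.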

spare forum Aug 18, 2024, 12:23 PM

#

dP(x, y) could mean it's for classification or regression, it's a measure, dw it's just a very general way of writing

spare forum Aug 18, 2024, 12:27 PM

#

lapis sequoia isn't this the trained model? i.e f(X); say for a classification task.

We don't know an explicit conditional distribution. If we knew P(y|x), we could explicitly build a model with it; let's say for binary classification we could write it simply with P(y=0|x),

lapis sequoia Aug 18, 2024, 12:28 PM

#

spare forum dP(x y) could mean it's for classification or regression, it's a measure, dw it'...

wait isn't that 1/n ? ig it's not..

lapis sequoia Aug 18, 2024, 12:30 PM

#

spare forum We don't know an explicit conditional distribution, if we know P(y|x) then we ca...

i dont understand, if we run many examples x through f(x) and f is the NN, and the output is interpreted as probability, it seems to me P(y|x)~f(x) ?

leaden kayak Aug 18, 2024, 12:31 PM

#

serene scaffold Is this only for code generation? Because that doesn't go without saying.

Mostly for code generation, inspiration and learning, not copy-paste.

spare forum Aug 18, 2024, 12:34 PM

#

lapis sequoia i dont understand, if we run many examples `x` through `f(x)` and `f` is the NN,...

the classifier has to return a class (or a number for regression); in this case, theoretically, the classifier is what you get when you apply the argmax on top
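
A short sketch of the two views, with made-up network outputs: the softmax of the logits is the model's estimate of P(y|x), and the classifier proper is the argmax applied on top of it. The logit values and function names are assumptions for illustration.

```python
import math

def softmax(logits):
    """Turn raw scores into a probability vector, read as an estimate of P(y|x)."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def classify(logits):
    """The classifier itself: index of the most probable class."""
    probs = softmax(logits)
    return max(range(len(probs)), key=probs.__getitem__)

logits = [2.0, 0.5, -1.0]  # hypothetical outputs of f(x) for one input x
probs = softmax(logits)
print([round(p, 3) for p in probs])
print(classify(logits))  # 0
```

Whether those probabilities are close to the true P(y|x) is a separate question (calibration); the argmax only needs the largest one to be in the right place.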

fiery bane Aug 18, 2024, 12:34 PM

#

river cape Hi guys is it hard to build an image generation model without the use of any gpt...

very easy

#

just get a pretrained model, done.

lapis sequoia Aug 18, 2024, 12:35 PM

#

lol

spare forum Aug 18, 2024, 12:36 PM

#

fiery bane just get a pretrained model, done.

now if he ask "from scratch" let's run away

fiery bane Aug 18, 2024, 12:36 PM

#

now if he ask "but it has to be good" I'll answer: if you have few million dollars, you can do it by funding me and I'll do it for you

lapis sequoia Aug 18, 2024, 12:37 PM

#

he meant general pretrained model :-)

fiery bane Aug 18, 2024, 12:37 PM

#

what's a general pretrained model?

lapis sequoia Aug 18, 2024, 12:39 PM

#

any pretrained model, it was just a bad joke

fiery bane Aug 18, 2024, 12:39 PM

#

lol ok haha,

spare forum Aug 18, 2024, 12:40 PM

#

lapis sequoia i dont understand, if we run many examples `x` through `f(x)` and `f` is the NN,...

but yeah anyway it's obviously the goal to approximate this

lapis sequoia Aug 18, 2024, 12:40 PM

#

isn't this what i meant

lapis sequoia Aug 18, 2024, 12:41 PM

#

spare forum but yeah anyway it's obviously the goal to approximate this

nice, thanks. i still dont really get it but am closer than before, effectively lowering my loss i think

fiery bane Aug 18, 2024, 12:44 PM

#

lapis sequoia nice, thanks. i still dont really get it but am closer than before, effectively ...

what are you talking about btw?

lapis sequoia Aug 18, 2024, 12:45 PM

#

im trying to understand a neural network in statistical terms as opposed to an optimised function, or something close to that @fiery bane

#

this article is so far the simplest description ive found, https://en.wikipedia.org/wiki/Empirical_risk_minimization
though not quite complete.

Empirical risk minimization

Empirical risk minimization is a principle in statistical learning theory which defines a family of learning algorithms based on evaluating performance over a known and fixed dataset. The core idea is based on an application of the law of large numbers; more specifically, we cannot know exactly how well a predictive algorithm will work in practi...

spare forum Aug 18, 2024, 12:46 PM

#

it applies to all supervised learning tbf

serene grail Aug 18, 2024, 12:48 PM

#

lapis sequoia seems how wikipedia starts the page about Loss

Isn't loss basically the difference between true predictions (true positives, true negatives) and false predictions (false positives, false negatives)?
So this is what you're trying to minimize: 0 loss -> your estimate lines up with the true values 100%
(I'm a noob so take this with a mountain of salt)

fiery bane Aug 18, 2024, 12:49 PM

#

lapis sequoia im trying to understand a neural network in statistical terms as opposed to an o...

like this? https://deeplearningtheory.com/

The Principles of Deep Learning Theory

Official website for The Principles of Deep Learning Theory, a Cambridge University Press book.

lapis sequoia Aug 18, 2024, 12:55 PM

#

serene grail Isn't loss basically the difference between true predictions (true positives, tr...

yeah, 100%

#

but it's possible to see it statistically

lapis sequoia Aug 18, 2024, 12:56 PM

#

fiery bane like this? https://deeplearningtheory.com/

oh no, no that book lol

#

if edward witten likes it, i wont understand it

#

but yeah, it includes a lot of fantastic stuff

fiery bane Aug 18, 2024, 12:59 PM

#

lol ias people

fiery bane Aug 18, 2024, 1:00 PM

#

lapis sequoia if edward witten likes it, i wont understand it

How about this? https://yann.lecun.com/exdb/publis/orig/lecun-06.pdf

lapis sequoia Aug 18, 2024, 1:01 PM

#

that looks good, nice to see some physics formulas there

#

thank you both @fiery bane @spare forum

spare forum Aug 18, 2024, 1:07 PM

#

It would be easier to find smthing like a master degree course or something

fiery bane Aug 18, 2024, 1:07 PM

#

spare forum It would be easier to find smthing like a master degree course or something

compared to what?

spare forum Aug 18, 2024, 1:08 PM

#

Finding articles etc...

fiery bane Aug 18, 2024, 1:09 PM

#

Well, maybe all he needed was that one article I posted lol

spare forum Aug 18, 2024, 1:11 PM

#

Just saying, bc on those courses you have a bit of everything centralized with the most important, may be heavier maths tho

fiery bane Aug 18, 2024, 1:12 PM

#

that's true.
I think the best combination is if there's a text book, and a course based on just that textbook

lapis sequoia Aug 18, 2024, 3:57 PM

#

finally understood the formula (approx)

#

the risk minimisation formula is:

weighing the error (loss) with the probability of that instance,
adding all up (integral in terms of x and y vectors) aka risk or probability of error,
and minimising it (wrt the weights).

#

(in practice, it ends up being the standard mean-loss minimisation by backpropagation.)
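
A rough sketch of the summary above, under stated assumptions. The risk is R(w) = ∫ L(f_w(x), y) dP(x, y), and since P is unknown it is estimated by the mean loss over n samples, (1/n) Σ L(f_w(x_i), y_i). The data-generating process, the squared loss, and the one-parameter "models" here are all hypothetical stand-ins, not anything from the linked article.

```python
import random

def squared_loss(prediction, target):
    return (prediction - target) ** 2

def empirical_risk(model, samples, loss=squared_loss):
    """Mean loss over samples drawn from P(x, y): the sample estimate of the risk integral."""
    return sum(loss(model(x), y) for x, y in samples) / len(samples)

# hypothetical data: y = 2x + noise, with x uniform on [0, 1]
rng = random.Random(0)
samples = []
for _ in range(1000):
    x = rng.random()
    samples.append((x, 2.0 * x + rng.gauss(0.0, 0.1)))

# two hypothetical one-parameter models f_w(x) = w * x
risk_good = empirical_risk(lambda x: 2.0 * x, samples)  # w close to the truth
risk_bad = empirical_risk(lambda x: 0.5 * x, samples)   # w far from the truth
print(risk_good, risk_bad)
```

Training then amounts to picking the weights with the lowest empirical risk, which is what the mean-loss minimisation does.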

fiery bane Aug 18, 2024, 4:09 PM

#

yea haha pretty much

#

have you read Bishop's PRML?

lapis sequoia Aug 18, 2024, 4:10 PM

#

no, but this paper really blew my mind https://papers.nips.cc/paper_files/paper/1991/file/ff4d5fbbafdf976cfdc032e3bde78de5-Paper.pdf

#

i dont know much and need to go in steps

#

never heard of this before https://en.wikipedia.org/wiki/Riemann–Stieltjes_integral but it was useful for it

Riemann–Stieltjes integral

In mathematics, the Riemann–Stieltjes integral is a generalization of the Riemann integral, named after Bernhard Riemann and Thomas Joannes Stieltjes. The definition of this integral was first published in 1894 by Stieltjes. It serves as an instructive and useful precursor of the Lebesgue integral, and an invaluable tool in unifying equivalent f...

fiery bane Aug 18, 2024, 4:11 PM

#

I mean, sounds like this is the kind of things that you want: https://www.microsoft.com/en-us/research/uploads/prod/2006/01/Bishop-Pattern-Recognition-and-Machine-Learning-2006.pdf

lapis sequoia Aug 18, 2024, 4:12 PM

#

actually looks fantastic, great plot quality

#

last few days i read 5 papers and saved about 50

#

XD im falling behind

fiery bane Aug 18, 2024, 4:15 PM

#

lapis sequoia last few days i read 5 papers and saved about 50

do you want more?

lapis sequoia Aug 18, 2024, 4:15 PM

#

no, i just want to understand

fiery bane Aug 18, 2024, 4:19 PM

#

good luck!

lapis sequoia Aug 18, 2024, 4:20 PM

#

fiery bane good luck!

thank you, same :-)

fiery bane Aug 18, 2024, 4:21 PM

#

I don't need luck.
I need miracles T__T

lapis sequoia Aug 18, 2024, 4:22 PM

#

u theist?

fiery bane Aug 18, 2024, 4:29 PM

#

yea sure

ionic valley Aug 18, 2024, 8:33 PM

#

do AI/ML positions ask for LC during interviews?

unkempt apex Aug 18, 2024, 8:47 PM

#

getting error while installing packages with pip

#

on aws ec2 instance

#

```
WARNING: pip is configured with locations that require TLS/SSL, however the ssl module in Python is not available.
WARNING: Retrying (Retry(total=4, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError("Can't connect to HTTPS URL because the SSL module is not available.")': /simple/pip/
WARNING: Retrying (Retry(total=3, connect=None, read=None, redirect=None, status=None)) after connection broken by 'SSLError("Can't connect to HTTPS URL because the SSL module is not available.")': /simple/pip/
```

#

this one
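
The warning itself says the interpreter has no usable `ssl` module, so pip cannot reach HTTPS indexes. A small check like the following, run with the same Python that pip uses, would confirm that. It only diagnoses and does not fix anything; a commonly reported cause is a Python built from source without the OpenSSL development headers, but that is an assumption about this particular instance.

```python
import importlib

def ssl_available():
    """Return True if this interpreter can import the ssl module."""
    try:
        importlib.import_module("ssl")
    except ImportError:
        return False
    return True

if ssl_available():
    import ssl
    print("ssl OK:", ssl.OPENSSL_VERSION)
else:
    print("ssl missing: this Python build cannot reach HTTPS package indexes")
```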

faint quail Aug 18, 2024, 11:51 PM

#

how do pytorch, tensorflow and other neural network frameworks save the weights and biases so efficiently? I'm making my own module from scratch and I notice that my file sizes are in the GBs, and all I did was copy the yolo v1 architecture

https://github.com/TheonlyIcebear/Neural-Net-Framework/blob/main/utils/network.py

GitHub

Neural-Net-Framework/utils/network.py at main · TheonlyIcebear/Neur...

I custom library I made for training neural networks from scratch, using numpy and scipy - TheonlyIcebear/Neural-Net-Framework

#

is it just that the data is compressed using some algorithm?

agile anvil Aug 19, 2024, 12:03 AM

#

versed pilot So that limits the scope only to accent, not to dialect, and only to vowels

I'm not interested in classifying dialect, just accent, partly because I believe simply identifying the vowels' first and second formants will provide as much geolocation information as more detailed examination of speech. It still requires dictation with phonetic time points (e.g. a first pass with a STT service like AssemblyAI, a second and third pass for forced alignment of words and phonemes with PocketSphinx, and a fourth pass putting each voiced phoneme into a formants extractor.) That could build a classifier with enough training data.

Unfortunately, nobody seems to have labeled training data of just "people native to $CITY, UK saying things". Is it even possible that doesn't exist somewhere yet?

serene scaffold Aug 19, 2024, 1:35 AM

#

just leaving this here.

iron basalt Aug 19, 2024, 4:56 AM

#

faint quail how does pytorch, tensorflow and other neural network frameworks save the weight...

zlib

faint quail Aug 19, 2024, 6:14 AM

#

iron basalt zlib

thx

tidal bough Aug 19, 2024, 7:54 AM

#

faint quail how does pytorch, tensorflow and other neural network frameworks save the weight...

What are you comparing with? Maybe the model you're looking at is quantized and your version isn't?
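
A back-of-the-envelope sketch of where the file size comes from, assuming the weights are plain floats. The size is roughly parameter count times bytes per parameter, so the storage format (binary float32 vs float64 vs a text dump) can matter a lot, while zlib tends to gain little on random-looking float data. The parameter count and the random weights are hypothetical stand-ins, not taken from the linked repo.

```python
import array
import random
import zlib

rng = random.Random(0)
n_params = 100_000  # hypothetical layer size, far smaller than a real YOLO v1
weights = [rng.gauss(0.0, 0.05) for _ in range(n_params)]

as_float64 = array.array("d", weights).tobytes()  # 8 bytes per weight
as_float32 = array.array("f", weights).tobytes()  # 4 bytes per weight
as_text = repr(weights).encode("ascii")           # cost of a naive text dump

print("float64 bytes:", len(as_float64))
print("float32 bytes:", len(as_float32))
print("text bytes:   ", len(as_text))
print("zlib(float32):", len(zlib.compress(as_float32)))
```

Comparing these numbers against the actual file on disk would show whether the format, the dtype, or extra saved state is the main cost.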

proper crag Aug 19, 2024, 8:16 AM

#

to enhance model's training time efficiency .....is it enough if i install eGPU only?

#

im using macbook

#

and im planning to deploy the model to docker then connect the model via an API to my EVE-NG

#

bcuz i want the model to analyse time series data from my lab, which resides inside EVE-NG
EVE-NG is a virtual environment for computer networking

brave yew Aug 19, 2024, 8:20 AM

#

can you guys tell me what your development environment is for working or fine tuning models? I as a student use a gaming laptop until i blew my gpu a while ago, it wasn't that strong (GTX 1650) but it got the work done, but now i only have integrated graphics to work with which is abhorrent, so... what are some places where i can migrate my project to, to get gpu access?

unkempt apex Aug 19, 2024, 8:21 AM

#

brave yew can you guys tell me what your development environment is for working or fine tu...

If you want to train, there are some options like kaggle, colab

#

But then working with it maybe you need to adapt for cloud gpu providers or maybe AWS things

#

I have tried both (AWS and colab), it depends on what you wanna do and how big your model is

proper crag Aug 19, 2024, 8:23 AM

#

proper crag to enhance model's training time efficiency .....is it enough if i install eGPU o...

this project is to use the model to analyse my network lab

brave yew Aug 19, 2024, 8:23 AM

#

unkempt apex If you want to train, there are some options like kaggle, colab

well i usually work with py files and not notebooks, are notebooks capable of OOP?

unkempt apex Aug 19, 2024, 8:23 AM

#

brave yew well i usually work with py files and not notebooks, are notebooks capable of OO...

Yeah, after finishing your code, you can also download that notebook as .py

proper crag Aug 19, 2024, 8:24 AM

#

also dont worry about the network, it's just my own smoll network lab project and i myself have been majoring in computer networking up to degree level

spare forum Aug 19, 2024, 8:25 AM

#

brave yew well i usually work with py files and not notebooks, are notebooks capable of OO...

It's python in all cases, notebooks just separate chunks of code and outputs

proper crag Aug 19, 2024, 8:25 AM

#

proper crag to enhance model's training time efficiency .....is it enough if i install eGPU o...

anyone ?

unkempt apex Aug 19, 2024, 8:26 AM

#

proper crag anyone ?

Wait , some one will respond

unkempt apex Aug 19, 2024, 8:27 AM

#

proper crag anyone ?

You want to decrease training time?

brave yew Aug 19, 2024, 8:27 AM

#

spare forum It's python in all cases, notebooks just separate chunks of code and outputs

okay guess i will learn google colab then

unkempt apex Aug 19, 2024, 8:28 AM

#

proper crag to enhance model's training time efficiency .....is it enough if i install eGPU o...

Then it literally depends on which eGPU u use

brave yew Aug 19, 2024, 8:28 AM

#

for finetuning models using pytorch will i require colab pro?

proper crag Aug 19, 2024, 8:28 AM

#

unkempt apex You want to decrease training time?

yh

unkempt apex Aug 19, 2024, 8:28 AM

#

brave yew for finetuning models using pytorch will i require colab pro?

Colab Pro mainly offers GPU access for more time!

#

You can still fine-tune within time limits

proper crag Aug 19, 2024, 8:29 AM

#

for model like SVM is it CPU or GPU focused?

lapis sequoia Aug 19, 2024, 8:29 AM

#

interesting paper this one https://arxiv.org/pdf/2012.05208

spare forum Aug 19, 2024, 8:30 AM

#

proper crag for model like SVM is it CPU or GPU focused?

Most classical ML models (like SVM) don't need a GPU

proper crag Aug 19, 2024, 8:30 AM

#

oh

#

ok..bcuz it uses SVM

unkempt apex Aug 19, 2024, 8:30 AM

#

proper crag for model like SVM is it CPU or GPU focused?

I used to learn SVM on my cpu

proper crag Aug 19, 2024, 8:34 AM

#

unkempt apex I used to learn SVM on my cpu

what CPU you used that time ?...and the data..also does its kind of pepega or so so?

lapis sequoia Aug 19, 2024, 8:34 AM

#

sneak peek for the curious

unkempt apex Aug 19, 2024, 8:34 AM

#

Yeah it also depends on dataset

#

But I still use ryzen 3 3200g

proper crag Aug 19, 2024, 8:35 AM

#

i mean asked you that time

unkempt apex Aug 19, 2024, 8:35 AM

#

Same sir

#

But if the dataset is on kaggle you can directly use their notebooks

proper crag Aug 19, 2024, 8:35 AM

#

i mean i wan to connect the model

#

to an application which is used for my virtual computer networking lab

unkempt apex Aug 19, 2024, 8:37 AM

#

Wdym by connecting model?

You can host that model and integrate APIs (just like GPT)

Or maybe add the model on kab, and then use that with short python code

proper crag Aug 19, 2024, 8:37 AM

#

EVE-NG computer networking virtual environment

unkempt apex Aug 19, 2024, 8:37 AM

#

proper crag to an application which is used for my virtual computer networking lab

Is that web app?

proper crag Aug 19, 2024, 8:37 AM

#

app

unkempt apex Aug 19, 2024, 8:37 AM

#

Ahh, then I am not familiar with that

#

I guess you have to use API then by hosting your model

proper crag Aug 19, 2024, 8:38 AM

#

an application, my computer networking lab is inside the app and i wan to connect the model to analyze traffic data of my lab

unkempt apex Aug 19, 2024, 8:39 AM

#

Have you ever tried integrating API calls in an app?
Any app

proper crag Aug 19, 2024, 8:39 AM

#

i'll try to search

#

although ty

unkempt apex Aug 19, 2024, 8:39 AM

#

But you have to host the model on webserver then

#

Wait lemme search for that then
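
A minimal sketch of "host the model and call it over an API", using only the Python standard library. The `predict` function is a hypothetical stand-in for a trained model (e.g. an SVM loaded from disk), and the JSON field names, port, and anomaly rule are made up for illustration. How EVE-NG would send traffic data to this endpoint is not covered here.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

def predict(features):
    """Stand-in for a trained model: flags traffic whose mean exceeds a limit."""
    mean = sum(features) / len(features)
    return {"anomaly": mean > 100.0, "mean": mean}

def handle_payload(raw_body):
    """Decode a JSON request body and return the JSON response body as bytes."""
    features = json.loads(raw_body)["features"]
    return json.dumps(predict(features)).encode("utf-8")

class PredictHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # read exactly the number of bytes the client announced
        length = int(self.headers.get("Content-Length", 0))
        body = handle_payload(self.rfile.read(length))
        self.send_response(200)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

def serve(host="0.0.0.0", port=8000):
    """Blocks forever; meant to be called from the container's entry point."""
    HTTPServer((host, port), PredictHandler).serve_forever()
```

Calling `serve()` inside the Docker container would expose the endpoint; any client that can POST JSON like `{"features": [50, 60]}` could then query the model.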

serene grail Aug 19, 2024, 8:40 AM

#

lapis sequoia sneak peek for the curious

Oooh, that's really interesting. I should read that paper

lapis sequoia Aug 19, 2024, 8:40 AM

#

nice, if you do we can discuss it

#

at least the intro, idk how complex it gets later so i might not be able to discuss the rest XD

unkempt apex Aug 19, 2024, 8:41 AM

#

proper crag i'll try to search

You can also integrate it in the app itself, but only if the model is small enough, and again that would be the wrong approach

brave yew Aug 19, 2024, 8:48 AM

#

wait... you can't use terminal in colab? how do you import libraries?

unkempt apex Aug 19, 2024, 8:48 AM

#

brave yew wait... you can't use terminal in colab? how do you import libraries?

Lol, nice question, colab does this for u

#

Just import it in code
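
Many common libraries come preinstalled on Colab, so a plain `import` often just works; for anything missing, a cell containing `!pip install <name>` runs pip from the notebook. A small sketch for checking what is already there before importing (the package names are only examples, and the last one is deliberately fake):

```python
import importlib.util

def is_installed(package_name):
    """True if `import package_name` would find a module in this environment."""
    return importlib.util.find_spec(package_name) is not None

for name in ("json", "torch", "definitely_not_a_real_package_123"):
    print(name, "->", "available" if is_installed(name) else "needs installing")
```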

brave yew Aug 19, 2024, 8:49 AM

#

damn i am dumb

lapis sequoia Aug 19, 2024, 8:49 AM

#

paid colab has terminal

lapis sequoia Aug 19, 2024, 9:07 AM

#

It's somewhat reasonable that AI won't work out of distribution, but does it learn generalisable units that can easily be reused out of distribution? The answer is to some extent yes (fine tuning, and other approaches), and no (they can't solve ARC challenges).

#

Why does this happen?

#

but most of those guys (see bengio and karpathy) now seem to disagree!

serene grail Aug 19, 2024, 9:12 AM

#

I don't know, I feel like this is the sort of question the leading experts are trying to solve and I don't have the knowledge
I mean, fundamentally you should be able to learn to generalize based on limited information because humans do it. That's the thing.
So maybe the approaches we are using are just not yet good enough, like we haven't discovered how to make machines that "learn concepts" in the way that allows for this sort of generalization

#

Some people would say "just throw more compute at it" but idk about that 🥴

lapis sequoia Aug 19, 2024, 9:14 AM

#

Yes, I agree, it's quite puzzling

#

that paper says:

"In this work, we will adopt a more unified approach that addresses these problems from within the framework of connectionism."
we'll see (the "problems" are about creating more abstract, symbolic units; and "connectionism" is just standard deep learning.)

#

visually, it looks like this (the last bit is similar to Marvin Minsky's diagrams.):

main sluice Aug 19, 2024, 9:21 AM

#

Hi fellow data scientists

remote stream Aug 19, 2024, 9:22 AM

#

Bois is anyone interested in helping me in a project

#

Abt voice keyword detection

#

It's for a competition. I need help in vc

serene grail Aug 19, 2024, 9:26 AM

#

lapis sequoia visually, it looks like this (the last bit is similar to Marvin Minsky's diagram...

Hmmm, so basically every object corresponds to a "mental object" and that's what allows us to reason about things abstractly, like "this object is like this relative to this object"
This is my understanding
And NNs don't currently have that capability

lapis sequoia Aug 19, 2024, 9:30 AM

#

imho the notion of "object" isn't that difficult to learn, isn't SAM (Meta's Segment Anything Model) excellent at that?

#

im not sure whether it knows an object from a part of it, though, but does not confuse them in a way..

serene grail Aug 19, 2024, 9:33 AM

#

I don't know anything about SAM
But also "object" is kind of a really vague term

lapis sequoia Aug 19, 2024, 9:34 AM

#

yeah, there are papers about what an object is...

serene grail Aug 19, 2024, 9:34 AM

#

Like, anything is an object. You can say that any part of an object is an object, any property of an object is an object, any action is an object, etc.
If we're talking about a "mental object", which again, I'm not defining well so maybe what I'm saying doesn't make sense 🥴

lapis sequoia Aug 19, 2024, 9:37 AM

#

you can directly use sam

#

it's worth checking it https://segment-anything.com/demo

Segment Anything

Meta AI Computer Vision Research

serene grail Aug 19, 2024, 9:38 AM

#

Oh nice, I'll look into it later

lapis sequoia Aug 19, 2024, 9:39 AM

#

i love this one:

#

you hover, it selects the dog

serene grail Aug 19, 2024, 9:40 AM

#

If it's computer vision, it's more about detecting objects right?
I think the paper talked more about the ability to reason about objects (from reading the beginning)

serene grail Aug 19, 2024, 9:40 AM

#

lapis sequoia i love this one:

Oh cool

lapis sequoia Aug 19, 2024, 9:40 AM

#

yes, but for that you need segregation

#

one problem seems that networks have all the information merged

serene grail Aug 19, 2024, 9:42 AM

#

lapis sequoia visually, it looks like this (the last bit is similar to Marvin Minsky's diagram...

Which is the first step here right

lapis sequoia Aug 19, 2024, 9:42 AM

#

exactly

#

and the reasoning is the composability

#

so i'd do: SAM => NN 1 => NN 2 say

#

NN 1 may not be necessary actually. that's only for CV

serene grail Aug 19, 2024, 9:44 AM

#

lapis sequoia so i'd do: `SAM => NN 1 => NN 2` say

Yeah I think that would make sense to have multiple models working together

lapis sequoia Aug 19, 2024, 9:44 AM

#

in my mind binding problem == not having segregation

remote stream Aug 19, 2024, 9:44 AM

#

Guys is there someone who's willing to collaborate with me on a keyword detection project I don't have much knowledge in that field any help is appreciated

lapis sequoia Aug 19, 2024, 10:34 AM

#

read sects 1 & 2, will read 3 tomorrow likely

serene grail Aug 19, 2024, 10:36 AM

#

Nice, I only read section 1 so far

lapis sequoia Aug 19, 2024, 11:53 AM

#

ended up reading 3 as well, not all the details though, and will instead read 4 + 5 tom.

warm mortar Aug 19, 2024, 1:16 PM

#

Any ESRGAN expert here??

serene scaffold Aug 19, 2024, 1:41 PM

#

warm mortar Any ESRGAN expert here??

Hello, remember to never ask to ask. always ask your actual question.

warm mortar Aug 19, 2024, 2:31 PM

#

serene scaffold Hello, remember to never ask to ask. always ask your actual question.

I am asking who is a ESRGAN expert here? So I can get to know about upscaling and stuff

serene scaffold Aug 19, 2024, 2:32 PM

#

warm mortar I am asking who is a ESRGAN expert here? So I can get to know about upscaling an...

right. don't ask who knows about ESRGAN. ask a question that someone who knows about ESRGAN could start answering.

#

by waiting for someone who thinks they know about ESRGAN to present themselves, you're creating extra steps for that person if they ever appear, and preventing other people from potentially helping.

polar zinc Aug 19, 2024, 2:59 PM

#

Hi, does anyone know how I can plot a line for average increase over time using one axis with Matplotlib?
Usually I do it based on 2 axes but cannot get any methods to work using 1
example data = [10,20,12,14,12,9,15,18,12,10,15,14,17,10,20]
The other axis is the days. Example: ["10th July", "11th July", "12th July", "15th July", "16th July", "20th July"]

left plover Aug 19, 2024, 3:59 PM

#

np.mean()
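
One possible reading of "average increase over time" is a least-squares trend line, sketched here with the data from the question. Using the index 0, 1, 2, ... as a stand-in for the day axis is an assumption (the example day list is shorter than the data), and `plot_trend` is a hypothetical helper that only imports matplotlib when called.

```python
import numpy as np

data = [10, 20, 12, 14, 12, 9, 15, 18, 12, 10, 15, 14, 17, 10, 20]

x = np.arange(len(data))                   # 0, 1, 2, ... stands in for the day axis
slope, intercept = np.polyfit(x, data, 1)  # least-squares straight line
trend = slope * x + intercept              # one trend value per data point
running_mean = np.cumsum(data) / (x + 1)   # average of everything seen so far

def plot_trend():
    """Draw data, trend, and running mean against the index (needs matplotlib)."""
    import matplotlib.pyplot as plt
    plt.plot(data, label="data")           # a single argument is plotted against its index
    plt.plot(trend, label="trend")
    plt.plot(running_mean, label="running mean")
    plt.legend()
    plt.show()
```

Here `slope` is the average change per step, and `np.mean(data)` alone would give a single flat level rather than a line that rises or falls.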

warm mortar Aug 19, 2024, 6:53 PM

#

serene scaffold right. don't ask who knows about ESRGAN. ask a question that someone who knows a...

I want to know about ESRGAN upscaling images and videos on Google Colab???

spring field Aug 19, 2024, 7:09 PM

#

brave yew can you guys tell me what your development environment is for working or fine tu...

paperspace is pretty great

spring field Aug 19, 2024, 7:10 PM

#

warm mortar I want to know about ESRGAN upscaling images and videos on Google Colab???

someone who would know about ESRGAN could not answer this question without asking follow-up questions, please just ask your actual question

warm mortar Aug 19, 2024, 7:10 PM

#

spring field someone who would know about ESRGAN could not answer this question without askin...

#

My Google Colab shows this whenever I upscale videos using ESRGAN

#

How to solve this issue??

spring field Aug 19, 2024, 7:11 PM

#

this appears completely unrelated to ESRGAN apart from it being somewhat involved in the process as a whole

#

and there's not enough information to answer your question, you haven't defined a name
to fix this, you would define this name

#

if it works locally but not on google colab, consider it being an issue with environment compatibility, for example, you're using a newer or older version on google colab than locally

warm mortar Aug 19, 2024, 7:14 PM

#

spring field this appears completely unrelated to ESRGAN apart from it being somewhat involve...

I got this Colab notebook from someone, but they are unreachable. You have to mount Google Drive, then give the input folder and the model's path. After that the upscaling begins. But this is not working with videos. On images it works fine.
Should I send you the Colab for corrections????

spring field Aug 19, 2024, 7:15 PM

#

how did you get it "from someone" if they are unreachable?

spring field Aug 19, 2024, 7:17 PM

#

warm mortar I got this Colab from someone, but they are unreachable. And also you have to m...

the error is pretty clear on what the issue is and the issue appears to be in your notebook
because in some code path the name pre_upscale is not defined, yet used
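
A tiny reproduction of that kind of error, with a hypothetical function standing in for the notebook's code (the real ESRGAN pipeline was not shown in chat). The name is bound on one code path only, so the other path reaches the `return` with nothing assigned.

```python
def upscale_frame(frame, is_video):
    """Hypothetical stand-in for the notebook's code, not the real ESRGAN pipeline."""
    if not is_video:
        pre_upscale = frame * 2  # the name only gets bound on the image path
    return pre_upscale           # the video path reaches this line with no binding

print(upscale_frame(3, is_video=False))  # image path works
try:
    upscale_frame(3, is_video=True)
except NameError as err:
    # inside a function this surfaces as UnboundLocalError, a kind of NameError
    print("video path fails:", err)

def upscale_frame_fixed(frame, is_video):
    """Same idea with the name bound on every path before it is used."""
    pre_upscale = frame          # default so the name always exists
    if not is_video:
        pre_upscale = frame * 2
    return pre_upscale

print(upscale_frame_fixed(3, is_video=True))
```

What the default should actually be depends on what `pre_upscale` is meant to do in the notebook, which only the original code can tell.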

warm mortar Aug 19, 2024, 7:18 PM

#

spring field how did you get it "from someone" if they are unreachable?

I joined a community earlier, but it somehow got deleted or discontinued. A user there wrote this for me, but since then they haven't responded to messages or emails

warm mortar Aug 19, 2024, 7:19 PM

#

spring field the error is pretty clear on what the issue is and the issue appears to be in yo...

What to use instead of it???

spring field Aug 19, 2024, 7:21 PM

#

I would assume the same thing, but it's just not defined in that code path

#

and you still haven't provided more information
well, I suppose you asked whether you should and I didn't respond to that...
anyway, paste your code here: https://paste.pythondiscord.com

warm mortar Aug 19, 2024, 7:23 PM

#

spring field and you still haven't provided more information well, I suppose you asked whethe...

Should I send you my Colab??

spring field Aug 19, 2024, 7:24 PM

#

!paste no, paste it here

arctic wedgeBOT Aug 19, 2024, 7:24 PM

#

Pasting large amounts of code

If your code is too long to fit in a codeblock in Discord, you can paste your code here:
https://paste.pythondiscord.com/

After pasting your code, save it by clicking the Paste! button in the bottom left, or by pressing CTRL + S. After doing that, you will be navigated to the new paste's page. Copy the URL and post it here so others can see it.

warm mortar Aug 19, 2024, 7:30 PM

#

spring field !paste no, paste it here

I have pasted the Colab code, how to share it with you?

spring field Aug 19, 2024, 7:31 PM

#

send the link to the paste

warm mortar Aug 19, 2024, 7:31 PM

#

spring field send the link to the paste

https://paste.pythondiscord.com/LOIA

warm mortar Aug 19, 2024, 7:31 PM

#

warm mortar https://paste.pythondiscord.com/LOIA

Could you kindly review and correct what's necessary

warm mortar Aug 19, 2024, 7:32 PM

#

warm mortar Could you kindly review and correct what's necessary

I would be grateful to you

spring field Aug 19, 2024, 7:33 PM

#

can you point me to where in the code is pre_upscale defined?

warm mortar Aug 19, 2024, 7:34 PM

#

spring field can you point me to where in the code is `pre_upscale` defined?

Ok

#

Line number 56

spring field Aug 19, 2024, 7:35 PM

#

no, where is it defined?

#

that's where it is referenced

warm mortar Aug 19, 2024, 7:37 PM

#

spring field no, where is it defined?

In video upscaling

spring field Aug 19, 2024, 7:39 PM

#

which line of code is that?

#

do you have some understanding of how Python works? because I feel not and if that's the case, I would advise you to start from the beginning, you can check out the resources linked below to get started

#

!resources

arctic wedgeBOT Aug 19, 2024, 7:39 PM

#

Resources

The Resources page on our website contains a list of hand-selected learning resources that we regularly recommend to both beginners and experts.

warm mortar Aug 19, 2024, 7:40 PM

#

spring field which line of code is that?

warm mortar Aug 19, 2024, 7:40 PM

#

warm mortar

Could you correct it??

spring field Aug 19, 2024, 7:42 PM

#

warm mortar

as I said, it's not defined there, it's only referenced there, you need to define it first

and since you don't quite seem to understand that, I feel as though you should pick up on the basics before attempting such endeavours, that'll make it easier for you down the road too

warm mortar Aug 19, 2024, 7:45 PM

#

spring field as I said, it's not defined there, it's only referenced there, you need to defin...

How to define it???

spring field Aug 19, 2024, 7:47 PM

#

I'm afraid it won't be that simple, for one, I have no idea what that function is supposed to do, and two, it would take quite a bit of effort to define it (probably), so, again, I would suggest you start with the basics and slowly work your way up

warm mortar Aug 19, 2024, 7:49 PM

#

spring field I'm afraid it won't be that simple, for one, I have no idea what that function i...

Could you refer me to someone??

spring field Aug 19, 2024, 7:50 PM

#

arctic wedge

^ the resources here are fantastic

unkempt apex Aug 19, 2024, 8:07 PM

#

first time using U-Net model, so this is after 20 epochs,
but the mask should only contain the lines for road

faint quail Aug 19, 2024, 8:07 PM

#

nnuh uh

faint quail Aug 19, 2024, 8:08 PM

#

unkempt apex first time using U-Net model, ,, so this is after 20 epochs, but the mask should...

not too sure what that is but is that just an edge detector?

#

if so it seems to be doing its job

unkempt apex Aug 19, 2024, 8:09 PM

#

faint quail not too sure what that is but is that just a edge detector?

ofc edge detector for roads, and then drawing those lines

#

like this one, where it is only creating lines

spring field Aug 19, 2024, 8:11 PM

#

what was the input for that?

faint quail Aug 19, 2024, 8:11 PM

#

it seems like it is only detecting the lines over a certain thickness, so likely needs more training time / model capacity or its a dataset issue

#

idk tho im prolly wrong

unkempt apex Aug 19, 2024, 8:12 PM

#

spring field what was the input for that?

like this

#

num_classes = 1
is it okay?
because I only want that white lines?

spring field Aug 19, 2024, 8:15 PM

#

supervised or unsupervised?

unkempt apex Aug 19, 2024, 8:17 PM

#

supervised

#

dataset is on kaggle also

spring field Aug 19, 2024, 8:26 PM

#

unkempt apex like this

if these are the masks then you probably don't want to have a grayscale mask as an output, you want to convert pixels above a threshold to pure white and below the threshold to pure black and perchance calculate the loss with that

unkempt apex Aug 19, 2024, 8:29 PM

#

spring field if these are the masks then you probably don't want to have a grayscale mask as ...

        image = Image.open(img_path).convert('RGB')
        mask = Image.open(mask_path).convert('L')

#

this is the code I used while loading dataset

#

so all masks are grayscale

#

so do you think this is bothering it?

spring field Aug 19, 2024, 8:30 PM

#

I meant that you convert the output of the network to black and white, instead of having values in between

unkempt apex Aug 19, 2024, 8:30 PM

#

or I should use mask images as it is?

unkempt apex Aug 19, 2024, 8:31 PM

#

spring field I meant that you convert the output of the network to black and white, instead o...

not understood!

spring field Aug 19, 2024, 8:31 PM

#

unkempt apex first time using U-Net model, ,, so this is after 20 epochs, but the mask should...

like here with the predicted mask if the value of a pixel is say greater than 150, you just set it to 255 and if the value is less than 150, you just set it to 0
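
A small sketch of that thresholding step on a made-up 2x3 grayscale prediction with values in 0..255. If the network output were a sigmoid in [0, 1] instead, the threshold would need rescaling to the same range, for example 150/255.

```python
import numpy as np

# hypothetical 2x3 grayscale prediction with values in 0..255
predicted = np.array([[12, 149, 150],
                      [151, 200, 255]], dtype=np.uint8)

threshold = 150
# strictly greater than the threshold becomes white, everything else black
binary = np.where(predicted > threshold, 255, 0).astype(np.uint8)
print(binary)
# [[  0   0   0]
#  [255 255 255]]
```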

unkempt apex Aug 19, 2024, 8:31 PM

#

spring field like here with the predicted mask if the value of a pixel is say greater than 15...

okay okay got it now

#

lemme see how it can be done

spring field Aug 19, 2024, 8:32 PM

#

lemme know how it goes, I'm curious as well 😁

unkempt apex Aug 19, 2024, 8:35 PM

#

def predict_single_image(img_path, model, transform, device, threshold=150):
    image = Image.open(img_path).convert('RGB')
    image = transform(image).unsqueeze(0).to(device)
    
    with torch.no_grad():
        output = model(image)
        output = torch.sigmoid(output)
        output = output.squeeze().cpu().numpy()
    
    # now applying threshold
    binary_mask = (output > (threshold / 255.0)).astype(np.uint8) * 255
    return binary_mask

#

is it good?

#

nah, still not getting correct output

spring field Aug 19, 2024, 8:38 PM

#

you can just use np.where

unkempt apex Aug 19, 2024, 8:38 PM

#

where to use .where?

#

and why?

spring field Aug 19, 2024, 8:39 PM

#

binary_mask = np.where(output > (threshold / 255.0), 255, 0)

unkempt apex Aug 19, 2024, 8:39 PM

#

okay so no matter what I change the threshold to, it still gives me this

#

tried 0.5 also

spring field Aug 19, 2024, 8:41 PM

#

unkempt apex okay so no matter what I change the threshold to, it still gives me this

this what?

unkempt apex Aug 19, 2024, 8:41 PM

#

spring field this what?

same mask

#

no visible changes

spring field Aug 19, 2024, 8:41 PM

#

spring field ```py binary_mask = np.where(output > (threshold / 255.0), 255, 0) ```

are you using this?

unkempt apex Aug 19, 2024, 8:41 PM

#

yeah

spring field Aug 19, 2024, 8:42 PM

#

then it, uhh, doesn't make sense, if something were off with the threshold, you'd be getting either a completely white or a completely black image

#

(did you save the code?)

unkempt apex Aug 19, 2024, 8:42 PM

#

yeah auto save on vs code

spring field Aug 19, 2024, 8:43 PM

#

well, surely something's not running

#

did you rerun the code?

#

how are you displaying the image?

#

can you go into a debugger and look at it?
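
Short of a full debugger session, printing a summary of the array that gets displayed can show whether the threshold ran at all. A sketch with a hypothetical helper `describe_mask` and made-up arrays:

```python
import numpy as np

def describe_mask(mask):
    """Summarise a mask so a thresholding step can be verified without plotting."""
    values = np.unique(mask)
    return {
        "unique_values": values.tolist(),
        "is_binary": set(values.tolist()) <= {0, 255},
        "white_fraction": float(np.mean(mask == 255)),
    }

# hypothetical masks: a raw sigmoid-style output and its thresholded version
raw = np.array([[0.1, 0.4], [0.6, 0.9]])
binary = np.where(raw > 0.5, 255, 0)

print(describe_mask(raw))     # is_binary False: no threshold was applied
print(describe_mask(binary))  # is_binary True, half the pixels white
```

If the displayed mask still reports in-between values, the code path that thresholds is not the one being run.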

unkempt apex Aug 19, 2024, 8:44 PM

#

import torch
import torch.nn as nn
from torchvision import transforms
from PIL import Image
import matplotlib.pyplot as plt
from model import UNet 
from customDataset import CustomDataset  


model_path = 'best_model.pth'
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

model = UNet(n_class=1) 
checkpoint = torch.load(model_path, map_location=device)
model.load_state_dict(checkpoint['model_state_dict'])
model.to(device)
model.eval()


transform = transforms.Compose([
    transforms.Resize((256, 256)),
    transforms.ToTensor(),
])

def predict_single_image(img_path, model, transform, device):
    image = Image.open(img_path).convert('RGB')
    image = transform(image).unsqueeze(0).to(device)
    
    with torch.no_grad():
        output = model(image)
        output = torch.sigmoid(output)
        output = output.squeeze().cpu().numpy()
    
    return output

test_image_path = 'random.jpg' 
predicted_mask = predict_single_image(test_image_path, model, transform, device)

# Visualize the result
original_image = Image.open(test_image_path)
plt.figure(figsize=(10, 5))
plt.subplot(1, 2, 1)
plt.imshow(original_image)
plt.title('Original Image')
plt.axis('off')

plt.subplot(1, 2, 2)
plt.imshow(predicted_mask, cmap='gray')
plt.title('Predicted Mask')
plt.axis('off')

plt.show()

#

this is the whole test.py if you want

#

also , why to use debugger?

unkempt apex Aug 19, 2024, 8:45 PM

#

spring field did you rerun the code?

ofc

spring field Aug 19, 2024, 8:45 PM

#

unkempt apex ```py import torch import torch.nn as nn from torchvision import transforms from...

so, uhh, where in this do you have that np.where?

spring field Aug 19, 2024, 8:45 PM

#

unkempt apex also , why to use debugger?

debugger is an amazing tool

unkempt apex Aug 19, 2024, 8:45 PM

#

wtff sorry

#

I accidentally made the changes in train.py

#

it's late night here and I am still awake with half open eyes😂

#

need to sleep but after this testing

#

okay threshold is working

#

but not getting accurate

#

for example, setting it to 150 gives a fully black image

spring field Aug 19, 2024, 8:49 PM

#

perchance need to lower it
but also, try training on that mask

#

use it for calculating the loss from the ground truth and such

unkempt apex Aug 19, 2024, 8:50 PM

#

spring field perchance need to lower it but also, try training on that mask

training on mask?

#

how?

#

by specifying mask_image with threshold?

spring field Aug 19, 2024, 8:53 PM

#

mmm, I'm not sure, maybe what I'm thinking of is more suited for a metric instead of a loss. I was thinking of essentially using log loss and comparing the masks you get from the model after applying this threshold to the ground truth image
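As a rough illustration of the "metric instead of a loss" idea: thresholding is not differentiable, so comparing thresholded masks works as an evaluation metric rather than something to backpropagate through. A sketch using IoU and Dice, two common overlap metrics for segmentation. The function name, the 0.5 default, and the toy arrays are assumptions for the example, not anything from the code in this thread.

```python
import numpy as np

def iou_and_dice(pred_probs, true_mask, threshold=0.5, eps=1e-7):
    """Compare a thresholded prediction with a ground-truth mask.

    pred_probs: sigmoid outputs in [0, 1]; true_mask: any array where
    non-zero means foreground. eps avoids division by zero on empty masks.
    """
    pred = pred_probs > threshold
    true = true_mask > 0
    intersection = np.logical_and(pred, true).sum()
    union = np.logical_or(pred, true).sum()
    iou = (intersection + eps) / (union + eps)
    dice = (2 * intersection + eps) / (pred.sum() + true.sum() + eps)
    return float(iou), float(dice)

# Toy example: 3 predicted foreground pixels, 2 of which are correct.
probs = np.array([[0.9, 0.8], [0.2, 0.6]])
truth = np.array([[255, 255], [0, 0]])
iou, dice = iou_and_dice(probs, truth)
```

Tracking a number like this on a held-out set while sweeping the threshold gives a more objective picture than eyeballing one mask.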

unkempt apex Aug 19, 2024, 8:54 PM

#

is it a problem in training? because testing seems to be simple now (just use random.jpg and generate the mask from the trained model)

spring field Aug 19, 2024, 8:55 PM

#

I mean, clearly the model has either not had enough training or the training was ineffective

unkempt apex Aug 19, 2024, 8:56 PM

#

spring field I mean, clearly the model has either not had enough training or the training was...

okay will give a look at it, thanks for the time

versed heron Aug 19, 2024, 9:23 PM

#

hey guys, some friends and I are working on some hackathon projects this month in the DS/ML space. if anyones interested in joining in shoot me a message!

fallow tree Aug 19, 2024, 10:36 PM

#

guys

#

hello, where can I find a free OpenAI API just for testing?

serene grail Aug 19, 2024, 10:44 PM

#

fallow tree hello where can i find some free open ai Api just for testing

Specifically Open AI doesn't have free API as far as I know, it's paid
You could look at Hugging Face, I've heard they have some free API there (with a different model). It's going to be limited of course but for testing it should be fine

fallow tree Aug 19, 2024, 10:49 PM

#

serene grail Specifically Open AI doesn't have free API as far as I know, it's paid You could...

Thank u so much man ❤️ .

faint quail Aug 19, 2024, 10:56 PM

#

https://tenor.com/view/cat-fire-flamethrower-burn-on-fire-gif-7684110515453159552

#

they keel cat

fallow tree Aug 19, 2024, 11:02 PM

#

Another question please

#

are there any alternatives to Google Colab Pro? free with more RAM, cz I can't work within 12.7 GB of RAM

proper crag Aug 20, 2024, 12:29 AM

#

is Google Colab free?

#

if I want to use it to host my model

serene scaffold Aug 20, 2024, 12:41 AM

#

proper crag if i wan to use it to host my model

you can't use it to host models, no.

serene scaffold Aug 20, 2024, 12:43 AM

#

fallow tree is there any alternatives for Google colab pro ? free with more ram , cz i cant ...

the only other colab-like platform I know of is kaggle notebooks. but either way, no one is going to give you unlimited free compute.

agile cobalt Aug 20, 2024, 1:57 AM

#

Hugging Face Spaces is pretty generous for model deployment tbh

I don't think you're going to find >12GB RAM for free without strings anywhere though

devout fable Aug 20, 2024, 4:34 AM

#

hey, I just joined. Can anyone suggest a library for transforming Excel files into markdown that preserves as much of the original formatting as possible? I've done openpyxl -> pandas -> markdown, but you lose a lot of formatting there.

versed pilot Aug 20, 2024, 4:49 AM

#

fallow tree is there any alternatives for Google colab pro ? free with more ram , cz i cant ...

Have you looked at Github Codespaces? https://docs.github.com/en/codespaces/overview

Account plan | Storage per month | Core hours per month

GitHub Free | 15 GB-month | 120
GitHub Pro | 20 GB-month | 180

heavy lily Aug 20, 2024, 5:06 AM

#

Hii

#

Can someone help me with something?

#

#

I am getting my data like this

#

#

But i want it like this

devout fable Aug 20, 2024, 5:16 AM

#

data2.head()

versed pilot Aug 20, 2024, 5:16 AM

#

or even just data2 ?

main citrus Aug 20, 2024, 5:53 AM

#

Works too

#

You should remove the print

cosmic willow Aug 20, 2024, 7:26 AM

#

i wanted to make a network by reworking a tutorial
idk much about the matrix and vector operations involved but i know mostly how it should work
my problem is that my code isn't really learning. it stops at like 60% accuracy
code: https://paste.pythondiscord.com/JY7A
can anyone tell what i messed up?
tutorial: http://neuralnetworksanddeeplearning.com/chap1.html
i used the same settings and he starts at 90%, i arrive at 60%-70%

teal sapphire Aug 20, 2024, 7:50 AM

#

cosmic willow i wanted to make a network by revorking a tutorial idk much about the matrix and...

for the activation function derivative, maybe try this and tell me if it's good:

def sigmoid_prime(z: np.ndarray[float]) -> np.ndarray[float]:
    s = sigmoid(z)
    return s * (1 - s)

cosmic willow Aug 20, 2024, 7:51 AM

#

thx checking rn

teal sapphire Aug 20, 2024, 7:51 AM

#

Give me 10 minutes to write code for your evaluation function to make sure it's correctly computing accuracy

cosmic willow Aug 20, 2024, 7:51 AM

#

i can wait thx

#

also as far as i can see that should be a speedup, but it only avoids running it twice; it doesn't change the fact that my score seems to cap at 60.7%

teal sapphire Aug 20, 2024, 7:54 AM

#

    test_results = [(np.argmax(self.feedsforward(x)), np.argmax(y)) for (x, y) in test_data]
    return sum(int(x == y) for (x, y) in test_results) / len(test_data)

(Evaluation function)

#

try @cosmic willow

small wedge Aug 20, 2024, 7:54 AM

#

you don't need to convert them to int btw

#

since bool is a subclass of int

teal sapphire Aug 20, 2024, 7:55 AM

#

you right

small wedge Aug 20, 2024, 7:55 AM

#

not that you need to really optimize an evaluation function

teal sapphire Aug 20, 2024, 7:56 AM

#

small wedge not that you need to really optimize an evaluation function

making sure its correct is important

#

but yea

small wedge Aug 20, 2024, 7:56 AM

#

ofc ofc, I just mean it's a tiny tiny part of your runtime if you're training a model

teal sapphire Aug 20, 2024, 7:57 AM

#

Yes

#

optimizing stuff like

#

model architecture is more important

small wedge Aug 20, 2024, 7:57 AM

#

yeah

cosmic willow Aug 20, 2024, 7:58 AM

#

how could i implement printing the train accuracy too? i feel like it may be learning that only.

small wedge Aug 20, 2024, 7:59 AM

#

the function spartan gave you does calculate accuracy, just print it

cosmic willow Aug 20, 2024, 8:00 AM

#

small wedge the function spartan gave you does calculate accuracy, just print it

on the test data not the train data

teal sapphire Aug 20, 2024, 8:00 AM

#

u can add a method to calculate training accuracy

#

similar to evaluate

#

and you can update the learn method to print the accuracy on both the training and test datasets
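A minimal sketch of that idea: one accuracy helper that can be called on either dataset. Everything here is hypothetical scaffolding. `predict` stands in for the network's forward pass, the labels are assumed to be one-hot vectors, and the toy data is made up. If the test labels are plain integers (as in the linked tutorial's test set), compare against `y` directly instead of `np.argmax(y)`.

```python
import numpy as np

def accuracy(predict, data):
    """Fraction of (x, y) pairs where the predicted class matches the label.

    `predict` is any callable returning a vector of class scores
    (a stand-in for the network's forward pass); `y` is assumed one-hot.
    """
    correct = sum(np.argmax(predict(x)) == np.argmax(y) for x, y in data)
    return correct / len(data)

# Toy stand-in "network": the scores are just the input itself.
identity = lambda x: x
train_data = [(np.array([0.9, 0.1]), np.array([1, 0])),
              (np.array([0.2, 0.8]), np.array([0, 1]))]
test_data = [(np.array([0.7, 0.3]), np.array([0, 1]))]

train_acc = accuracy(identity, train_data)
test_acc = accuracy(identity, test_data)
print(f"train acc: {train_acc:.2f}, test acc: {test_acc:.2f}")
```

Printing both numbers once per epoch makes it easy to tell underfitting (both low) from overfitting (train high, test low).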

hard fern Aug 20, 2024, 8:01 AM

#

finally got my first data science job!

teal sapphire Aug 20, 2024, 8:02 AM

#

hard fern finally got my first data science job!

that is great man

cosmic willow Aug 20, 2024, 8:05 AM

#

the train and test accuracy seem to be the same, but it still starts at like 60%, goes to 67%, and goes up and down there

solemn warren Aug 20, 2024, 8:07 AM

#

hard fern finally got my first data science job!

congrats

cosmic willow Aug 20, 2024, 8:07 AM

#

hard fern finally got my first data science job!

congrats

craggy agate Aug 20, 2024, 2:48 PM

#

Is a 4080 Super good enough?

#

I know that it is capped at 16GB VRAM.

serene scaffold Aug 20, 2024, 2:49 PM

#

craggy agate Is a 4080 Super good enough?

Gaming GPUs and enterprise ML GPUs are now two separate classes, basically

craggy agate Aug 20, 2024, 2:49 PM

#

serene scaffold Gaming GPUs and enterprise ML GPUs are now two separate classes, basically

Yes but enterprise GPUs are pretty expensive.

agile cobalt Aug 20, 2024, 2:49 PM

#

craggy agate Is a 4080 Super good enough?

depends on what you want to run, I've seen some things that require >20GB

craggy agate Aug 20, 2024, 2:51 PM

#

agile cobalt depends on what you want to run, I've seen some things that require >20GB

I was hoping to work with smaller LLMs like Llama-3-8B and train neural nets

spare forum Aug 20, 2024, 2:51 PM

#

craggy agate Yes but enterprise GPUs are pretty expensive.

Do you need it tho

serene scaffold Aug 20, 2024, 2:51 PM

#

basically, any CUDA-enabled GPU is good enough for whatever you can fit on it. And if you're training/fine-tuning a model, you need to factor in the memory footprint of that as well.

If you're trying to fine-tune an interactive LLM that came out within the last year, gaming-tier GPUs might not be enough.

craggy agate Aug 20, 2024, 2:52 PM

#

serene scaffold basically, any CUDA-enabled GPU is good enough for whatever you can fit on it. A...

Hmm, even an 8B model?

serene scaffold Aug 20, 2024, 2:52 PM

#

calculate how much VRAM you would need to load the 8B param model, and then how much extra room you would need for fine-tuning.

#

if that can fit on a 4080, yay. if not, you might be able to quantize it.
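A back-of-the-envelope version of that calculation, as a sketch only. It assumes a round 8e9 parameters, counts weights but not activations, KV cache, or framework overhead, and uses the common rule of thumb of about 16 bytes per parameter for full fine-tuning with Adam in mixed precision (fp16 weights and gradients plus fp32 master weights and two optimizer moments). Real numbers depend on the framework and settings.

```python
def weights_gib(n_params, bytes_per_param):
    """Memory needed for n_params values at the given size, in GiB."""
    return n_params * bytes_per_param / 1024**3

n_params = 8e9  # assumption: a round 8B; real checkpoints differ slightly

fp16_gib = weights_gib(n_params, 2)      # just loading the weights in fp16
int4_gib = weights_gib(n_params, 0.5)    # 4-bit quantized weights
full_ft_gib = weights_gib(n_params, 16)  # rule of thumb: full fine-tune with Adam

print(f"fp16 weights:   {fp16_gib:6.1f} GiB")
print(f"4-bit weights:  {int4_gib:6.1f} GiB")
print(f"full fine-tune: {full_ft_gib:6.1f} GiB (before activations)")
```

Under these assumptions the fp16 weights alone nearly fill a 16 GB card, which is why quantization or parameter-efficient fine-tuning comes up for this size of model.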

spare forum Aug 20, 2024, 2:53 PM

#

You can use some computational resources from any cloud provider, don't need to have the latest GPU at home

serene scaffold Aug 20, 2024, 2:54 PM

#

I've never done the math on that, but buying compute time on an enterprise GPU for the specific experiments that you want to do is probably going to be cheaper than buying a gaming GPU.

#

(I don't train models on my gaming computer because then I wouldn't be able to game while I wait for the model to train.)

craggy agate Aug 20, 2024, 2:57 PM

#

serene scaffold I've never done the math on that, but buying compute time on an enterprise GPU f...

Yeah

spare forum Aug 20, 2024, 2:58 PM

#

There is no way it's economical to buy an enterprise GPU for this, it's more of a geeky satisfying thing

#

(which is fine)

craggy agate Aug 20, 2024, 2:59 PM

#

spare forum There is no way it's economical to buy entreprise GPU for this, it's more of a g...

true

agile cobalt Aug 20, 2024, 2:59 PM

#

serene scaffold (I don't train models on my gaming computer because then I wouldn't be able to g...

use a cloud gaming service to free up your GPU /s

craggy agate Aug 20, 2024, 3:06 PM

#

agile cobalt use a cloud gaming service to free up your GPU /s

Can I combine a bunch of these with NVlink or SLI:
https://www.amazon.ca/dp/B09SJ2BZ85?ref_=cm_sw_r_cp_ud_dp_QGCCXX2FHAP0HQX8S34R

PNY NVIDIA RTX A2000 12GB

NVIDIA® RTX™ A2000 12GB brings the power of RTX to more professionals with a powerful low-profile, dual-slot GPU design, delivering real-time ray tracing, AI-accelerated compute, and high-performance graphics to your desktop. The VR ready RTX A2000 12GB combines 26 second-generation RT Cores, 104...

#

Actually, 2 of them would also do the job

#

24GB VRAM

serene scaffold Aug 20, 2024, 3:07 PM

#

craggy agate Can I combine a bunch of these with NVlink or SLI: https://www.amazon.ca/dp/B09S...

you'll get a performance penalty each time a computation spans more than one device

craggy agate Aug 20, 2024, 3:08 PM

#

serene scaffold you'll get a performance penalty each time a computation spans more than one dev...

I see, so I'd benefit when tasks are larger than 12GB?

#

By tasks I mean models and Datasets

serene scaffold Aug 20, 2024, 3:09 PM

#

having two 12GB GPUs is worse than having one 24GB GPU, because data will occasionally need to move from one device to the other.

craggy agate Aug 20, 2024, 3:09 PM

#

serene scaffold having two 12GB GPUs is worse than having one 24GB GPU, because data will occasi...

Yes, true, but it would save me some money.

#

Whereas a 4090 would be nearly 1k$ more.

serene scaffold Aug 20, 2024, 3:09 PM

#

sure. I'm just letting you know that that's how it works.

craggy agate Aug 20, 2024, 3:10 PM

#

serene scaffold sure. I'm just letting you know that that's how it works.

How much of a performance impact could I face?

serene scaffold Aug 20, 2024, 3:10 PM

#

I'm not sure

craggy agate Aug 20, 2024, 3:13 PM

#

serene scaffold I'm not sure

Ohk, thanks though.

#

Either this or 2 used 3090s.

#

Cause new ones are very expensive.

jaunty helm Aug 20, 2024, 3:33 PM

#

craggy agate Cause new ones are very expensive.

there's also techniques like qlora or tools like unsloth to help reduce vram requirements

#

for running the llm you need a lot less

craggy agate Aug 20, 2024, 3:34 PM

#

jaunty helm there's also techniques like qlora or tools like unsloth to help reduce vram req...

Yeah, I thought LoRA was only to reduce training times tho?

#

does it help with VRAM limitations as well?

jaunty helm Aug 20, 2024, 3:34 PM

#

craggy agate Yeah, I thought LoRA was only to reduce training times tho?

qlora is like training on quantized models, and quantized models are smaller obv

craggy agate Aug 20, 2024, 3:35 PM

#

jaunty helm qlora is like training on quantized models, and quantized models are smaller obv

I thought LoRA was when you changed and trained 1-3% of the model's weights or something.

jaunty helm Aug 20, 2024, 3:35 PM

#

craggy agate I thought LoRA was when you changed and trained 1-3% of the model's weights or s...

the q in qlora is quantized
so you're doing stuff on a quantized model, and quantized models are smaller than not-quantized models, thus takes less vram
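To connect the two messages: LoRA freezes the original weight matrix and trains two small low-rank matrices next to it, and QLoRA does the same while keeping the frozen base weights in 4-bit. A sketch of the parameter arithmetic for a single weight matrix; the 4096x4096 shape and rank 8 are illustrative assumptions, not values taken from any specific model config.

```python
def lora_trainable_fraction(d, k, r):
    """Trainable share when a frozen d x k weight gets a rank-r LoRA adapter.

    The adapter is two matrices, B (d x r) and A (r x k), so it adds
    r * (d + k) trainable parameters next to d * k frozen ones.
    """
    full = d * k
    adapter = r * (d + k)
    return adapter / full

# Illustrative shape and rank, chosen only for the arithmetic.
frac = lora_trainable_fraction(d=4096, k=4096, r=8)
print(f"trainable fraction: {frac:.4%}")
```

Gradients and optimizer state are only needed for the adapter, which is where most of the training-memory saving comes from; quantizing the frozen base then shrinks the remaining footprint.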

craggy agate Aug 20, 2024, 3:36 PM

#

jaunty helm the `q` in qlora is `quantized` so you're doing stuff on a quantized model, and ...

Oh okay!

jaunty helm Aug 20, 2024, 3:36 PM

#

craggy agate I thought LoRA was when you changed and trained 1-3% of the model's weights or s...

here's a table on llama factory
https://github.com/hiyouga/LLaMA-Factory#hardware-requirement

GitHub

GitHub - hiyouga/LLaMA-Factory: Efficiently Fine-Tune 100+ LLMs in ...

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024) - hiyouga/LLaMA-Factory

craggy agate Aug 20, 2024, 3:38 PM

#

jaunty helm here's a table on llama factory https://github.com/hiyouga/LLaMA-Factory#hardwar...

I see thanks!

jaunty helm Aug 20, 2024, 3:39 PM

#

craggy agate I see thanks!

actually, I see people buying tesla P40s for inference, not sure how they are when it comes to training
they're pretty old, but have 24gb vram and are definitely cheaper than a 4090

craggy agate Aug 20, 2024, 3:40 PM

#

jaunty helm actually, I see people buying tesla P40s for inference, not sure how they are wh...

Oh, let me check that out

#

https://a.co/d/dBwfBMz

NVIDIA Video Card 900-22080-0000-000 Tesla K80 24GB DDR5 PCI-Expres...

Memory size (GDDR5): 24GB CUDA cores: 4992 Number Of GPUs: 2x GK120 GPUs Delivers 5-10x Boost In Key Application Performance for applications such as STAC-A2, RTM, SPECFEM3D, CAFFE, miniFEE, LSMS, Cloverleaf, CHROMA, Quantum Espresso, QMCPACK, HOOMD- Blue, NAMD, LAMMPS, GROMACS, AMBER

#

I found this for $200

#

It seems to have 24GB RAM

agile cobalt Aug 20, 2024, 3:44 PM

#

4992 cores sounds pretty low?

craggy agate Aug 20, 2024, 3:44 PM

#

agile cobalt 4992 cores sounds pretty low?

Seems to have more than a T4

#

Not bad

#

not good either

#

but not bad, especially for the price.

agile cobalt Aug 20, 2024, 3:45 PM

#

4080 SUPER was 10240?
hmm not as low as I thought

craggy agate Aug 20, 2024, 3:45 PM

#

agile cobalt 4080 SUPER was 10240? hmm not as low as I thought

Indeed

#

but won't the gen of the CUDA cores also play a role?

#

or are they all the same?

jaunty helm Aug 20, 2024, 3:50 PM

#

craggy agate https://a.co/d/dBwfBMz

that's a k80, even older than a p40
nonetheless, these are all old cards, and thus support an old version of CUDA, so I'd check compatibility at least before purchasing

agile cobalt Aug 20, 2024, 3:51 PM

#

craggy agate or are they all the same?

idk
also I have no idea what this means but from a review on the amazon page: This is NOT a graphics card, it is a graphics accelerator

jaunty helm Aug 20, 2024, 3:52 PM

#

like it's kinda special hardware and it doesn't function as your normal gaming gpu for example

serene grail Aug 20, 2024, 3:55 PM

#

serene scaffold I've never done the math on that, but buying compute time on an enterprise GPU f...

^This makes sense
YB, may I ask why do you prefer buying your own GPU instead of buying compute time?

craggy agate Aug 20, 2024, 4:19 PM

#

agile cobalt idk also I have no idea what this means but from a review on the amazon page: `T...

Oh okay

craggy agate Aug 20, 2024, 4:20 PM

#

serene grail ^This makes sense YB, may I ask why do you prefer buying your own GPU instead of...

No particular reason but I'd like running things locally, idk why.

craggy agate Aug 20, 2024, 4:26 PM

#

jaunty helm like it's kinda special hardware and it doesn't function as your normal gaming g...

I see

craggy agate Aug 20, 2024, 4:26 PM

#

jaunty helm like it's kinda special hardware and it doesn't function as your normal gaming g...

so i'd need a GPU to go with this?

jaunty helm Aug 20, 2024, 4:28 PM

#

craggy agate so i'd need a GPU to go with this?

no, but I think you need a power converter or something
I'm not too knowledgeable either and you should probably research it just in case

craggy agate Aug 20, 2024, 4:39 PM

#

jaunty helm no, but I think you need a power converter or something I'm not too knowledgeabl...

I think I know a server with people who are familiar with this kinda stuff.

jaunty helm Aug 20, 2024, 4:41 PM

#

craggy agate I think I know a server with people who are familiar with this kinda stuff.

you're definitely better off asking them about it then 😅
and gl

craggy agate Aug 20, 2024, 4:41 PM

#

jaunty helm you're definitely better off asking them about it then 😅 and gl

ty

lapis sequoia Aug 20, 2024, 5:02 PM

#

https://en.wikipedia.org/wiki/Metaphors_We_Live_By

Metaphors We Live By

Metaphors We Live By is a book by George Lakoff and Mark Johnson published in 1980. The book suggests metaphor is a tool that enables people to use what they know about their direct physical and social experiences to understand more abstract things like work, time, mental activity and feelings.

serene scaffold Aug 20, 2024, 5:14 PM

#

lapis sequoia https://en.wikipedia.org/wiki/Metaphors_We_Live_By

why have you linked this?

lapis sequoia Aug 20, 2024, 5:15 PM

#

just thought it was interesting

serene grail Aug 20, 2024, 5:19 PM

#

Is this related to the paper about how NNs don't have separate mental representations that you linked before?

lapis sequoia Aug 20, 2024, 5:22 PM

#

yeah, last part of the paper, they say that perceptions are the basis of concepts

#

this is less related, but really crazy https://en.wikipedia.org/wiki/Ideasthesia

Ideasthesia

Ideasthesia (alternative spelling ideaesthesia) is a neuropsychological phenomenon in which activations of concepts (inducers) evoke perception-like sensory experiences (concurrents). The name comes from the Ancient Greek ἰδέα (idéa) and αἴσθησις (aísthēsis), meaning 'sensing concepts' or 'sensing ideas'. The notion was introduced by neuroscient...

#

(the idea being that the trigger for some perceptions is semantic, as is believed in the case of synesthesia.)

serene grail Aug 20, 2024, 5:30 PM

#

lapis sequoia yeah, last part of the paper, they say that perceptions are the basis of concept...

That's interesting, I wonder what a "perception" would be for an NN
I've heard some people say that one of the major things that prevent these models from being closer to human performance is that they don't "learn on the fly", so to speak. And humans do, you learn something from every perception
But I'm not sure how human brains do this, is this just because biological neurons are extremely different from artificial "neurons" or is there something else

oblique isle Aug 20, 2024, 5:34 PM

#

guys what is the best environment to train and work on a chatbot?

lapis sequoia Aug 20, 2024, 5:41 PM

#

serene grail That's interesting, I wonder what a "perception" would be for an NN I've heard s...

yes, that's discussed in the paper, but they just describe it more or less like you did, there isn't a clear solution

serene scaffold Aug 20, 2024, 5:41 PM

#

oblique isle guys what is the best environmnt to train and work on a chatbot ?

A Linux machine with a large GPU.

lapis sequoia Aug 20, 2024, 5:42 PM

#

that they don't "learn on the fly", so to speak.
i think some are trying related stuff to solve the arc problem (by chollet; he offers 1M prize.)

#

not sure though, i vaguely remember.

oblique isle Aug 20, 2024, 5:48 PM

#

serene scaffold A Linux machine with a large GPU.

i dont have a large gpu

#

so clearly i need a cloud env or smtg like this

serene scaffold Aug 20, 2024, 5:48 PM

#

Yes

oblique isle Aug 20, 2024, 5:48 PM

#

what do u suggest

unkempt apex Aug 20, 2024, 6:27 PM

#

oblique isle what do u suggest

aws, if you want free then use kaggle or colab

#

#

okay so right one is predicted mask , but it's not that accurate

oblique isle Aug 20, 2024, 7:27 PM

#

thanks

left plover Aug 20, 2024, 7:28 PM

#

unkempt apex

Use cv2.Canny for it

unkempt apex Aug 20, 2024, 7:29 PM

#

left plover Use cv2.Canny for it

for what? the predicted image?
and why?

left plover Aug 20, 2024, 7:29 PM

#

It has better edge detection

unkempt apex Aug 20, 2024, 7:30 PM

#

heh? bruhh I am using U-Net model to train the images, so why to put canny here ?

spring field Aug 20, 2024, 7:43 PM

#

oblique isle what do u suggest

paperspace is really nice (mostly paid, but there are certain free things as well)

spring field Aug 20, 2024, 7:43 PM

#

unkempt apex aws, if you want free then use kaggle or colab

have you tried paperspace? they're pretty nice

unkempt apex Aug 20, 2024, 7:44 PM

#

spring field have you tried paperspace? they're pretty nice

currently on aws! on T4 gpu

lapis sequoia Aug 20, 2024, 7:46 PM

#

Isn't this a proof that in NNs the inputs can be considered random variables ? https://en.wikipedia.org/wiki/Independent_and_identically_distributed_random_variables#In_machine_learning

Independent and identically distributed random variables

In probability theory and statistics, a collection of random variables is independent and identically distributed if each random variable has the same probability distribution as the others and all are mutually independent. This property is usually abbreviated as i.i.d., iid, or IID. IID was first defined in statistics and finds application in d...

unkempt apex Aug 20, 2024, 7:49 PM

#

lapis sequoia Isn't this a proof that in NNs the inputs can be considered random variables ? h...

but what random variable??
it could only be bounded within datasets

unkempt apex Aug 20, 2024, 7:50 PM

#

spring field have you tried paperspace? they're pretty nice

paperspace is also nice though

lapis sequoia Aug 20, 2024, 7:50 PM

#

the random variable is the set of pixels (for images), each time for example.

spare forum Aug 20, 2024, 7:51 PM

#

Pixels are not iid

#

What you sent relates to ML with tabular data

lapis sequoia Aug 20, 2024, 7:52 PM

#

each time you draw an image it comes from the same distribution

#

i.e the training set

#

so it is iid imho

serene grail Aug 20, 2024, 7:54 PM

#

is every pixel fully independent?
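One way to poke at this question numerically, as a sketch on synthetic data only (no real images involved). It compares an array where every pixel is drawn independently with a "smooth" array where each pixel is the previous one plus a small step, which is a crude stand-in for the spatial structure of natural images. The function name and the 64x64 size are arbitrary choices for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

def neighbour_correlation(img):
    """Pearson correlation between each pixel and its right-hand neighbour."""
    left = img[:, :-1].ravel()
    right = img[:, 1:].ravel()
    return float(np.corrcoef(left, right)[0, 1])

noise = rng.normal(size=(64, 64))   # every pixel drawn independently
smooth = np.cumsum(noise, axis=1)   # each pixel = previous pixel + small step

corr_noise = neighbour_correlation(noise)
corr_smooth = neighbour_correlation(smooth)
print(f"independent pixels: {corr_noise:+.3f}")
print(f"smooth image:       {corr_smooth:+.3f}")
```

The two claims in the thread are about different things: the i.i.d. assumption is usually made about whole samples (each image drawn independently from the same distribution), which can hold even when the pixels inside one image are strongly dependent on each other.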